Gene Sros_0702 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_0702 
Symbol 
ID8663972 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp716541 
End bp717758 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content68% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003336468 
Protein GI271962272 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.554394 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGGGA ACGCTTTCGA TCCGATCGAG ATTCCGCCAT GGGTCTGGGA TCACGATGAA 
AGCAGCCAAA TCCTCAAGAA TCGTGACGTG GTGGGGCTTT TCAATCTCGC TGTCAAGTAC
AACGGGGCGA GCCAAACGCG CATCAGCGCA GCGACCGGAA TAGCCCAAGG CCACGTAAGC
CTGATCATCC GAGGGCAGCG CCAGGTCACC GACCTGGAGG TCTACGAACG GATCGCCACC
GGACTCGGCC TGCCCGACCA CGCCCGCATG CTCTTCGGCC TGGCCCCACT CGACACCGCG
TCCCCCACCG GAGACCACGG TGACAACCAC CAGGAGCAGG CCGACGAGCT GACGGCCCGG
ATCGAAGCGG CCGCCGCCAT CGACCCCACC ATGGTCATGA TCCTCACCAC CGACACCAAC
AACCTGCGTC TCCTGGACCG CCGACTCGGT GGAGTCGCCA TCGCCGACAA GATGCGCGCC
CAGATCTCCC AAGTAGAACG CGCCCACCGG CACGCCGTAC GGCCCGGCAT CCGCGCCCAG
CTCGCCCACG TGCTCGCCGA GACCTCATCT CTGGCCGGAT GGCAGGCCAT CGACACCGGC
GCCCTGAACG ATGCGTGGAA TCACTACGAG CACGCCAAAG CCGCCGCCCG CGAAGCCGAC
GATCCCGCGG TGCTCGCCTA CGTGTCCGCC GAACAGGCCT ACGTCCTCAT GGAGTTGGGG
CGGCCGGCCG AAGCCACCGA GCTACTCCAG CACATCCACA CCACCCACCG TGAACGTCTC
CCCGGCCGGC TACGGACATG GCTGTCGGCA GCCGAGGCCG AAGCCGCCGC GATCCTCGGC
GACGAGACCA CCTGCCGCAC GGCACTCGAC CAGGCCGCCG CGCTTCTACC GGAGGGAGCC
GCCGACGTGA GCATGCCCTA CCTCTCATTG GACGCCCACC ACCTCGCCCG CTGGCGCGGC
AACTGCCTGG TGCGCTTCGG CGACCCCGGC ACCGTCGAAG ACCTCCGCTC AGCCTTGGCC
GGGATGGATG GCACCTACAA CCGCGCCGAA TCCGGGGTCC GCTGCGACCT GGGCCACGCC
CTTCTCGCCA GAGGAGAAGC CGATGCGGCC CAACCTCACA TCCAGCGTGC CCAGCAGCTC
GCCACCATGA CCGGCTCCCG TCGCCAGCGC AAGCGCATCG AGGAGCTCGC CCGAGCGGTT
ACGCGCTCAC TCCGTTAA
 
Protein sequence
MHGNAFDPIE IPPWVWDHDE SSQILKNRDV VGLFNLAVKY NGASQTRISA ATGIAQGHVS 
LIIRGQRQVT DLEVYERIAT GLGLPDHARM LFGLAPLDTA SPTGDHGDNH QEQADELTAR
IEAAAAIDPT MVMILTTDTN NLRLLDRRLG GVAIADKMRA QISQVERAHR HAVRPGIRAQ
LAHVLAETSS LAGWQAIDTG ALNDAWNHYE HAKAAAREAD DPAVLAYVSA EQAYVLMELG
RPAEATELLQ HIHTTHRERL PGRLRTWLSA AEAEAAAILG DETTCRTALD QAAALLPEGA
ADVSMPYLSL DAHHLARWRG NCLVRFGDPG TVEDLRSALA GMDGTYNRAE SGVRCDLGHA
LLARGEADAA QPHIQRAQQL ATMTGSRRQR KRIEELARAV TRSLR