Gene Sros_5528 details

Gene Information

Locus tag: Sros_5528
Symbol:
ID: 8668822
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 6046555
End bp: 6048012
Gene Length: 1458 bp
Protein Length: 485 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: ABC-type sugar transport system periplasmic component-like protein
Protein accession: YP_003341025
Protein GI: 271966829
COG category:
COG ID:
TIGRFAM ID:
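
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(6046555, 6048012)
print(length)                     # 1458
print(protein_length_aa(length))  # 485
```

Both values agree with the Gene Length and Protein Length fields above.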


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.217643
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.1595
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCAAT CCACTGGTAT GCCGGGAGAG ATGAACCGGC GGCAGCTCCT GCGCCGGATC 
GGCCTGACCG CCCTGGCCGC AGGTCCGGGC GCGGGCCTGC TCAGCGCCTG CGCGACGGCC
GGAAGCGGGA GTGGCGCGGG GTCGGCTCCG GCCGCGGCCA CGCCCGCCGC GACCTCGGCG
GCCAACCCGT TCGGCGTCGA CCCCGGGAAG CCGCTCGAGG TGGTGATCTT CAACGGCGGC
AACGGCGACG GCTACGCGAC CGGGCTCCAC CAGCCGCTGT ACCGCAAGAC GTATCCGCGG
GCGGAGATCA AGCACGTCCC CACGCAGAAG ATCGGCACCC AGCTGCGCCC GCGCTTCGTC
AGCGGTGACG TGCCGGACGT GGTGAACAAC TCCGGCCCGG AGGCGCTCGA CATGGCGGCG
CTGACCGCGG AGGGTCACCT CGCCGATCTG AGCATGCTCT TCGACGCGCC GTCGGTCGAC
GACCCGGGGA AGAAGGTGCG CGACACGCTG GTCGGCGGCG CCCACGAGAA CTCGCTGATC
GACGGCGTCC CGCACGTCCT CAACTACACC GTCGCCCACC GTGCCCTGTG GTACAACGCC
AAGCTCTTCG CGGACAAGGG CTGGACGGTG CCCAGGACCT GGGACGCCTT CCTCGCACTC
GGCGAGGAGA CACGGAGGGC GGGCATCACG CTGTTCGCCT ACCCCGGCCA GGTCGGGCCG
TTCTACCAGG TCTGGAACCT CGTCTACACC GCGGCGAAGA TCGGCGGCAA CCAGGTCGTC
ATCGACATCG ACAACCTGGC GGACGGCGCC TGGACCAGCC CCGCCGTCCT GGCCTCCGTG
ACCGCGTGGG CGGACCTGCA GGCCAGGTAC GGCGACAAGT CCTACTTCGG CCTGGACCAC
ACCCAGACGC AGGTCAAACA TCTGCAGGAC AAGGTGGCCT TCTACCCCTG TGGCTCCTGG
CTGGACAACG AGATGGCCAA GGACAAGCCC GACTCCTTCG AATACGCCAT CGCCCCGGTG
CCGAGCGTGA CCGCCACGGA CAAGATGCCC GCGGAGGCCA TCATGGTGGG GGTGAGCGAG
GCGTTCTTCG TCTCGGCCAA GGGCGGCAAC CTCGCGGGCG GCCTGGAGTA CCTGCGGATC
ATGCTCTCCA GGGAGGGGGC TCGCGGATAC ACCGAGAGAA CCAAGAACCT CACCGTGGTC
AACGGCGTCG CCGAGGGCCT CGACCTCCCT CCCGCCGTAC GGAGCGCCGC CAGGGCTCAG
GACGCGGCGG GCAGGAACAC CATCACCGAC GCGCGGTTCG AGAGCTGGTA CAAGGAGCTG
TTCGACTACT CCCAGACCCA GACGAACGCG GTCATGGCAG GCCGGGCGAC GCCCGAGCAG
TTCTGCGCGA ACATGCAGAA GAAGGCCGAC GAGATCAAGA AGGACCCGTC GGTCACCAAG
CAGACTCGGA GCGTCTAG
 
Protein sequence
MSQSTGMPGE MNRRQLLRRI GLTALAAGPG AGLLSACATA GSGSGAGSAP AAATPAATSA 
ANPFGVDPGK PLEVVIFNGG NGDGYATGLH QPLYRKTYPR AEIKHVPTQK IGTQLRPRFV
SGDVPDVVNN SGPEALDMAA LTAEGHLADL SMLFDAPSVD DPGKKVRDTL VGGAHENSLI
DGVPHVLNYT VAHRALWYNA KLFADKGWTV PRTWDAFLAL GEETRRAGIT LFAYPGQVGP
FYQVWNLVYT AAKIGGNQVV IDIDNLADGA WTSPAVLASV TAWADLQARY GDKSYFGLDH
TQTQVKHLQD KVAFYPCGSW LDNEMAKDKP DSFEYAIAPV PSVTATDKMP AEAIMVGVSE
AFFVSAKGGN LAGGLEYLRI MLSREGARGY TERTKNLTVV NGVAEGLDLP PAVRSAARAQ
DAAGRNTITD ARFESWYKEL FDYSQTQTNA VMAGRATPEQ FCANMQKKAD EIKKDPSVTK
QTRSV
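
The sequence blocks above are printed in groups of ten characters, so they need the spaces and line breaks removed before use. The following is a minimal sketch of doing that, computing a GC fraction, and translating codons. It is not the database's own method; the helper names are hypothetical. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs from the standard code only in which codons may act as starts). Only the first line of the gene sequence is used as input here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a formatted sequence block."""
    return "".join(seq_block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = clean(
    "ATGTCCCAAT CCACTGGTAT GCCGGGAGAG ATGAACCGGC GGCAGCTCCT GCGCCGGATC"
)
print(translate(first_line))  # MSQSTGMPGEMNRRQLLRRI
```

The output matches the first 20 residues of the protein sequence above. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare with the GC content field.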