Gene Sros_1625 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1625 
Symbol 
ID8664902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1731669 
End bp1732997 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content69% 
IMG OID 
Productalpha-glucoside ABC transporter periplasmic- binding protein 
Protein accessionYP_003337359 
Protein GI271963163 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.234885 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.190993 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAAAA CCATCGCGAC AGTGACGACG GCGGGCCTGG CACTCGCCCT CGCCGCGTGC 
GGCCAGTCCT CCGAGACCGG CGCCTCCCCC GCCGCGAGCA GCAGCTCCTC CGCCGCCGCC
CCGGCCGCCA AGACGCTTGA GGGCGTGACC ATCGAGGTCG CCGCCAAGTG GACCGGCGCC
GAGCAGACCA ACTTCCAGGA AGTGCTCAAG GCGTTCGAGG CCAAGACCGG CGCCAAGGTC
ACCTACGCCT CCACCGGCGA GGACACCGGC GCCTACCTCG GCCCGCGCAT CCAGGGCGGC
AACCCGCCGG ACATCGCGAT CCTCCCCCAG CCGGGCCTGG TCCAGCAGTA CGCCGACCAG
AAGGCGCTCA AGCCCCTCGC CCCCGAGGTG CTCAAGCAGA TCGACGACAA CTACACCCCG
TACTGGAAGG AGCTCGGCTC CGCCGACGGC CAGGCCTACG GCGTGCTGGT GAAGGCGGCC
CACAAGTCGC TCATCTGGTA CCGCGACCAG GCCTTCCAGG ACGCCGGGGC GCAGCCGCCG
ACCACCTGGG ACGAGCTCGT CAAGACCGCC CAGGCCGTCG CCGACTCCGG CACCCCGCCC
TTCTCCCTCT GCGGCGCGTC CGGCTGGACC CTGACCGACC TGTTCGAGAA CGTCTACCTG
TCCAGCGCGG GCCCGGAGAA CTACACCAAG CTCTCCAAGC ACGAGATCCC GTGGACCGAC
GCCAGCGTGA CCACCGCGCT GGAGAAGATC GGGCAGCTCG TCGGCAAGAA GGAGTTCCTG
CTCGGCGGCT CCTCCGGCGC CCTGCAGACC GACTTCCCGA CCTGCGTGAC CCAGGTCTAC
GGCCAGGACA AGTCGGCGAT GGTCATCGAG GCGGACTTCG TGGCCACCAC CGCCGAGGAG
TCCGGCGCGA AGCTCGGCGA GGAGGCCAAG TACTTCGCGT TCCCGAAGGC CGGCGACACC
GAGCCGGTCG TGCTGGGCGG CGACATCGCG GTGGTGCTGA AGGAATCCAA GGGCGCGATG
GCGCTGCTGG AGTTCCTCGC CTCCAAGGAG GGCGGCGAGA TCTGGGCGAA GCTCCCGGGC
TACCTGTCCC CCAACCGCAA CGTCTCTCCG GACAACTACC CGGCCGAGCT GACCAAGAAG
CTCGCCCAGA CGATCATCTC CGCCGGTGAC GCCGTCCGCT ACGACATGTC CGACCTGGCG
CCCAGCGCCT TCGGCGGCAC CGACGGCAAG GGTCAGTGGA AGCTCCTGCA GGACTTCGTC
CGCGACCCGT CCAAGATCAA GGACATCCAG TCCAAGCTTG AGGACGAGGC CAAGAAGGCC
TGGAAGTAA
 
Protein sequence
MRKTIATVTT AGLALALAAC GQSSETGASP AASSSSSAAA PAAKTLEGVT IEVAAKWTGA 
EQTNFQEVLK AFEAKTGAKV TYASTGEDTG AYLGPRIQGG NPPDIAILPQ PGLVQQYADQ
KALKPLAPEV LKQIDDNYTP YWKELGSADG QAYGVLVKAA HKSLIWYRDQ AFQDAGAQPP
TTWDELVKTA QAVADSGTPP FSLCGASGWT LTDLFENVYL SSAGPENYTK LSKHEIPWTD
ASVTTALEKI GQLVGKKEFL LGGSSGALQT DFPTCVTQVY GQDKSAMVIE ADFVATTAEE
SGAKLGEEAK YFAFPKAGDT EPVVLGGDIA VVLKESKGAM ALLEFLASKE GGEIWAKLPG
YLSPNRNVSP DNYPAELTKK LAQTIISAGD AVRYDMSDLA PSAFGGTDGK GQWKLLQDFV
RDPSKIKDIQ SKLEDEAKKA WK