Gene Sros_8741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8741 
Symbol 
ID8672079 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9640133 
End bp9641866 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content70% 
IMG OID 
Productcarbohydrate ABC transporter 
Protein accessionYP_003344120 
Protein GI271969924 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAGCC GGCTGCTCCG CACCTACCTG CGGCCCTACT CGTCGACGCT GACCGCCGTG 
GTGGTGCTGC AGCTGGTCGG CACCATAGCC TCGCTGTACC TGCCCAGCCT GAACGCGGAC
ATCATCGACC GGGGCGTCGC CGTCGGCGAC ACCGGCTACA TCCTGTCCGC CGGGGGCTGG
ATGCTGGCCG TGTCCCTCGT GCAGATCGCC TGCTCGATCG CGGCGGTCTA CTACGGCGCC
CGCGCGGCGA TGGGATTCGG GCGCGACGTC CGCTCGGCCG TCTTCCACCG GGTGGGCGGG
TTCTCCGCCC GGGAGGTCGC CCAGTTCGGA GCGCCCTCGC TGATCACCCG CAGCACCAAC
GACGTGCAGC AGGTGCAGAT GCTCGTGGTG ATGACCTGCA CGATGCTGGT CGCCGCGCCG
ATCATGGGCG TCGGCGGCAT CGTCATGGCG TTGCGGCAGG ACCTCGGCCT GTCCTGGCTC
ATGCTGGTCT GCGTCCCGGC GCTGCTGGTG TCGATCGGCC TGATCATCTC GCGCATGGTC
CCCCAGTTCC GGGCGATGCA GGACCGCATC GACGTGGTCA ACCAGGTGCT GCGCGAGCAG
CTCTCCGGCA TCCGGGTCGT CCGCGCCTTC GTCCGGGAAC GCGAGGAGAC CCGCCGGTTC
GCCGCGGCGA ACGACGCGCT GACCGGGACC TCGCTGCGCG TCGGGCGGCT GACCGCGCTC
ATCTTCCCCG TCGTGATGCT GATCCTCAAC GCCTCCAGCG TCGCCGTACT GTGGTTCGGC
GCGAGCCGCG TGGACAGCGG CGAGATGCAG GTCGGCGCCC TCACCGCGTT CCTCATGTAT
CTGATGCAGA TCCTTGCCTC GATGATGATG GCCACCTTCA TCTCGATGAT GATCCCGCGA
GCCGCGGTCT GCGCCGAGCG CATCATCGAG GTGCTGGACA CCGAGTCGTC GGTGACCCCG
CCCCGGGATC CCGTACGGCA GGTGCACGGC CGCGCCGAGC TGGAGCTGCG TGACGTCGAG
TTCCGCTACC CGGGCGCGGC GGCACCGGTG CTGTCCGGCA TCTCCTTCCG CGTCACCGCC
GGGCAGACCA CCGCCGTCAT CGGCAGCACC GGGGCGGGCA AGACGACACT GGTCTCCCTG
GTGCCCAGGC TGTTCGACGC CACCTCCGGC ACGGTGTCGG TCGACGGCGT CGACGTCCGC
GACCTCGATC CGCAGATGCT CTGGACACGC ATCGGCCTGG TACCGCAGAA GCCGTACCTG
TTCAGCGGCA CCGTCGCGAG CAACCTGCGC TACGGCAACC CGGACGCCAG TGACGAGGAG
CTGTGGGAGG CGCTGGAGGT CGCCCAGGCC CGCGACTTCG TCGAGGCGAT GCCCGAGGGC
CTTGAGGCCC CCATCACGCA GGGCGGCACG AACGTGTCCG GCGGGCAGCG GCAGCGGCTC
TCGATCGCCA GGGCGCTGGT CAGCAGACCC GAGATCTACC TGTTCGACGA CTCGTTCTCG
GCCCTCGACC TGACCACCGA CGCCCGGCTC CGCGCCGCCC TGCGCCCGCA CACCGCCCAG
GCCGCCGTCG TCATCGTCGG CCAGCGGGTG TCCACCATCG CCGACGCCGA CCAGATCATC
GTCCTCGACG ACGGAGTGAT CGTCGGCATG GGGACCCACG ACGAGCTGCT GGATTCCTGC
CCGACATACA TCGAGATCGT CGAGTCCCAA CTGACCGCGG GGAGCGCAGC ATGA
 
Protein sequence
MLSRLLRTYL RPYSSTLTAV VVLQLVGTIA SLYLPSLNAD IIDRGVAVGD TGYILSAGGW 
MLAVSLVQIA CSIAAVYYGA RAAMGFGRDV RSAVFHRVGG FSAREVAQFG APSLITRSTN
DVQQVQMLVV MTCTMLVAAP IMGVGGIVMA LRQDLGLSWL MLVCVPALLV SIGLIISRMV
PQFRAMQDRI DVVNQVLREQ LSGIRVVRAF VREREETRRF AAANDALTGT SLRVGRLTAL
IFPVVMLILN ASSVAVLWFG ASRVDSGEMQ VGALTAFLMY LMQILASMMM ATFISMMIPR
AAVCAERIIE VLDTESSVTP PRDPVRQVHG RAELELRDVE FRYPGAAAPV LSGISFRVTA
GQTTAVIGST GAGKTTLVSL VPRLFDATSG TVSVDGVDVR DLDPQMLWTR IGLVPQKPYL
FSGTVASNLR YGNPDASDEE LWEALEVAQA RDFVEAMPEG LEAPITQGGT NVSGGQRQRL
SIARALVSRP EIYLFDDSFS ALDLTTDARL RAALRPHTAQ AAVVIVGQRV STIADADQII
VLDDGVIVGM GTHDELLDSC PTYIEIVESQ LTAGSAA