Gene Sros_3787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3787 
Symbol 
ID8667077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4216217 
End bp4219054 
Gene Length2838 bp 
Protein Length945 aa 
Translation table11 
GC content72% 
IMG OID 
ProductGlucose/sorbosone dehydrogenase-like protein 
Protein accessionYP_003339451 
Protein GI271965255 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.339008 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACCCC GTTCTCGGCC CTGGATCAGG ACGCTCGCCA CGGCGTTGCT GGTCACGGCC 
GGCAGCCTGG CGATCCCCAC GGCCCACGCG GCCCGGACCG CTCCCGCGGC TCAGGCGACG
GCCGCCATCC CCCCGTCCGA CTACCAGCAG GTCCAGCTCG CCGTCGGCGC GGCCAAGCTC
GGCGAGGCGA TGTCCCTGGC GGTGCTGCCC GACCGGTCGG TGGTCCACAC CGCCCGCAAC
GGCACGGTCC GCGTCACCGA CGCGTCGGGC ACCACCAAGG TCGCCGGCAC CCTGAACGTC
TACACGCACG ACGAGGAAGG GCTGCAGGGC GTCGCGGCGG ACCCCGGCTT CGCCACGAAC
CGCTACATCT ACCTGTACTA CTCGCCGAAG CTCGCCACCC CGGCGGGCGA CGCCCCGAAG
ACGGGCACCG AGGCCGACTT CGCCGCCTGG AAGGGCCACC TCAACCTGTC GCGGTTCGTG
CTGAGGACCG ACGGCACGCT CGACCTCGCC AGCGAGAAGG TCGTCCTCGA AGTCCCCAAC
GACCGGGGGC AGTGCTGCCA CGTCGGCGGC GACATCGACT TCGACGCGGC CGGGAACCTG
TATCTCACCA CCGGTGACGA CACCAACCCG TTCGAGTCCG GCTCCTTCAC GCCCATCGAC
GAGCGGACCG ACCGCAACCC GCAGTTCGAC GCCCAGCGTT CCTCGGGCAA CACCAACGAC
CTGCGCGGCA AGGTCCTGCG GATCAGACCG ACCGCGGCCG GCGGCTACAC CGTCCCCTCC
GGCAACCTCT TCGCGCCCGG GACCGCCGGG ACCCGGCCGG AGATCTACGC GATGGGCTTC
CGCAACCCGT TCCGGATGTC GGTCGACAAG GCCACCGGCG TCGTCTACCT GGGCGACTAC
GGTCCCGACG CGGGTTCGGG CGACGCCAAC CGCGGCCCCG GCGGCCAGGT GGAGTTCACC
CGGATCACCG GGCCGGGCAA CTACGGCTGG CCGTACTGCA CCGGCACCAA CACCCCCGCC
GAGACCTACA ACGAGTTCAC CTTCCCCGAC GGCCCGTCCG GCGCGAAGTA CGACTGCGCG
GGCGGCCCGG CGAACAACTC CTTCCGCAAC ACCGGCCTGG CCAGGCTCCC CGCGGCCAAG
CCGAGCTGGA TCAAGTACGG CGACGCCGGC TCACCGCCGG AGTTCGGCGG CGGCTCGGAG
TCGCCGATGG GCGGGCCGGT CTACCGCTAC GACGCGAACC TCGACTCCGC CGTCAAGTTC
CCCGCCTCGC TGAACGGCCG CTACTTCGCC GGCGAGTACG GCAGGCGCTG GATCAAGGCG
ATCGAGGTCA AGGCCGACGG CTCCCCCGGC GAGATCGCGG CGTTCCCTTG GACGGGCACC
CAGGTCATGG ACATGGCCTT CGGCCCGGAC GGCGCGCTGT ACGTGCTGGA CTACGGAACC
GGCAGCGACA ACCAGGCCCT CTACCGGGTC GAGCACATCG GCGGCACCAA CCGCAACCCC
GTCGCCAAGG TGACCGCGGA CAGGACCTCG GGTCCGAACC CGCTGGCCGT CGCCTTCTCC
TCGGCCGGCA GCTCCGATCC CGAGGGCGGC GCCCTCACCT ACTCGTGGAG GTTCGGCGAC
GGCGGGACGT CCACCCAGGC CAACCCGTCC CACACCTACA CCGCCAACGG CACCTACACG
CCGACCCTGA CGGTCACCGA TCCGACCGGG CTGACCGGCA CCGCGAGCGT CATCGTGACG
GTCGGCAACA GCGCTCCGTC GGTGTCGCTC GCCTCCCCCG GCGACGGCCG GCCCTTCGCC
TTCGGCGACA CCGTCCCCTT CCAGGTCAAC GTCTCCGACC CGGAGGACGG CGCCGTCGAC
TGCGCCAAGG TGAAGGTCAC CTACCTGCTG GGCCACGACA GCCACCGCCA CGCGATCACC
TCCAGGAACG GCTGCTCCGG GAGCATCGCG GTGCCGGTCG ACGGTGAGCA CGACGCCGCG
GCCAACATCT ACGGCGTCTT CGACGCGGAG TACACCGACG CCGGCGGCCT GACCACGCAC
AGCGTCCGCG TGCTGCAGCC CCGGCACAGG CAGGCCGAGC ACTTCGGCGC GCAGTCCGGA
ATCCAGCCGG CCGACCACAC CGCGGCGGAG GGCGCCAGGA CGGCCGGGTT CATCGACAAC
GGCGACTGGA TCTCCTTCCA GCCGTACGTG CTGTCCGGCG TCAGGAGCGC GTCCTTCCGG
GTCTCCTCGG CCGGGGCGGG AGGGACCATC GAGGTGCGGG CGGGCTCGGC GACCGGCACC
CTGCTCGGCA CGGCCGCCGT ACCGGTCACC GGTAGCTGGG AGACCTTCAC CGACGTGACC
GCGAGCATCT CCGGCGCGCC CGCCGGGAGC ACCACGCTGT TCCTGGTGTT CAAGGGCCCG
ACCGGCGCGG GCAACCTGTT CGACGTGGAC GCCTTCACCC TCGTGACCGC GGCCGGCACG
ACGGCCGAGG CGGAGTCCTA CACCTCCACC TCCGGCGTGC AGATCGCCGA CCACGCCCCC
GCCAGCGGCG GCAGGACCGC CGGATACATC AACAACGGCG ACTGGACCGG CTACTCCACC
ATCACCACCA CCGGCGCCAC CGCCTTCAGC GCCCGCATCT CCTCCGCCGG ACCCGGCGGC
ACCATCCAGA TCCGCTCCGG ATCGGCCACC GGCGCCCTCC TCGGCACGGT CACCGTACCC
ACCACCGGAG GCTGGGAGAC CTTCCAGAAC GTCACCACCC CCCTGACCGC CTCCGCCACC
GGCCCCCTCT TCCTCGTCTA CACCGGCACC GGCACCGGCT TCCTGTTCGA CGTCGACACC
TTCACCCTCA CCAGGTAG
 
Protein sequence
MSPRSRPWIR TLATALLVTA GSLAIPTAHA ARTAPAAQAT AAIPPSDYQQ VQLAVGAAKL 
GEAMSLAVLP DRSVVHTARN GTVRVTDASG TTKVAGTLNV YTHDEEGLQG VAADPGFATN
RYIYLYYSPK LATPAGDAPK TGTEADFAAW KGHLNLSRFV LRTDGTLDLA SEKVVLEVPN
DRGQCCHVGG DIDFDAAGNL YLTTGDDTNP FESGSFTPID ERTDRNPQFD AQRSSGNTND
LRGKVLRIRP TAAGGYTVPS GNLFAPGTAG TRPEIYAMGF RNPFRMSVDK ATGVVYLGDY
GPDAGSGDAN RGPGGQVEFT RITGPGNYGW PYCTGTNTPA ETYNEFTFPD GPSGAKYDCA
GGPANNSFRN TGLARLPAAK PSWIKYGDAG SPPEFGGGSE SPMGGPVYRY DANLDSAVKF
PASLNGRYFA GEYGRRWIKA IEVKADGSPG EIAAFPWTGT QVMDMAFGPD GALYVLDYGT
GSDNQALYRV EHIGGTNRNP VAKVTADRTS GPNPLAVAFS SAGSSDPEGG ALTYSWRFGD
GGTSTQANPS HTYTANGTYT PTLTVTDPTG LTGTASVIVT VGNSAPSVSL ASPGDGRPFA
FGDTVPFQVN VSDPEDGAVD CAKVKVTYLL GHDSHRHAIT SRNGCSGSIA VPVDGEHDAA
ANIYGVFDAE YTDAGGLTTH SVRVLQPRHR QAEHFGAQSG IQPADHTAAE GARTAGFIDN
GDWISFQPYV LSGVRSASFR VSSAGAGGTI EVRAGSATGT LLGTAAVPVT GSWETFTDVT
ASISGAPAGS TTLFLVFKGP TGAGNLFDVD AFTLVTAAGT TAEAESYTST SGVQIADHAP
ASGGRTAGYI NNGDWTGYST ITTTGATAFS ARISSAGPGG TIQIRSGSAT GALLGTVTVP
TTGGWETFQN VTTPLTASAT GPLFLVYTGT GTGFLFDVDT FTLTR