Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_3787 |
Symbol | |
ID | 8667077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 4216217 |
End bp | 4219054 |
Gene Length | 2838 bp |
Protein Length | 945 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Glucose/sorbosone dehydrogenase-like protein |
Protein accession | YP_003339451 |
Protein GI | 271965255 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.339008 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACCCC GTTCTCGGCC CTGGATCAGG ACGCTCGCCA CGGCGTTGCT GGTCACGGCC GGCAGCCTGG CGATCCCCAC GGCCCACGCG GCCCGGACCG CTCCCGCGGC TCAGGCGACG GCCGCCATCC CCCCGTCCGA CTACCAGCAG GTCCAGCTCG CCGTCGGCGC GGCCAAGCTC GGCGAGGCGA TGTCCCTGGC GGTGCTGCCC GACCGGTCGG TGGTCCACAC CGCCCGCAAC GGCACGGTCC GCGTCACCGA CGCGTCGGGC ACCACCAAGG TCGCCGGCAC CCTGAACGTC TACACGCACG ACGAGGAAGG GCTGCAGGGC GTCGCGGCGG ACCCCGGCTT CGCCACGAAC CGCTACATCT ACCTGTACTA CTCGCCGAAG CTCGCCACCC CGGCGGGCGA CGCCCCGAAG ACGGGCACCG AGGCCGACTT CGCCGCCTGG AAGGGCCACC TCAACCTGTC GCGGTTCGTG CTGAGGACCG ACGGCACGCT CGACCTCGCC AGCGAGAAGG TCGTCCTCGA AGTCCCCAAC GACCGGGGGC AGTGCTGCCA CGTCGGCGGC GACATCGACT TCGACGCGGC CGGGAACCTG TATCTCACCA CCGGTGACGA CACCAACCCG TTCGAGTCCG GCTCCTTCAC GCCCATCGAC GAGCGGACCG ACCGCAACCC GCAGTTCGAC GCCCAGCGTT CCTCGGGCAA CACCAACGAC CTGCGCGGCA AGGTCCTGCG GATCAGACCG ACCGCGGCCG GCGGCTACAC CGTCCCCTCC GGCAACCTCT TCGCGCCCGG GACCGCCGGG ACCCGGCCGG AGATCTACGC GATGGGCTTC CGCAACCCGT TCCGGATGTC GGTCGACAAG GCCACCGGCG TCGTCTACCT GGGCGACTAC GGTCCCGACG CGGGTTCGGG CGACGCCAAC CGCGGCCCCG GCGGCCAGGT GGAGTTCACC CGGATCACCG GGCCGGGCAA CTACGGCTGG CCGTACTGCA CCGGCACCAA CACCCCCGCC GAGACCTACA ACGAGTTCAC CTTCCCCGAC GGCCCGTCCG GCGCGAAGTA CGACTGCGCG GGCGGCCCGG CGAACAACTC CTTCCGCAAC ACCGGCCTGG CCAGGCTCCC CGCGGCCAAG CCGAGCTGGA TCAAGTACGG CGACGCCGGC TCACCGCCGG AGTTCGGCGG CGGCTCGGAG TCGCCGATGG GCGGGCCGGT CTACCGCTAC GACGCGAACC TCGACTCCGC CGTCAAGTTC CCCGCCTCGC TGAACGGCCG CTACTTCGCC GGCGAGTACG GCAGGCGCTG GATCAAGGCG ATCGAGGTCA AGGCCGACGG CTCCCCCGGC GAGATCGCGG CGTTCCCTTG GACGGGCACC CAGGTCATGG ACATGGCCTT CGGCCCGGAC GGCGCGCTGT ACGTGCTGGA CTACGGAACC GGCAGCGACA ACCAGGCCCT CTACCGGGTC GAGCACATCG GCGGCACCAA CCGCAACCCC GTCGCCAAGG TGACCGCGGA CAGGACCTCG GGTCCGAACC CGCTGGCCGT CGCCTTCTCC TCGGCCGGCA GCTCCGATCC CGAGGGCGGC GCCCTCACCT ACTCGTGGAG GTTCGGCGAC GGCGGGACGT CCACCCAGGC CAACCCGTCC CACACCTACA CCGCCAACGG CACCTACACG CCGACCCTGA CGGTCACCGA TCCGACCGGG CTGACCGGCA CCGCGAGCGT CATCGTGACG GTCGGCAACA GCGCTCCGTC GGTGTCGCTC GCCTCCCCCG GCGACGGCCG GCCCTTCGCC TTCGGCGACA CCGTCCCCTT CCAGGTCAAC GTCTCCGACC CGGAGGACGG CGCCGTCGAC TGCGCCAAGG TGAAGGTCAC CTACCTGCTG GGCCACGACA GCCACCGCCA CGCGATCACC TCCAGGAACG GCTGCTCCGG GAGCATCGCG GTGCCGGTCG ACGGTGAGCA CGACGCCGCG GCCAACATCT ACGGCGTCTT CGACGCGGAG TACACCGACG CCGGCGGCCT GACCACGCAC AGCGTCCGCG TGCTGCAGCC CCGGCACAGG CAGGCCGAGC ACTTCGGCGC GCAGTCCGGA ATCCAGCCGG CCGACCACAC CGCGGCGGAG GGCGCCAGGA CGGCCGGGTT CATCGACAAC GGCGACTGGA TCTCCTTCCA GCCGTACGTG CTGTCCGGCG TCAGGAGCGC GTCCTTCCGG GTCTCCTCGG CCGGGGCGGG AGGGACCATC GAGGTGCGGG CGGGCTCGGC GACCGGCACC CTGCTCGGCA CGGCCGCCGT ACCGGTCACC GGTAGCTGGG AGACCTTCAC CGACGTGACC GCGAGCATCT CCGGCGCGCC CGCCGGGAGC ACCACGCTGT TCCTGGTGTT CAAGGGCCCG ACCGGCGCGG GCAACCTGTT CGACGTGGAC GCCTTCACCC TCGTGACCGC GGCCGGCACG ACGGCCGAGG CGGAGTCCTA CACCTCCACC TCCGGCGTGC AGATCGCCGA CCACGCCCCC GCCAGCGGCG GCAGGACCGC CGGATACATC AACAACGGCG ACTGGACCGG CTACTCCACC ATCACCACCA CCGGCGCCAC CGCCTTCAGC GCCCGCATCT CCTCCGCCGG ACCCGGCGGC ACCATCCAGA TCCGCTCCGG ATCGGCCACC GGCGCCCTCC TCGGCACGGT CACCGTACCC ACCACCGGAG GCTGGGAGAC CTTCCAGAAC GTCACCACCC CCCTGACCGC CTCCGCCACC GGCCCCCTCT TCCTCGTCTA CACCGGCACC GGCACCGGCT TCCTGTTCGA CGTCGACACC TTCACCCTCA CCAGGTAG
|
Protein sequence | MSPRSRPWIR TLATALLVTA GSLAIPTAHA ARTAPAAQAT AAIPPSDYQQ VQLAVGAAKL GEAMSLAVLP DRSVVHTARN GTVRVTDASG TTKVAGTLNV YTHDEEGLQG VAADPGFATN RYIYLYYSPK LATPAGDAPK TGTEADFAAW KGHLNLSRFV LRTDGTLDLA SEKVVLEVPN DRGQCCHVGG DIDFDAAGNL YLTTGDDTNP FESGSFTPID ERTDRNPQFD AQRSSGNTND LRGKVLRIRP TAAGGYTVPS GNLFAPGTAG TRPEIYAMGF RNPFRMSVDK ATGVVYLGDY GPDAGSGDAN RGPGGQVEFT RITGPGNYGW PYCTGTNTPA ETYNEFTFPD GPSGAKYDCA GGPANNSFRN TGLARLPAAK PSWIKYGDAG SPPEFGGGSE SPMGGPVYRY DANLDSAVKF PASLNGRYFA GEYGRRWIKA IEVKADGSPG EIAAFPWTGT QVMDMAFGPD GALYVLDYGT GSDNQALYRV EHIGGTNRNP VAKVTADRTS GPNPLAVAFS SAGSSDPEGG ALTYSWRFGD GGTSTQANPS HTYTANGTYT PTLTVTDPTG LTGTASVIVT VGNSAPSVSL ASPGDGRPFA FGDTVPFQVN VSDPEDGAVD CAKVKVTYLL GHDSHRHAIT SRNGCSGSIA VPVDGEHDAA ANIYGVFDAE YTDAGGLTTH SVRVLQPRHR QAEHFGAQSG IQPADHTAAE GARTAGFIDN GDWISFQPYV LSGVRSASFR VSSAGAGGTI EVRAGSATGT LLGTAAVPVT GSWETFTDVT ASISGAPAGS TTLFLVFKGP TGAGNLFDVD AFTLVTAAGT TAEAESYTST SGVQIADHAP ASGGRTAGYI NNGDWTGYST ITTTGATAFS ARISSAGPGG TIQIRSGSAT GALLGTVTVP TTGGWETFQN VTTPLTASAT GPLFLVYTGT GTGFLFDVDT FTLTR
|
| |