Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_1110 |
Symbol | |
ID | 4895155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 1147610 |
End bp | 1150918 |
Gene Length | 3309 bp |
Protein Length | 1102 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640111696 |
Product | trehalose synthase |
Protein accession | YP_001042992 |
Protein GI | 126461878 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02456] trehalose synthase [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.531245 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCAGA ACGTCTCTCC CTCCACCGTT GCCGTCCGCC GCACGGACGG CATCGATCGC AGCCAGACCG ACTGGTACAA GGACGCGATC ATCTATCAGC TGCACATCAA GGCGTTCCAG GACGCCAATG GCGACGGGAT CGGTGACTTC GCGGGCCTGA TGCAGCGGCT GGATTATGTG CAGGCGCTGG GCGTGACCGC GATCTGGCTG CTGCCTTTCT ATCCCTCGCC GCTCCGCGAC GACGGCTACG ACATCTCGGA CTATCGCTCG ATCAACCCGT CCTACGGCGC GATGCGCGAC TTCAAGCTGT TCGTGCAGGA GGCGCACAAG CGCGGGCTGC GCGTCATCAC CGAGCTCGTC ATCAACCACA CCTCCGACCA GCATCCCTGG TTCCAGCGGG CCCGGCGGGC GAAGAAGGGA TCGGCCGCGC GCGACTGGTA TGTCTGGAGC GATACCGACC AGAAATTCCC CGAAACGCGG ATCATCTTCC TCGATACGGA AAAGTCGAAC TGGACCTGGG ATCCGGTGGC AGGCGCCTAT TACTGGCACC GCTTCTATTC GCACCAGCCC GACCTGAACT TCGACAATCC CCGTGTGCTC GAGGAAGTGC TCAAGATCAT GCGCATGTGG CTCGAGATGG GCGTCGACGG GCTCCGGCTC GACGCGATCC CCTATCTGGT CGAGCGCGAG GGCACCAACA ACGAGAACCT CCCCGAGACG CACGATGTTC TCAAGAAGAT CCGGGCCAAC CTCGACCAGC ATTTCCCCGA CAGGATGCTG CTCGCCGAGG CGAACCAGTG GCCGGAGGAC ACGCGCCCCT ATTTCGGCGA GGGCGACGAA TGCCACATGG GCTTCCATTT CCCGCTGATG CCGCGGATGT ACATGGCGCT GGCACAGGCA GACCGGCATC CGATCACCGA CATCATCCGC CAGACGCCCG AGATCCCGGA AGGCTGCCAA TGGGGCATCT TCCTCCGGAA CCACGACGAG CTGACGCTCG AGATGGTGAC GGCCGAAGAG CGCGATTACA TGTGGCGCTT CTATGCCGAG GATTCTCGGG CGCGGATCAA TCTCGGCATC CGGCGCAGGC TGGCGCCGCT GATGAAGAAC GACCGGCGCA AGATCGAGCT TCTGAACCAG ATGCTGATGT CGATGCCCGG CACGCCCATC GTCTATTACG GCGACGAGAT TGGCATGGGC GACAACTATT ATCTCGGCGA CCGGGATGGT GTGCGGACGC CCATGCAATG GTCGGCCGAC CGCAACGGCG GCTTCTCGCG CTGCAATCCG CAGCAGCTCT ACCTGCCGAC CATCCTCGAT CCGGTCTATG GCCATCAGGC GATCAACGTC GAGGCGCAGG CGGCGGACCC GTCGTCGCTG CTGAACTGGA CGCGCCGCCT GATCGCGGTG CGCAAACAGC ATCCGGCCTT CGGGCGCGGC TCGATGAGCC TGCTCTATCC GCGCAACCGC AAGGTGCTGG CCTATGTCCG CAGCCACGAG GGCACAGACA TCCTCTGCGT GGCGAACCTC TCGGACACGG CGCAGGCGGT CGAGCTGGAT CTCGCCCGGT TCCGCGGCGC GGTGCCGGTC GAGCTTACGG GCCGGTCGGA GTTTCCGCCG GTGGGCGATC TGCCCTACAT GCTGACGCTG CCGCCCTACG GCTTCTACTG GTTCATCCTC TCCGACCAGC AGGCGCTGCC GAGCTGGCAC CAGCCGATGC CCGAGACGCT GCCGGATTTC ATCACGCTCA CGACGCGCGA CGGGCGGGCC GAGACGGCGC TCACCGGGCG CGAGACGCGC CAGCTCGAGG CCGACGTGCT GCCGAACTGG CTGCCGCTGC AACGCTGGTT CGGCGCCAAG GAAGAAAAGA TCAGCGCCGT GAAGCTGGCC GTTCTGGGCT CGCTGTCGGC CGATCATGCC CTCGTCCGGC TCGAGGCCGA TGTGGGCGGC GAGGTGCAGC AGTATTTCCT GCCCGCCTCC GCGCTCTGGG GCGAGGAGCA GCTCCGGGCC GGGGCGCCCA AGCTGAGCTT CACGCTGGCC AAGGTCCGGC GCGGGCCGCA GGTGGGCGCG CTCATCGACG GGGCCTATGA CGAGCAGATG GCGCAGGACA TGCTCGAGGC GCTGCGCGAC AGTCGCAAAC TGTCGGGTGC CGGGGGCGAG GTGGTGTTCG AGCCGGGCTC GGGACTGGCC GAGATCGCCG ACCCGGGCGA GCCGCGCTAT CTCGGGGCCG AGCAGTCGAA CATCTCGATC GCCTTCGGCG ACCGCATGAT CCTGAAGCTC TACCGCCGCT TGCGGGCGGG CGAGCAGCCG GACGTCGAGG TGGCGCGCTT CCTGACGGAA GTGGCGGGCT ACACCCATAC GCCCCGCTAT CTCGGCGTCG TGAGCCTGCG TCCGGCGGAG GGCGAGGCGA CCGTCCTCGC CGCGGCCTTC GCCTTCGTGG CCAACATGGG CGACGCCTGG CGCGGCCTCT TCGACGCGCT GGTGCGGGAT CTGGGCGAGT TCGGCGGCTG GAGCACGGCC GAGGCGCCGG AAGTGGAGAA GGACGGCTTC TCCTTCCCGC TCACCATTCC GGGCCTTCTC GGCCAGCGCA CGGCCGAGCT GCATCAGGCC TTCGCCACCG AGACGGACGA GGCGGCCTTC CGGATGGAGC CGCTCACGGC CCAGCATCTG ACCGGCTGGG CCGAGGGGGC GCGGCGCGAG GCCGAGGCCA TGCTGTCGGT GCTCGAACGG CAGCGGACGC ACCTGCCCGA CGAGGTGCTG CCCCTGGCCG AGGCGCTGCT CGCGCGGCGC GACGACCTGC TGGCGCGGCT CGACCGCGTC ACCGGCTTCG AACCCTCGGG CGCGCTGACC CGCATCCATG GCGACTACCA TCTGGGGCAG GTGCTGCTTG CGCAGGACGA TGTCGCCATC ATCGACTTCG AGGGCGAACC GCGCCGCACG CTTGCAGAGC GCCGCGAGAA GTCCTCGCCG CTCCGCGACG TGGCGGGGAT GCTGCGCTCG TTTGACTATG CCGCCGCCGC CGCCCTCGCA CGTCACGAGG AGAGCTTCGG CCCGGCGAGC GAGCGGGCGG TGGAGCGGGC CGAGGCCTGG CGGCAACAGG CGGTGGCCGA TTTCCTTGCC GCTTACGAGG GCGCGTCCGC GGGCACCGCG AGCCTGCCCT CCGACCCGGC GCTGAAGGAG GCGCTGCTCG ATCTCTTCCT CGTGCAGAAG GCGGTCTACG AGACCTCCTA CGAATTGGGC AGCCGTCCCG CCTGGGTCGG CATCCCGCTG CGCGGGCTTC TGGAACTGCT GGACAGGAAA CCTGCATGA
|
Protein sequence | MNQNVSPSTV AVRRTDGIDR SQTDWYKDAI IYQLHIKAFQ DANGDGIGDF AGLMQRLDYV QALGVTAIWL LPFYPSPLRD DGYDISDYRS INPSYGAMRD FKLFVQEAHK RGLRVITELV INHTSDQHPW FQRARRAKKG SAARDWYVWS DTDQKFPETR IIFLDTEKSN WTWDPVAGAY YWHRFYSHQP DLNFDNPRVL EEVLKIMRMW LEMGVDGLRL DAIPYLVERE GTNNENLPET HDVLKKIRAN LDQHFPDRML LAEANQWPED TRPYFGEGDE CHMGFHFPLM PRMYMALAQA DRHPITDIIR QTPEIPEGCQ WGIFLRNHDE LTLEMVTAEE RDYMWRFYAE DSRARINLGI RRRLAPLMKN DRRKIELLNQ MLMSMPGTPI VYYGDEIGMG DNYYLGDRDG VRTPMQWSAD RNGGFSRCNP QQLYLPTILD PVYGHQAINV EAQAADPSSL LNWTRRLIAV RKQHPAFGRG SMSLLYPRNR KVLAYVRSHE GTDILCVANL SDTAQAVELD LARFRGAVPV ELTGRSEFPP VGDLPYMLTL PPYGFYWFIL SDQQALPSWH QPMPETLPDF ITLTTRDGRA ETALTGRETR QLEADVLPNW LPLQRWFGAK EEKISAVKLA VLGSLSADHA LVRLEADVGG EVQQYFLPAS ALWGEEQLRA GAPKLSFTLA KVRRGPQVGA LIDGAYDEQM AQDMLEALRD SRKLSGAGGE VVFEPGSGLA EIADPGEPRY LGAEQSNISI AFGDRMILKL YRRLRAGEQP DVEVARFLTE VAGYTHTPRY LGVVSLRPAE GEATVLAAAF AFVANMGDAW RGLFDALVRD LGEFGGWSTA EAPEVEKDGF SFPLTIPGLL GQRTAELHQA FATETDEAAF RMEPLTAQHL TGWAEGARRE AEAMLSVLER QRTHLPDEVL PLAEALLARR DDLLARLDRV TGFEPSGALT RIHGDYHLGQ VLLAQDDVAI IDFEGEPRRT LAERREKSSP LRDVAGMLRS FDYAAAAALA RHEESFGPAS ERAVERAEAW RQQAVADFLA AYEGASAGTA SLPSDPALKE ALLDLFLVQK AVYETSYELG SRPAWVGIPL RGLLELLDRK PA
|
| |