Gene Rsph17029_1110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1110 
Symbol 
ID4895155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1147610 
End bp1150918 
Gene Length3309 bp 
Protein Length1102 aa 
Translation table11 
GC content68% 
IMG OID640111696 
Producttrehalose synthase 
Protein accessionYP_001042992 
Protein GI126461878 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases
[COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis 
TIGRFAM ID[TIGR02456] trehalose synthase
[TIGR02457] trehalose synthase-fused probable maltokinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.531245 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCAGA ACGTCTCTCC CTCCACCGTT GCCGTCCGCC GCACGGACGG CATCGATCGC 
AGCCAGACCG ACTGGTACAA GGACGCGATC ATCTATCAGC TGCACATCAA GGCGTTCCAG
GACGCCAATG GCGACGGGAT CGGTGACTTC GCGGGCCTGA TGCAGCGGCT GGATTATGTG
CAGGCGCTGG GCGTGACCGC GATCTGGCTG CTGCCTTTCT ATCCCTCGCC GCTCCGCGAC
GACGGCTACG ACATCTCGGA CTATCGCTCG ATCAACCCGT CCTACGGCGC GATGCGCGAC
TTCAAGCTGT TCGTGCAGGA GGCGCACAAG CGCGGGCTGC GCGTCATCAC CGAGCTCGTC
ATCAACCACA CCTCCGACCA GCATCCCTGG TTCCAGCGGG CCCGGCGGGC GAAGAAGGGA
TCGGCCGCGC GCGACTGGTA TGTCTGGAGC GATACCGACC AGAAATTCCC CGAAACGCGG
ATCATCTTCC TCGATACGGA AAAGTCGAAC TGGACCTGGG ATCCGGTGGC AGGCGCCTAT
TACTGGCACC GCTTCTATTC GCACCAGCCC GACCTGAACT TCGACAATCC CCGTGTGCTC
GAGGAAGTGC TCAAGATCAT GCGCATGTGG CTCGAGATGG GCGTCGACGG GCTCCGGCTC
GACGCGATCC CCTATCTGGT CGAGCGCGAG GGCACCAACA ACGAGAACCT CCCCGAGACG
CACGATGTTC TCAAGAAGAT CCGGGCCAAC CTCGACCAGC ATTTCCCCGA CAGGATGCTG
CTCGCCGAGG CGAACCAGTG GCCGGAGGAC ACGCGCCCCT ATTTCGGCGA GGGCGACGAA
TGCCACATGG GCTTCCATTT CCCGCTGATG CCGCGGATGT ACATGGCGCT GGCACAGGCA
GACCGGCATC CGATCACCGA CATCATCCGC CAGACGCCCG AGATCCCGGA AGGCTGCCAA
TGGGGCATCT TCCTCCGGAA CCACGACGAG CTGACGCTCG AGATGGTGAC GGCCGAAGAG
CGCGATTACA TGTGGCGCTT CTATGCCGAG GATTCTCGGG CGCGGATCAA TCTCGGCATC
CGGCGCAGGC TGGCGCCGCT GATGAAGAAC GACCGGCGCA AGATCGAGCT TCTGAACCAG
ATGCTGATGT CGATGCCCGG CACGCCCATC GTCTATTACG GCGACGAGAT TGGCATGGGC
GACAACTATT ATCTCGGCGA CCGGGATGGT GTGCGGACGC CCATGCAATG GTCGGCCGAC
CGCAACGGCG GCTTCTCGCG CTGCAATCCG CAGCAGCTCT ACCTGCCGAC CATCCTCGAT
CCGGTCTATG GCCATCAGGC GATCAACGTC GAGGCGCAGG CGGCGGACCC GTCGTCGCTG
CTGAACTGGA CGCGCCGCCT GATCGCGGTG CGCAAACAGC ATCCGGCCTT CGGGCGCGGC
TCGATGAGCC TGCTCTATCC GCGCAACCGC AAGGTGCTGG CCTATGTCCG CAGCCACGAG
GGCACAGACA TCCTCTGCGT GGCGAACCTC TCGGACACGG CGCAGGCGGT CGAGCTGGAT
CTCGCCCGGT TCCGCGGCGC GGTGCCGGTC GAGCTTACGG GCCGGTCGGA GTTTCCGCCG
GTGGGCGATC TGCCCTACAT GCTGACGCTG CCGCCCTACG GCTTCTACTG GTTCATCCTC
TCCGACCAGC AGGCGCTGCC GAGCTGGCAC CAGCCGATGC CCGAGACGCT GCCGGATTTC
ATCACGCTCA CGACGCGCGA CGGGCGGGCC GAGACGGCGC TCACCGGGCG CGAGACGCGC
CAGCTCGAGG CCGACGTGCT GCCGAACTGG CTGCCGCTGC AACGCTGGTT CGGCGCCAAG
GAAGAAAAGA TCAGCGCCGT GAAGCTGGCC GTTCTGGGCT CGCTGTCGGC CGATCATGCC
CTCGTCCGGC TCGAGGCCGA TGTGGGCGGC GAGGTGCAGC AGTATTTCCT GCCCGCCTCC
GCGCTCTGGG GCGAGGAGCA GCTCCGGGCC GGGGCGCCCA AGCTGAGCTT CACGCTGGCC
AAGGTCCGGC GCGGGCCGCA GGTGGGCGCG CTCATCGACG GGGCCTATGA CGAGCAGATG
GCGCAGGACA TGCTCGAGGC GCTGCGCGAC AGTCGCAAAC TGTCGGGTGC CGGGGGCGAG
GTGGTGTTCG AGCCGGGCTC GGGACTGGCC GAGATCGCCG ACCCGGGCGA GCCGCGCTAT
CTCGGGGCCG AGCAGTCGAA CATCTCGATC GCCTTCGGCG ACCGCATGAT CCTGAAGCTC
TACCGCCGCT TGCGGGCGGG CGAGCAGCCG GACGTCGAGG TGGCGCGCTT CCTGACGGAA
GTGGCGGGCT ACACCCATAC GCCCCGCTAT CTCGGCGTCG TGAGCCTGCG TCCGGCGGAG
GGCGAGGCGA CCGTCCTCGC CGCGGCCTTC GCCTTCGTGG CCAACATGGG CGACGCCTGG
CGCGGCCTCT TCGACGCGCT GGTGCGGGAT CTGGGCGAGT TCGGCGGCTG GAGCACGGCC
GAGGCGCCGG AAGTGGAGAA GGACGGCTTC TCCTTCCCGC TCACCATTCC GGGCCTTCTC
GGCCAGCGCA CGGCCGAGCT GCATCAGGCC TTCGCCACCG AGACGGACGA GGCGGCCTTC
CGGATGGAGC CGCTCACGGC CCAGCATCTG ACCGGCTGGG CCGAGGGGGC GCGGCGCGAG
GCCGAGGCCA TGCTGTCGGT GCTCGAACGG CAGCGGACGC ACCTGCCCGA CGAGGTGCTG
CCCCTGGCCG AGGCGCTGCT CGCGCGGCGC GACGACCTGC TGGCGCGGCT CGACCGCGTC
ACCGGCTTCG AACCCTCGGG CGCGCTGACC CGCATCCATG GCGACTACCA TCTGGGGCAG
GTGCTGCTTG CGCAGGACGA TGTCGCCATC ATCGACTTCG AGGGCGAACC GCGCCGCACG
CTTGCAGAGC GCCGCGAGAA GTCCTCGCCG CTCCGCGACG TGGCGGGGAT GCTGCGCTCG
TTTGACTATG CCGCCGCCGC CGCCCTCGCA CGTCACGAGG AGAGCTTCGG CCCGGCGAGC
GAGCGGGCGG TGGAGCGGGC CGAGGCCTGG CGGCAACAGG CGGTGGCCGA TTTCCTTGCC
GCTTACGAGG GCGCGTCCGC GGGCACCGCG AGCCTGCCCT CCGACCCGGC GCTGAAGGAG
GCGCTGCTCG ATCTCTTCCT CGTGCAGAAG GCGGTCTACG AGACCTCCTA CGAATTGGGC
AGCCGTCCCG CCTGGGTCGG CATCCCGCTG CGCGGGCTTC TGGAACTGCT GGACAGGAAA
CCTGCATGA
 
Protein sequence
MNQNVSPSTV AVRRTDGIDR SQTDWYKDAI IYQLHIKAFQ DANGDGIGDF AGLMQRLDYV 
QALGVTAIWL LPFYPSPLRD DGYDISDYRS INPSYGAMRD FKLFVQEAHK RGLRVITELV
INHTSDQHPW FQRARRAKKG SAARDWYVWS DTDQKFPETR IIFLDTEKSN WTWDPVAGAY
YWHRFYSHQP DLNFDNPRVL EEVLKIMRMW LEMGVDGLRL DAIPYLVERE GTNNENLPET
HDVLKKIRAN LDQHFPDRML LAEANQWPED TRPYFGEGDE CHMGFHFPLM PRMYMALAQA
DRHPITDIIR QTPEIPEGCQ WGIFLRNHDE LTLEMVTAEE RDYMWRFYAE DSRARINLGI
RRRLAPLMKN DRRKIELLNQ MLMSMPGTPI VYYGDEIGMG DNYYLGDRDG VRTPMQWSAD
RNGGFSRCNP QQLYLPTILD PVYGHQAINV EAQAADPSSL LNWTRRLIAV RKQHPAFGRG
SMSLLYPRNR KVLAYVRSHE GTDILCVANL SDTAQAVELD LARFRGAVPV ELTGRSEFPP
VGDLPYMLTL PPYGFYWFIL SDQQALPSWH QPMPETLPDF ITLTTRDGRA ETALTGRETR
QLEADVLPNW LPLQRWFGAK EEKISAVKLA VLGSLSADHA LVRLEADVGG EVQQYFLPAS
ALWGEEQLRA GAPKLSFTLA KVRRGPQVGA LIDGAYDEQM AQDMLEALRD SRKLSGAGGE
VVFEPGSGLA EIADPGEPRY LGAEQSNISI AFGDRMILKL YRRLRAGEQP DVEVARFLTE
VAGYTHTPRY LGVVSLRPAE GEATVLAAAF AFVANMGDAW RGLFDALVRD LGEFGGWSTA
EAPEVEKDGF SFPLTIPGLL GQRTAELHQA FATETDEAAF RMEPLTAQHL TGWAEGARRE
AEAMLSVLER QRTHLPDEVL PLAEALLARR DDLLARLDRV TGFEPSGALT RIHGDYHLGQ
VLLAQDDVAI IDFEGEPRRT LAERREKSSP LRDVAGMLRS FDYAAAAALA RHEESFGPAS
ERAVERAEAW RQQAVADFLA AYEGASAGTA SLPSDPALKE ALLDLFLVQK AVYETSYELG
SRPAWVGIPL RGLLELLDRK PA