Gene Rsph17025_1784 details


Gene Information

Locus tag: Rsph17025_1784
Symbol:
ID: 5083690
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 1827352
End bp: 1830660
Gene Length: 3309 bp
Protein Length: 1102 aa
Translation table: 11
GC content: 68%
IMG OID: 640483344
Product: trehalose synthase
Protein accession: YP_001167982
Protein GI: 146277823
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
        [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis
TIGRFAM ID: [TIGR02456] trehalose synthase
            [TIGR02457] trehalose synthase-fused probable maltokinase
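
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names `span_length` and `protein_length` are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1827352
END_BP = 1830660
GENE_LENGTH_BP = 3309
PROTEIN_LENGTH_AA = 1102


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1830660 - 1827352 + 1 gives 3309 bp, and 3309 / 3 - 1 gives 1102 aa, matching the Gene Length and Protein Length fields.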


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCAGA ACGTGCCCCC CGCAACCGTC GCCGTGCGCC GGACCGACGG GATCGATCGC 
AGCCAGAGCG ACTGGTACAA GGATGCGATC ATCTATCAAT TGCACATCAA GGCTTTTCAG
GATGCGAACG GCGACGGGAT CGGCGACTTT GCCGGCCTGA TGCAGCGGCT CGACTATGTG
CAGGCGCTGG GCGTGACGGC GATCTGGCTC TTGCCCTTCT ATCCCTCGCC GCTGCGCGAT
GACGGCTACG ACATCTCGGA CTACCGCTCG ATCAACCCGT CCTACGGCAC GATGCGGGAC
TTCAAGCTCT TCGTGCAGGA AGCCCACAAG CGCGGGCTGC GCGTCATCAC CGAGCTGGTC
ATCAACCACA CCTCGGACCA GCATCCCTGG TTCCAGCGGG CGCGGCGGGC GAAGAAGGGC
TCGGCCGCGC GCGACTGGTA TGTCTGGAGC GACACCGACC AGAAGTTCCC CGAGACGCGG
ATCATCTTCC TCGATACGGA AAAGTCCAAC TGGACCTGGG ACCCTGTGGC CGGGGCCTAT
TACTGGCACC GCTTCTACTC GCACCAGCCC GACCTGAACT TCGACAACCC GCGGGTGCTC
GAGGAAGTTC TCAAGGTGAT GCGGATGTGG CTCGACATGG GGGTGGACGG GCTCCGGCTC
GACGCGATCC CCTATCTGGT CGAGCGCGAG GGCACCAACA ACGAGAATCT CCCCGAGACG
CACGATGTCC TGAAGAAGAT CCGCGCCGAT CTCGACGCCC ATTTCCCCGA CCGGATGCTG
CTGGCCGAGG CGAACCAGTG GCCCGAGGAC ACGCGCCCCT ATTTCGGCGA CGGCGACGAA
TGCCACATGG GCTTCCACTT CCCGCTGATG CCGCGGATGT ACATGGCGCT GGCGCAGGCC
GACCGGCATC CGATCACCGA CATCATCCGC CAGACGCCCG AGATCCCCGA AAGCTGCCAG
TGGGGGATCT TCCTGCGCAA CCATGACGAG CTGACGCTGG AGATGGTGAC GGCGGAAGAG
CGCGACTACA TGTGGCGCTT CTACGCCGAC GATGCGCGGG CCCGGATCAA CCTCGGCATC
CGGCGCAGGC TGGCGCCGCT GATGAAGAAC GACCGGCGCA AGATCGAGCT GCTGAACCAG
ATGCTCATGT CGATGCCGGG CACGCCCATC GTCTATTATG GCGACGAGAT CGGCATGGGC
GACAACTACT ACCTCGGAGA CCGCGACGGG GTGCGCACGC CGATGCAATG GTCGGCCGAC
CGCAACGGCG GCTTTTCGCG CTCCAATCCG CAGCAGCTTT ACCTGCCGGT GATCCTCGAC
CCGATCTATG GCCATCAGGC GATCAACGTC GAGGCGCAGG CGGCGGACCC CTCGTCCCTG
CTCAACTGGA TGCGGCGGCT GATCGCCGTG CGCAAGCAGC ATCCGGCCTT TGGACGCGGC
ACGATGAGCC TGCTCTATCC GCGCAACCGC AAGGTGCTGG CCTATGTGCG CAGCCACGAG
GACACCCACA TCCTGTGCGT CGCGAACCTG TCGGACACGG CGCAGGCGGT CGAGCTGGAT
CTGGGGCGCT TCCGGGGCAC GGTGCCGGTG GAACTGACGG GCCGATCCGA ATTTCCGCCG
GTGGGCGATC TGCCGTACAT GCTCACGCTG CCGCCCTACG GGTTCTACTG GTTCGTGCTC
TCGGACCAGC AGACGCTGCC CAGCTGGCAC CAGCCGATGC CCGAGATCCT GCCCGATTTC
ATCACGCTCA CCACGCGCGA CGGGCGGGCC GAGACGGCGC TCAACGGGCG CGAAAAGGGC
CAGCTCGAGG CGGACGTCCT GCCGAACTGG CTGCCGCTCC AGAGGTGGTT CGGCGCCAAG
GAGGAACGGA TCGGCTCGGT CCGGCTTCGG GTTCTGGGGG CGCTGTCCGA GGCCCATGCG
CTCGTGCGGC TCGATGCCGA GGTGGGCGGC GAGGCCCACC AGTATTTCCT GCCGGCCTCG
ACCCTCTGGG GCGAGGACCA GCTCCGCTCG GGCGCGCCGA AGCTGAGCTA CACGCTCGCC
AAGGTGCGGC GCGGCCCGCA GGTGGGCGCG CTGATCGACG GCGCCTATGA CGAACGTCTG
GCGCAGGCGA TGCTCGATGC GCTGCGGCAG GCGCGGCGGC TGAAGGGGCC GGCGGGCGAT
GTCCTGTTCG AGCCGGGCTC GGGCCTGGCC GAGATCACCG ATCCGGGCGA GCCGCGCTAT
CTGGGCGCCG AGCAGTCGAA CATCTCGATC GCCTTTGGCG ACCGGATCAT CCTCAAGCTC
TATCGCCGCT TGCGTGCGGG CGAGCAGCCG GATGTCGAGG TGGCGCGCTT CCTGACCGAG
GTCGCGGGCT ACGCGCACAC GCCGCGGTTC CTGGGTGTGG TGAGCCTCCA GCCGCCGGAG
GGCGAGGCCA CGGTCGTGGC GGCGGCCTTC GCCTTCGTGG CCAACATGGG CGACGCCTGG
CGCGGCCTTT TCGACGCCCT CGTGCGCGAT CTGGGCGAGC ATGGTGGCTG GGCCGCCTCC
GAGGCGCCGC CCGCGGATGC GGACGGCTTC ACCTTCCCGC TCACCATCGC GGGGTTGCTG
GGCCAGCGGA CGGCGGAACT GCACCGCGCC TTCGCCACCG AAACGGACGA TCCCGCCTTC
CGGCTGGAGC CGCTGACGGG CGAGCTGCTG TCGCGGTGGG CCGAAGGGGC GCGGGGCGAG
GCCGAGGCCA TGCTCACCGT GCTCGGGCGG CAGCGCATGA CCCTGCCCGA GGAGGTGCAG
CCGCTGGCCG AGACGCTGCT TGCGCGGCGC GAGGCGCTGC TGTCCCGTCT CGACCGGGTG
AAGGGCTTCG AGGCCTCGGG CGCGCTCAGC CGGATCCACG GCGACTATCA TCTGGGGCAG
GTGCTGCTGG CGCAGGATGA CGTGGCCATC ATCGATTTCG AGGGCGAACC CCGCCGTTCG
CTGGACGAGC GTCGGCAGAA GTCATCGCCG CTGCGCGACG TGGCCGGGAT GCTGCGCTCG
TTCGACTATG CGGCGGCCGC GGCCCTTGCG CGCCATGCCG AGGCCCTCGG CCCGGCGAAC
GATCAGGCGC TGGCGCGGGC CGAGGCCTGG CGGCAGCGCG CGGTGGCCGA GTTCCTCGCC
GCCTATGAGG CCGCCGCCGA GGGCGTGGCA AGCCTGCCGC AGGATGCCAG CCTTCGCGGC
GCGCTGCTCG ACCTCTTCCT CGTGCAGAAG GCGGTCTACG AGACCTCCTA CGAACTGGGC
AGCCGACCGG CCTGGGTCGG GATCCCGCTG CGCGGCCTTC TGGAACTGCT GGACAGGGAG
CCTGCATGA
 
Protein sequence
MNQNVPPATV AVRRTDGIDR SQSDWYKDAI IYQLHIKAFQ DANGDGIGDF AGLMQRLDYV 
QALGVTAIWL LPFYPSPLRD DGYDISDYRS INPSYGTMRD FKLFVQEAHK RGLRVITELV
INHTSDQHPW FQRARRAKKG SAARDWYVWS DTDQKFPETR IIFLDTEKSN WTWDPVAGAY
YWHRFYSHQP DLNFDNPRVL EEVLKVMRMW LDMGVDGLRL DAIPYLVERE GTNNENLPET
HDVLKKIRAD LDAHFPDRML LAEANQWPED TRPYFGDGDE CHMGFHFPLM PRMYMALAQA
DRHPITDIIR QTPEIPESCQ WGIFLRNHDE LTLEMVTAEE RDYMWRFYAD DARARINLGI
RRRLAPLMKN DRRKIELLNQ MLMSMPGTPI VYYGDEIGMG DNYYLGDRDG VRTPMQWSAD
RNGGFSRSNP QQLYLPVILD PIYGHQAINV EAQAADPSSL LNWMRRLIAV RKQHPAFGRG
TMSLLYPRNR KVLAYVRSHE DTHILCVANL SDTAQAVELD LGRFRGTVPV ELTGRSEFPP
VGDLPYMLTL PPYGFYWFVL SDQQTLPSWH QPMPEILPDF ITLTTRDGRA ETALNGREKG
QLEADVLPNW LPLQRWFGAK EERIGSVRLR VLGALSEAHA LVRLDAEVGG EAHQYFLPAS
TLWGEDQLRS GAPKLSYTLA KVRRGPQVGA LIDGAYDERL AQAMLDALRQ ARRLKGPAGD
VLFEPGSGLA EITDPGEPRY LGAEQSNISI AFGDRIILKL YRRLRAGEQP DVEVARFLTE
VAGYAHTPRF LGVVSLQPPE GEATVVAAAF AFVANMGDAW RGLFDALVRD LGEHGGWAAS
EAPPADADGF TFPLTIAGLL GQRTAELHRA FATETDDPAF RLEPLTGELL SRWAEGARGE
AEAMLTVLGR QRMTLPEEVQ PLAETLLARR EALLSRLDRV KGFEASGALS RIHGDYHLGQ
VLLAQDDVAI IDFEGEPRRS LDERRQKSSP LRDVAGMLRS FDYAAAAALA RHAEALGPAN
DQALARAEAW RQRAVAEFLA AYEAAAEGVA SLPQDASLRG ALLDLFLVQK AVYETSYELG
SRPAWVGIPL RGLLELLDRE PA
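
The sequences above are printed in space-separated blocks of 10 characters. As a rough sketch of how such a listing might be processed (the helper names `join_blocks` and `gc_fraction` are hypothetical and not part of the record), the blocks can be joined into one string and the GC fraction computed from it. Only the first line of the gene sequence is used here for brevity, so the result is not the whole-gene GC content reported in the record.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAACCAGA ACGTGCCCCC CGCAACCGTC GCCGTGCGCC GGACCGACGG GATCGATCGC"

if __name__ == "__main__":
    seq = join_blocks(first_line)
    print(len(seq))                     # number of bases on this line
    print(seq.startswith("ATG"))        # checks for an ATG start codon
    print(round(gc_fraction(seq), 3))   # GC fraction of this line only
```

Passing the full gene sequence text to `join_blocks` in the same way would allow the length and GC content fields to be compared against the record.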