Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_1784 |
Symbol | |
ID | 5083690 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | - |
Start bp | 1827352 |
End bp | 1830660 |
Gene Length | 3309 bp |
Protein Length | 1102 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640483344 |
Product | trehalose synthase |
Protein accession | YP_001167982 |
Protein GI | 146277823 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02456] trehalose synthase [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCAGA ACGTGCCCCC CGCAACCGTC GCCGTGCGCC GGACCGACGG GATCGATCGC AGCCAGAGCG ACTGGTACAA GGATGCGATC ATCTATCAAT TGCACATCAA GGCTTTTCAG GATGCGAACG GCGACGGGAT CGGCGACTTT GCCGGCCTGA TGCAGCGGCT CGACTATGTG CAGGCGCTGG GCGTGACGGC GATCTGGCTC TTGCCCTTCT ATCCCTCGCC GCTGCGCGAT GACGGCTACG ACATCTCGGA CTACCGCTCG ATCAACCCGT CCTACGGCAC GATGCGGGAC TTCAAGCTCT TCGTGCAGGA AGCCCACAAG CGCGGGCTGC GCGTCATCAC CGAGCTGGTC ATCAACCACA CCTCGGACCA GCATCCCTGG TTCCAGCGGG CGCGGCGGGC GAAGAAGGGC TCGGCCGCGC GCGACTGGTA TGTCTGGAGC GACACCGACC AGAAGTTCCC CGAGACGCGG ATCATCTTCC TCGATACGGA AAAGTCCAAC TGGACCTGGG ACCCTGTGGC CGGGGCCTAT TACTGGCACC GCTTCTACTC GCACCAGCCC GACCTGAACT TCGACAACCC GCGGGTGCTC GAGGAAGTTC TCAAGGTGAT GCGGATGTGG CTCGACATGG GGGTGGACGG GCTCCGGCTC GACGCGATCC CCTATCTGGT CGAGCGCGAG GGCACCAACA ACGAGAATCT CCCCGAGACG CACGATGTCC TGAAGAAGAT CCGCGCCGAT CTCGACGCCC ATTTCCCCGA CCGGATGCTG CTGGCCGAGG CGAACCAGTG GCCCGAGGAC ACGCGCCCCT ATTTCGGCGA CGGCGACGAA TGCCACATGG GCTTCCACTT CCCGCTGATG CCGCGGATGT ACATGGCGCT GGCGCAGGCC GACCGGCATC CGATCACCGA CATCATCCGC CAGACGCCCG AGATCCCCGA AAGCTGCCAG TGGGGGATCT TCCTGCGCAA CCATGACGAG CTGACGCTGG AGATGGTGAC GGCGGAAGAG CGCGACTACA TGTGGCGCTT CTACGCCGAC GATGCGCGGG CCCGGATCAA CCTCGGCATC CGGCGCAGGC TGGCGCCGCT GATGAAGAAC GACCGGCGCA AGATCGAGCT GCTGAACCAG ATGCTCATGT CGATGCCGGG CACGCCCATC GTCTATTATG GCGACGAGAT CGGCATGGGC GACAACTACT ACCTCGGAGA CCGCGACGGG GTGCGCACGC CGATGCAATG GTCGGCCGAC CGCAACGGCG GCTTTTCGCG CTCCAATCCG CAGCAGCTTT ACCTGCCGGT GATCCTCGAC CCGATCTATG GCCATCAGGC GATCAACGTC GAGGCGCAGG CGGCGGACCC CTCGTCCCTG CTCAACTGGA TGCGGCGGCT GATCGCCGTG CGCAAGCAGC ATCCGGCCTT TGGACGCGGC ACGATGAGCC TGCTCTATCC GCGCAACCGC AAGGTGCTGG CCTATGTGCG CAGCCACGAG GACACCCACA TCCTGTGCGT CGCGAACCTG TCGGACACGG CGCAGGCGGT CGAGCTGGAT CTGGGGCGCT TCCGGGGCAC GGTGCCGGTG GAACTGACGG GCCGATCCGA ATTTCCGCCG GTGGGCGATC TGCCGTACAT GCTCACGCTG CCGCCCTACG GGTTCTACTG GTTCGTGCTC TCGGACCAGC AGACGCTGCC CAGCTGGCAC CAGCCGATGC CCGAGATCCT GCCCGATTTC ATCACGCTCA CCACGCGCGA CGGGCGGGCC GAGACGGCGC TCAACGGGCG CGAAAAGGGC CAGCTCGAGG CGGACGTCCT GCCGAACTGG CTGCCGCTCC AGAGGTGGTT CGGCGCCAAG GAGGAACGGA TCGGCTCGGT CCGGCTTCGG GTTCTGGGGG CGCTGTCCGA GGCCCATGCG CTCGTGCGGC TCGATGCCGA GGTGGGCGGC GAGGCCCACC AGTATTTCCT GCCGGCCTCG ACCCTCTGGG GCGAGGACCA GCTCCGCTCG GGCGCGCCGA AGCTGAGCTA CACGCTCGCC AAGGTGCGGC GCGGCCCGCA GGTGGGCGCG CTGATCGACG GCGCCTATGA CGAACGTCTG GCGCAGGCGA TGCTCGATGC GCTGCGGCAG GCGCGGCGGC TGAAGGGGCC GGCGGGCGAT GTCCTGTTCG AGCCGGGCTC GGGCCTGGCC GAGATCACCG ATCCGGGCGA GCCGCGCTAT CTGGGCGCCG AGCAGTCGAA CATCTCGATC GCCTTTGGCG ACCGGATCAT CCTCAAGCTC TATCGCCGCT TGCGTGCGGG CGAGCAGCCG GATGTCGAGG TGGCGCGCTT CCTGACCGAG GTCGCGGGCT ACGCGCACAC GCCGCGGTTC CTGGGTGTGG TGAGCCTCCA GCCGCCGGAG GGCGAGGCCA CGGTCGTGGC GGCGGCCTTC GCCTTCGTGG CCAACATGGG CGACGCCTGG CGCGGCCTTT TCGACGCCCT CGTGCGCGAT CTGGGCGAGC ATGGTGGCTG GGCCGCCTCC GAGGCGCCGC CCGCGGATGC GGACGGCTTC ACCTTCCCGC TCACCATCGC GGGGTTGCTG GGCCAGCGGA CGGCGGAACT GCACCGCGCC TTCGCCACCG AAACGGACGA TCCCGCCTTC CGGCTGGAGC CGCTGACGGG CGAGCTGCTG TCGCGGTGGG CCGAAGGGGC GCGGGGCGAG GCCGAGGCCA TGCTCACCGT GCTCGGGCGG CAGCGCATGA CCCTGCCCGA GGAGGTGCAG CCGCTGGCCG AGACGCTGCT TGCGCGGCGC GAGGCGCTGC TGTCCCGTCT CGACCGGGTG AAGGGCTTCG AGGCCTCGGG CGCGCTCAGC CGGATCCACG GCGACTATCA TCTGGGGCAG GTGCTGCTGG CGCAGGATGA CGTGGCCATC ATCGATTTCG AGGGCGAACC CCGCCGTTCG CTGGACGAGC GTCGGCAGAA GTCATCGCCG CTGCGCGACG TGGCCGGGAT GCTGCGCTCG TTCGACTATG CGGCGGCCGC GGCCCTTGCG CGCCATGCCG AGGCCCTCGG CCCGGCGAAC GATCAGGCGC TGGCGCGGGC CGAGGCCTGG CGGCAGCGCG CGGTGGCCGA GTTCCTCGCC GCCTATGAGG CCGCCGCCGA GGGCGTGGCA AGCCTGCCGC AGGATGCCAG CCTTCGCGGC GCGCTGCTCG ACCTCTTCCT CGTGCAGAAG GCGGTCTACG AGACCTCCTA CGAACTGGGC AGCCGACCGG CCTGGGTCGG GATCCCGCTG CGCGGCCTTC TGGAACTGCT GGACAGGGAG CCTGCATGA
|
Protein sequence | MNQNVPPATV AVRRTDGIDR SQSDWYKDAI IYQLHIKAFQ DANGDGIGDF AGLMQRLDYV QALGVTAIWL LPFYPSPLRD DGYDISDYRS INPSYGTMRD FKLFVQEAHK RGLRVITELV INHTSDQHPW FQRARRAKKG SAARDWYVWS DTDQKFPETR IIFLDTEKSN WTWDPVAGAY YWHRFYSHQP DLNFDNPRVL EEVLKVMRMW LDMGVDGLRL DAIPYLVERE GTNNENLPET HDVLKKIRAD LDAHFPDRML LAEANQWPED TRPYFGDGDE CHMGFHFPLM PRMYMALAQA DRHPITDIIR QTPEIPESCQ WGIFLRNHDE LTLEMVTAEE RDYMWRFYAD DARARINLGI RRRLAPLMKN DRRKIELLNQ MLMSMPGTPI VYYGDEIGMG DNYYLGDRDG VRTPMQWSAD RNGGFSRSNP QQLYLPVILD PIYGHQAINV EAQAADPSSL LNWMRRLIAV RKQHPAFGRG TMSLLYPRNR KVLAYVRSHE DTHILCVANL SDTAQAVELD LGRFRGTVPV ELTGRSEFPP VGDLPYMLTL PPYGFYWFVL SDQQTLPSWH QPMPEILPDF ITLTTRDGRA ETALNGREKG QLEADVLPNW LPLQRWFGAK EERIGSVRLR VLGALSEAHA LVRLDAEVGG EAHQYFLPAS TLWGEDQLRS GAPKLSYTLA KVRRGPQVGA LIDGAYDERL AQAMLDALRQ ARRLKGPAGD VLFEPGSGLA EITDPGEPRY LGAEQSNISI AFGDRIILKL YRRLRAGEQP DVEVARFLTE VAGYAHTPRF LGVVSLQPPE GEATVVAAAF AFVANMGDAW RGLFDALVRD LGEHGGWAAS EAPPADADGF TFPLTIAGLL GQRTAELHRA FATETDDPAF RLEPLTGELL SRWAEGARGE AEAMLTVLGR QRMTLPEEVQ PLAETLLARR EALLSRLDRV KGFEASGALS RIHGDYHLGQ VLLAQDDVAI IDFEGEPRRS LDERRQKSSP LRDVAGMLRS FDYAAAAALA RHAEALGPAN DQALARAEAW RQRAVAEFLA AYEAAAEGVA SLPQDASLRG ALLDLFLVQK AVYETSYELG SRPAWVGIPL RGLLELLDRE PA
|
| |