Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_2446 |
Symbol | |
ID | 3720043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | + |
Start bp | 1080739 |
End bp | 1084047 |
Gene Length | 3309 bp |
Protein Length | 1102 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640070627 |
Product | putative trehalose synthase |
Protein accession | YP_352508 |
Protein GI | 77463004 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02456] trehalose synthase [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.500894 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCAGA ATGTCTCTCC CTCCACCGTT GCCGTCCGCC GCACGGACGG CATCGATCGC AGCCAAACCG ACTGGTACAA GGACGCGATC ATCTATCAGC TGCACATCAA GGCGTTTCAG GACGCCAATG GCGACGGGAT CGGTGACTTC GCGGGCCTGA TGCAGCGGCT GGATTATGTG CAGGCGCTGG GCGTGACCGC GATCTGGCTG CTGCCTTTCT ATCCCTCGCC GCTCCGCGAC GACGGCTACG ACATCTCGGA CTATCGCTCG ATCAACCCGT CCTACGGCGC GATGCGCGAC TTCAAGCTGT TCGTGCAGGA AGCGCACAAG CGCGGGCTGC GCGTCATCAC CGAGCTGGTC ATCAACCACA CCTCCGACCA GCATCCCTGG TTCCAGCGGG CCCGGCGGGC GAAGAAGGGA TCGGCCGCGC GCGACTGGTA CGTCTGGAGC GATACCGACC AGAAATTCCC CGAAACGCGG ATCATCTTCC TCGATACGGA AAAGTCGAAC TGGACCTGGG ATCCGGTGGC AGGCGCCTAT TACTGGCACC GCTTCTATTC GCACCAGCCC GACCTGAACT TCGACAATCC CCGTGTGCTC GAGGAAGTGC TCAAGATCAT GCGCATGTGG CTCGAGATGG GCGTCGACGG GCTCCGGCTC GACGCGATCC CCTATCTGGT CGAGCGCGAG GGCACCAACA ACGAGAACCT CCCCGAGACG CACGATGTTC TCAAGAAGAT CCGGGCCAAC CTCGACCAGC ATTTCCCCGA CAGGATGCTG CTCGCCGAGG CGAACCAGTG GCCGGAGGAC ACGCGCCCCT ATTTCGGCGA GGGCGACGAA TGCCACATGG GCTTCCATTT CCCGCTGATG CCGCGGATGT ACATGGCGCT GGCGCAGGCC GACCGGCATC CGATCACCGA CATCATCCGC CAGACGCCCG AGATCCCGGA AGGCTGCCAA TGGGGCATCT TCCTCCGGAA CCACGACGAG CTGACGCTCG AGATGGTGAC GGCCGAAGAG CGCGACTACA TGTGGCGCTT CTACGCCGAA GATTCGCGGG CGCGGATCAA TCTCGGCATC CGGCGCAGGC TGGCGCCGCT GATGAAGAAC GACCGGCGCA AGATCGAGCT TCTGAACCAG ATGCTGATGT CGATGCCCGG CACGCCCATC GTTTATTACG GCGACGAGAT CGGCATGGGC GACAACTACT ATCTCGGCGA CCGGGATGGC GTGCGGACGC CCATGCAATG GTCGGCCGAC CGCAACGGCG GCTTCTCGCG CTGCAATCCG CAGCAGCTCT ACCTGCCGAC GATCCTCGAT CCGGTCTACG GCCATCAGGC GATCAACGTC GAGGCGCAGG CGGCGGACCC GTCGTCGCTC CTGAACTGGA CGCGCCGCCT GATCGCGGTG CGCAAGCAGC ATCCGGCCTT CGGGCGCGGC TCGATGAGCC TGCTCTATCC GCGCAACCGC AAGGTGCTGG CCTATGTCCG CAGCCACGAG GGCACGGACA TCCTCTGCGT GGCGAACCTC TCGGACACGG CGCAGGCGGT CGAGCTGGAT CTCGGGCGGT TCCGCGGCGC GGTGCCGGTC GAGCTCACGG GCCGGTCGGA GTTTCCGCCG GTGGGCGATC TGCCCTACAT GCTGACGCTG CCGCCCTACG GCTTCTACTG GTTCATCCTC TCCGACCAGC AGGCGCTGCC GAGCTGGCAC CAGCCGATGC CCGAGACGCT GCCGGATTTC ATCACGCTCA CGACGCGCGA CGGGCGGGCC GAGACGGCGC TCACCGGGCG CGAGACGCGC CAGCTCGAGG CCGACGTGCT GCCGAACTGG CTGCCGCTGC AACGCTGGTT CGGGGCCAAG GAGGAGAAGA TCAGCGCGGT GAAACTGGCC GTTCTGGGCT CGCTGTCGGC CGATCATGCC CTCGTCCGGC TCGAGGCCGA TGTGGGCGGC GAGGTGCAGC AGTATTTCCT GCCCGCCTCC GCACTCTGGG GCGAGGAGCA GCTCCGGGCC GGGGCGCCCA AGCTGAGCTT CACCCTGGCC AAGGTCCGGC GCGGGCCGCA GGTGGGCGCG CTCATCGACG GGGCCTATGA CGAGCAGATG GCGCAGGCGA TGCTCGAGGC GCTGCGCGAC GGTCGCAGAC TGTCTGGTGC CGGGGGCGAG GTGGTGTTCG AGCCGGGCTC GGGACTGGCC GAGATCGCCG ACCCGGGCGA GCCGCGCTAT CTCGGGGCCG AGCAGTCGAA CATTTCCATC GCCTTCGGCG ACCGCCTGAT CCTGAAGCTC TACCGACGCC TGCGGGCCGG CGAGCAGCCG GACGTCGAGG TGGCGCGCTT CCTGACGGAA GTGGCGGGCT ACACCCATAC GCCCCGCTAT CTCGGCGTCG TGAGCCTGCG CCCGGCGGAG GGCGAGGCGA CCGTCCTCGC CGCGGCCTTC GCCTTCGTGG CCAACATGGG CGACGCCTGG CGCGGCCTCT TCGACGCGCT GGTGCGGGAT CTGGGCGAGT TCGGCGGCTG GAGCACGGCC GAGGCGCCGG AAGTGGAGAA GGACGGCTTC TCCTTCCCGC TCACCATTCC GGGCCTTCTC GGCCAGCGCA CGGCCGAGCT GCATCAGGCT TTCGCCACCG AGACGGACGA GGCGGCCTTC CGGATGGAGC CGCTCACGGC CCAGCATCTG ACCGGCTGGG CCGAGGGGGC GCGGCGCGAG GCCGAGGCCA TGCTGTCGGT GCTCGAACGG CAGCGGACGC ACCTGCCCGA CGAGGTGCTG CCCCTGGCCG AGGCGCTGCT CGCGCGGCGC GAAGACCTGC TGGCGCGGCT CGACCGCGTT ACCGGCTTCG AACCCTCGGG CGCGCTGACC CGCATCCATG GCGACTACCA TCTGGGGCAG GTGCTGCTTG CGCAGGACGA TGTCGCCATC ATCGACTTCG AGGGCGAACC GCGCCGCACG CTTGCAGAGC GTCGCGAGAA GTCCTCGCCG CTCCGCGACG TGGCGGGGAT GCTGCGTTCG TTCGACTATG CCGCCGCCGC CGCCCTCGCA CGCCACGAGG AAAGCTTCGG CCCGGCGAGC GAGCGGGCGG TGGAGCGGGC CGAGGCCTGG CGGCAACAGG CGGTGGCCGA TTTCCTTGCC GCCTACGAGG GCGCGTCCGC GGGCACCGCG AGCCTGCCCT CCGACCCGGC GCTGAAGGAG GCGCTGCTCG ATCTCTTCCT CGTGCAGAAG GCGGTCTACG AGACCTCCTA CGAACTGGGC AGCCGTCCCG CCTGGGTCGG CATCCCGCTG CGCGGGCTTT TGGAACTGCT GGACAGGAAA CCTGCATGA
|
Protein sequence | MNQNVSPSTV AVRRTDGIDR SQTDWYKDAI IYQLHIKAFQ DANGDGIGDF AGLMQRLDYV QALGVTAIWL LPFYPSPLRD DGYDISDYRS INPSYGAMRD FKLFVQEAHK RGLRVITELV INHTSDQHPW FQRARRAKKG SAARDWYVWS DTDQKFPETR IIFLDTEKSN WTWDPVAGAY YWHRFYSHQP DLNFDNPRVL EEVLKIMRMW LEMGVDGLRL DAIPYLVERE GTNNENLPET HDVLKKIRAN LDQHFPDRML LAEANQWPED TRPYFGEGDE CHMGFHFPLM PRMYMALAQA DRHPITDIIR QTPEIPEGCQ WGIFLRNHDE LTLEMVTAEE RDYMWRFYAE DSRARINLGI RRRLAPLMKN DRRKIELLNQ MLMSMPGTPI VYYGDEIGMG DNYYLGDRDG VRTPMQWSAD RNGGFSRCNP QQLYLPTILD PVYGHQAINV EAQAADPSSL LNWTRRLIAV RKQHPAFGRG SMSLLYPRNR KVLAYVRSHE GTDILCVANL SDTAQAVELD LGRFRGAVPV ELTGRSEFPP VGDLPYMLTL PPYGFYWFIL SDQQALPSWH QPMPETLPDF ITLTTRDGRA ETALTGRETR QLEADVLPNW LPLQRWFGAK EEKISAVKLA VLGSLSADHA LVRLEADVGG EVQQYFLPAS ALWGEEQLRA GAPKLSFTLA KVRRGPQVGA LIDGAYDEQM AQAMLEALRD GRRLSGAGGE VVFEPGSGLA EIADPGEPRY LGAEQSNISI AFGDRLILKL YRRLRAGEQP DVEVARFLTE VAGYTHTPRY LGVVSLRPAE GEATVLAAAF AFVANMGDAW RGLFDALVRD LGEFGGWSTA EAPEVEKDGF SFPLTIPGLL GQRTAELHQA FATETDEAAF RMEPLTAQHL TGWAEGARRE AEAMLSVLER QRTHLPDEVL PLAEALLARR EDLLARLDRV TGFEPSGALT RIHGDYHLGQ VLLAQDDVAI IDFEGEPRRT LAERREKSSP LRDVAGMLRS FDYAAAAALA RHEESFGPAS ERAVERAEAW RQQAVADFLA AYEGASAGTA SLPSDPALKE ALLDLFLVQK AVYETSYELG SRPAWVGIPL RGLLELLDRK PA
|
| |