Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ppha_2782 |
Symbol | |
ID | 6462058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pelodictyon phaeoclathratiforme BU-1 |
Kingdom | Bacteria |
Replicon accession | NC_011060 |
Strand | - |
Start bp | 2872902 |
End bp | 2876198 |
Gene Length | 3297 bp |
Protein Length | 1098 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642728935 |
Product | trehalose synthase |
Protein accession | YP_002019550 |
Protein GI | 194337756 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02456] trehalose synthase [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATCAAC CCGAACCGCT CTGGTACAAA GACGCCATTA TTTATGAGGC GCATGTCAAG ACGTTCTATG ACAGCAACAA TGATGGTGTT GGAGATTTTC AGGGACTGCG CCAGAAGCTT GGTTACCTGC AAAGTCTCGG TGTTACTGCC ATCTGGCTGC TTCCCTTTTA TCCATCGCCT TTACGAGATG ATGGTTACGA TATTGCCGAT TATATGAGTG TCAATCCCGA TTACGGGACG ATGGAAGATT TCCGGGAGTT TATTGAAGAG GCACACTCTC TTGGTATAAA GGTGATTACC GAGCTGGTCG TGAACCACAC TTCCGACCAA CATGCCTGGT TCCAGCGTGC CCGGAAGGCT CCGGCAGGCT CTCCAGAGAG AAACTTCTAT GTCTGGAGTG ACGATTCCAG CAAATATTCT GAAGCCCGGA TTATTTTCCA GGACTTCGAG GCTTCAAACT GGACATGGGA TCCGGTTGCC GGGCAATATT TCTGGCATCG CTTCTACCAT CACCAGCCCG ACCTGAACTT TGAAAACCCC GAAGTACACA AGGCTCTGCT CGGTGTGCTT GATTTCTGGC TTGGCATGGG CGTTGATGGT CTCCGCCTTG ATGCTGTTCC CTATCTCTAT GAAGAGGAGG GGAGTAACTG CGAAAATTTA CCGAGGACTT ATGAGTATCT GAGGGCGTTG CGCTCCTATG TGGATGAGCA CTATCCCAAC CGGATGCTGC TTGCCGAAGC CAACCAGTGG CCTGAAGATT CTGCAGCCTA TTTTGGTACC GGTGACCTCT GTCACATGAA CTTCCACTTT CCCCTGATGC CGCGCATGTA TATGGCGCTT GCTACCGAAG ATCGTTTTCC CATTCTCGAT ATTCTGGAAC AGACACCGGA AATTCCGGAG ATCTGCCAGT GGGCTTCGTT CCTTCGCAAT CATGATGAGC TGACGCTTGA AATGGTTACT GACGAGGAGC GCGACTACAT GCGCAGGGTC TATGCCAACG ATCCCAAGGC CCGTATCAAT CTTGGTATCC GCCGCCGTCT TGCACCGCTC ATGGCCAATG ATCGCCGCAA GATTGAACTG ATGAACATCA TGCTGCTCTC ACTGCCCGGA ACTCCGGTAC TCTACTACGG TGACGAAATT GGCATGGGCG ATAACTTTTA TCTTGGTGAT CGAGATGGAG TACGTACACC CATGCAGTGG AACTCTGACC GTAATGCAGG GTTTTCGCGT GCCAATCCCC AGCGGCTGCA GCTCCCGGTC ATCATCGATC CGGAATATCA TTATGAGGCG GTCAACGTAG AGGTGCAGGA GAGCAATGTC AACTCCCTGC TCTGGTGGAT GCGCCATACC ATATCAACTG CTCATCGCTA CAAATCGCTC AGCCGTGGTA CCATTGAGTT CCTGCAGGTG AGCAACCCCA AAGTGCTGAT TTTTATTCGT CAGTTCGAGG ATGAAACCAT GCTTAGTGTG ATCAACCTCT CAAGAAATGC CCAGGCGGTG ATGATTGATC TATCCCGCTT TGACGGCTAT ATCCCTGAAG AGGTGTTCAG CATGAACCGT TTTCCGAAAA TCAGGAAAAT TCCCTACATG GTGGCCCTGG GATCCTATGG ATATTTCTGG TTGCAGTTGG TCAAGGATAC CGAAAATACG GATGGTCGTC CATACCTCGA CAAACCTTTT GCGACACCCT CTCGCTGGCA AAGCCTCTTT ACCGGCAAAA GCCGCGAACG GCTTGAAACA GAGATTCTGC CCCAGTACTT CAAGGTAAGC CGCTGGTTTG GCGGCAAGGT GCGTCATATC ATGCGTATCT CCATCGTCGA TACCATTCCT GTGGCGGGAA TGGCAAAAGC CAAGTTGCTG GTTACGGAAG TGCGCTATTC AAGTGGTGAG AATGAACGCT ATCAACTCCC GGTATGCTTT TCTCCGCTTT CCCTGGTTTC TCTCCAGGAT GATAATTTCT ACAAGCGCGT TATTGCAAGG GTCGTTATTG GCGATGAAGA GGGTTATCTG TGCGACGCCA CCTTTGATGG GCGTTTCCTG AACCATCTCT ATCAGCTTGT TACCGGCAAG GAGAGTTGGC AGGGTAAAGA GGGCACTGTT TCCGGAATGA AGTCCATGAA GATGGATGCT GTTGCGGAAG AGGGAATGGA GCATGAGCCC CTCCTGATGG GTGTCGAACA ATCGAATACC TCAATTCGCT TCAATAATGA TCTCTGCCTG AAGCTCTATC GCCGCATTGA AATCGGAATC TCTCCCGAAG TGGAGATGTG CCGTGCGCTG AGTGAGCATA CTTCGTTCAA AAACCTGCCG GGCTATCTTG GTTCACTGAA CTATGAGCAG AGTCGCACCA ACGGCTATTC TCTCGGTATT TTGCAGCATT TTGTGAAAAA CGAAGGGGAT GCCTGGCAGC TTTCGCTTAG TCAGGTGAAA CGCTATTATG ATGATATCCT TGCAAAAATC AACTCCGGCA TGGTATTGCC TGCGTTGCCG AAACTCAGTG GTGATCCCGT GCAGCTTCCC GAAATCATGC ATGAACTGAT TGGTGAAGCC TATCTTGGCA TGCTTGAAAA GCTTGCAGAA CGGACTGCTG AAATGCACCT CTCCCTTGCT TCGCTCGACA GCGATCCGGC ATTTGCGCCA GAGGCTTTTA CTACACTCTA TCAGCGTTCT ATCTACCAGG CGATGTGCGA ACAGGTGAAG CGGGCGGTTC TTCTTATCCG TGAAATCATG CATCAGATGG CTCCGGAGCA GCAGCAGCTT GCATCTCTTT TTGTGCAGAG GCAGAAACAG ATTTTGCAGC AGTTTGATCC CATCAGGATA GAGAAAATCG ATACACTCAA AATCAGGATT CACGGCGATT TTCATCTTGG CCAGGTACTC TTTACCGGCA AGGATTTTGT GATTATCGAT TTTGAGGGAG AGCCTGCCCG GCCCCTCTCC GAACGGAAAA TCAAACGTTC GGTTTTCCGT GATATTTCAG GCATGTTGCG CTCCTTCGAC TATGCGGCCT TTAATGTGCT GCAGCAGGAC AACACCCTCT TCCGTCCGGA AGAGCGGCTT GCCCTTGAGC CTTGGGCTGA CCGCTGGAGT TTCTATGCTG GCCAGTACTT TCTCGACAGC TATTTTGCAA AGACCGGAGG CAGTAATATT GTTCCGGCAG ATCCGAAGCA GCGTGAACAT CTGATGCGGG CCTATTTGAT GAACAAGGCG GTCTACGAAC TCAATTATGA GCTCAACAAT CGTCCTGACT GGGCCTCCAT ACCGCTTCGG GGTATTATGA AGATTCTGGA GTCGTAA
|
Protein sequence | MYQPEPLWYK DAIIYEAHVK TFYDSNNDGV GDFQGLRQKL GYLQSLGVTA IWLLPFYPSP LRDDGYDIAD YMSVNPDYGT MEDFREFIEE AHSLGIKVIT ELVVNHTSDQ HAWFQRARKA PAGSPERNFY VWSDDSSKYS EARIIFQDFE ASNWTWDPVA GQYFWHRFYH HQPDLNFENP EVHKALLGVL DFWLGMGVDG LRLDAVPYLY EEEGSNCENL PRTYEYLRAL RSYVDEHYPN RMLLAEANQW PEDSAAYFGT GDLCHMNFHF PLMPRMYMAL ATEDRFPILD ILEQTPEIPE ICQWASFLRN HDELTLEMVT DEERDYMRRV YANDPKARIN LGIRRRLAPL MANDRRKIEL MNIMLLSLPG TPVLYYGDEI GMGDNFYLGD RDGVRTPMQW NSDRNAGFSR ANPQRLQLPV IIDPEYHYEA VNVEVQESNV NSLLWWMRHT ISTAHRYKSL SRGTIEFLQV SNPKVLIFIR QFEDETMLSV INLSRNAQAV MIDLSRFDGY IPEEVFSMNR FPKIRKIPYM VALGSYGYFW LQLVKDTENT DGRPYLDKPF ATPSRWQSLF TGKSRERLET EILPQYFKVS RWFGGKVRHI MRISIVDTIP VAGMAKAKLL VTEVRYSSGE NERYQLPVCF SPLSLVSLQD DNFYKRVIAR VVIGDEEGYL CDATFDGRFL NHLYQLVTGK ESWQGKEGTV SGMKSMKMDA VAEEGMEHEP LLMGVEQSNT SIRFNNDLCL KLYRRIEIGI SPEVEMCRAL SEHTSFKNLP GYLGSLNYEQ SRTNGYSLGI LQHFVKNEGD AWQLSLSQVK RYYDDILAKI NSGMVLPALP KLSGDPVQLP EIMHELIGEA YLGMLEKLAE RTAEMHLSLA SLDSDPAFAP EAFTTLYQRS IYQAMCEQVK RAVLLIREIM HQMAPEQQQL ASLFVQRQKQ ILQQFDPIRI EKIDTLKIRI HGDFHLGQVL FTGKDFVIID FEGEPARPLS ERKIKRSVFR DISGMLRSFD YAAFNVLQQD NTLFRPEERL ALEPWADRWS FYAGQYFLDS YFAKTGGSNI VPADPKQREH LMRAYLMNKA VYELNYELNN RPDWASIPLR GIMKILES
|
| |