Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2578 |
Symbol | |
ID | 4809185 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3050535 |
End bp | 3052055 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640107992 |
Product | Ppx/GppA phosphatase |
Protein accession | YP_001038971 |
Protein GI | 125975061 |
COG category | [F] Nucleotide transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0248] Exopolyphosphatase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAAAA AAATAGGAAT AATTGACCTT GGCTCAAACT CCGTAAGGCT CGTTATATTC GAAATATTGC CCAGTGGTAC TTTCAGACTC GTCGATGACA TAAAAGAGAC CATAAGACTG AGCGAAAACA TGATTGACGG AAGGCTACTA AATGATATTG CCATGCACAA AACCGTAAAA ACCGTCAAAC TATACAGGAA ATTGTGCAAT TCATACGGCA TACCCTGCAA TCGCATCATA GGTGTTGCCA CTGCAGCGGT AAGAAAGGCG GAAAACAGGG ATCAGTTTTT AAAGCTTTTA ACCGGTGCCA CAGACATTAA TTTCAGACTG CTTTCCGGTG AGGAAGAGGC TTTTTACGCT TTCAAGGCGG CAACTTACTC CCTTAACATC CATGAGGGAA TTATTGTTGA TATTGGCGGC GGAAGTACGG AAATAATTTT TTTTAAAGAC AACAGTATGA TTAATTCCAT ATCAATTCCT GTAGGTGCAG TTGTTGCAAC GGAAAACTTT ATCGGCAAAG ATGTTATAAA TCAGGAAAAT CTCTCCCGTC TGGAAGACAG TATCTGCGAG ATGCTAAAAG ACCAGGACTG GCTCCGGCAG GAAAAAAACA AGACACTGAT AGGTTTGGGA GGAACAATCC GAAACCTCGG AAAAATCCAC AGAAACCGTA TTGAATATCC CATAAATTAT ACTCACAATT ATGAGATACC TATTGAAGAT TTTAACAGCA TATATAATGA CCTTGCAAAT ATGGATCTAA AGTCCCGCAA AAAAGTAAAA GGTCTTTCCT CCAAACGGGC CGACATAATT GTAGGCGGGC TTGCAATCCT TAAAGCCATT ATATCGGTTT GTTCTCCGTC CCGCATTTTA ACAAGCGGGT TTGGCCTTAG GGAAGGTATT CTTTTTGACT ATATTTCCAA AACCAATCCC AGGGCAAAAT TTACAGACGC CTTGAGCTTT AGCCTCAAAA GCTTCATGGA GCTTTACGGC GTAAGAAAGA ATCACGCAAA ACATGTCTGT TTTCTTGCTC TCTCCTTATT CGATCAGCTA AAATACCTGC ATAACTACGG AGAGGACGCA AGAAAGCTTC TGGAAGTGGC ATCGCTTCTG CATGACATCG GTATTTCCAT AAGCTACTAC GAACACCACC GACATTCATT TTACATAATC TTAAACTCCA GACTTGCCGG ATTTACCCAC AGGGAAACTT TGCTGGCAGC CGCCATAGCT GCATCCCATT CCGATGAGGA TTTCAAGGAG GACTGGCAAA CGCGCTATGA AAAAATTCTT CTGCCGGGAG ATATAGAGCT GTGGAAAAAG CTTTCGGTGT TTTTGAAACT GGCAGAATGT CTTGACAGAA GCGAAATGTC TGTGATAAAG GCTTTGGAAT GTCAAATCCT TGGTGACACG GTCAAGATCC GGACAATAAG AGAAGGGGAT GCCGAACTTG AAATAAGTCT TGCCAACGAA CACAGCAATA CTTTTCGCAA GGTCTTTGGA AAATTTCTTG TCGTAACATA A
|
Protein sequence | MSKKIGIIDL GSNSVRLVIF EILPSGTFRL VDDIKETIRL SENMIDGRLL NDIAMHKTVK TVKLYRKLCN SYGIPCNRII GVATAAVRKA ENRDQFLKLL TGATDINFRL LSGEEEAFYA FKAATYSLNI HEGIIVDIGG GSTEIIFFKD NSMINSISIP VGAVVATENF IGKDVINQEN LSRLEDSICE MLKDQDWLRQ EKNKTLIGLG GTIRNLGKIH RNRIEYPINY THNYEIPIED FNSIYNDLAN MDLKSRKKVK GLSSKRADII VGGLAILKAI ISVCSPSRIL TSGFGLREGI LFDYISKTNP RAKFTDALSF SLKSFMELYG VRKNHAKHVC FLALSLFDQL KYLHNYGEDA RKLLEVASLL HDIGISISYY EHHRHSFYII LNSRLAGFTH RETLLAAAIA ASHSDEDFKE DWQTRYEKIL LPGDIELWKK LSVFLKLAEC LDRSEMSVIK ALECQILGDT VKIRTIREGD AELEISLANE HSNTFRKVFG KFLVVT
|
| |