Gene Information |
Locus tag | Cthe_2874 |
Symbol | |
ID | 4809154 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3395284 |
End bp | 3397101 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640108293 |
Product | phosphoenolpyruvate carboxykinase |
Protein accession | YP_001039265 |
Protein GI | 125975355 |
COG category | [C] Energy production and conversion |
COG ID | [COG1274] Phosphoenolpyruvate carboxykinase (GTP) |
TIGRFAM ID | |
|
|
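The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Cthe_2874.
length = gene_length(3395284, 3397101)
print(length)                  # 1818
print(protein_length(length))  # 605
```

Under these assumptions the computed values agree with the Gene Length (1818 bp) and Protein Length (605 aa) fields.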
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00158332 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATCAA CAAACATGAC AAAAAACAAA AAACTGCTGG ATTGGGTTAA GGAAATGGCT GAAATGTGTC AGCCTGATGA AATTTATTGG TGCGATGGTT CGGAGGAAGA AAATGAGCGC TTGATAAAGT TGATGGTGGA TTCAGGTTTG GCTACGCCTT TGAATCCTGA AAAGCGACCT GGATGTTATC TCTTCCGCAG CGATCCGTCC GACGTTGCCC GTGTTGAGGA CAGAACTTTT ATTGCATCCA AAACCAAAGA AGATGCAGGA CCTACAAACA ACTGGATAGA TCCGGTTGAG CTCAAGGCAA CTATGAAAGA GTTGTACAAG GGTTGTATGA AGGGAAGAAC AATGTATGTT ATTCCTTTCT CCATGGGACC TATCGGTTCA CCCATTTCAA AAATCGGCGT TGAATTGACC GACAGCCCTT ATGTTGTTGT TAACATGCGC ATTATGACTC GCATAGGCAA GGCTGTGTTG GATCAGCTCG GAGAAGACGG AGATTTTGTA CCTTGTCTCC ACTCAGTCGG TGCTCCGCTC AAAGAGGGAG AAAAGGATAA AGGTTGGCCA TGCGCACCAA TCGAAAAGAA ATACATAAGC CACTTCCCGG AAGAAAGGAC TATATGGTCA TATGGTTCCG GATACGGTGG AAATGCGCTT TTAGGAAAGA AATGCTTTGC ACTTCGTATT GCATCTGTTA TGGCACGTGA CGAAGGTTGG CTTGCTGAAC ACATGCTTAT CCTTCGCATA ACAGACCCTG AAGGAAACAA GACATATGTT ACAGGTGCTT TCCCAAGCGC ATGCGGAAAG ACGAACCTGG CTATGCTTAT TCCTACAATT CCCGGATGGA AAGTTGAAAC AATCGGTGAC GATATTGCAT GGATGAGATT TGGAAAAGAC GGCCGTTTGT ATGCTATCAA CCCTGAAGCA GGATTCTTTG GTGTTGCTCC GGGTACATCC ATGGATTCAA ATCCGAACGC AATGCATACA ATTAAGAAAA ATACTATATT TACAAACGTT GCATTGACTG ATGACGGCGA TGTTTGGTGG GAAGGCATCG GAACTGAACC GCCGGCTCAT CTCATAGACT GGCAGGGTAA AGACTGGACT CCTGATTCCG GAACTTTGGC AGCACATCCC AACGGACGTT TTACAGCACC TGCAAGTCAG TGCCCTGTAA TTGCTCCTGA ATGGGAGGAT CCGGAAGGTG TGCCGATTTC AGCAATCCTT ATCGGTGGAC GCCGTCCGAA CACCATTCCG CTTGTTCATG AAAGCTTTGA CTGGAACCAT GGTGTATTCA TGGGTTCAAT CATGGGTTCT GAAATTACGG CTGCCGCAAT TTCAAACAAA ATCGGACAGG TACGCCGTGA CCCGTTTGCT ATGCTGCCTT TCATAGGCTA CAACGTAAAT GACTATTTGC AGCACTGGTT GAACATGGGT ACCAAGACTG ACCCAAGCAA GCTTCCCAAG ATATTCTATG TAAACTGGTT CCGCAAGGAC AGCAACGGTA AATGGTTGTG GCCTGGATAC GGTGAAAACA GCCGTGTTCT CAAGTGGATT GTTGAAAGAG TCAACGGAAA AGGTAAAGCA GTAAAGACAC CTATAGGATA TATGCCTACA GTTGACGCTA TCGACACAAC CGGCCTTGAT GTAAGCAAAG AGGATATGGA AGAACTCTTG AGCGTTAACA AAGAACAGTG GCTCCAGGAA GTTGAGTCAA TAAAAGAACA TTATAAGTCA TACGGAGAAA AACTGCCGAA AGAATTGTGG GCACAATTGG AGGCTCTTGA ACAACGTTTG 
AAAGAGTATA ACGGTTAA
|
Protein sequence | MTSTNMTKNK KLLDWVKEMA EMCQPDEIYW CDGSEEENER LIKLMVDSGL ATPLNPEKRP GCYLFRSDPS DVARVEDRTF IASKTKEDAG PTNNWIDPVE LKATMKELYK GCMKGRTMYV IPFSMGPIGS PISKIGVELT DSPYVVVNMR IMTRIGKAVL DQLGEDGDFV PCLHSVGAPL KEGEKDKGWP CAPIEKKYIS HFPEERTIWS YGSGYGGNAL LGKKCFALRI ASVMARDEGW LAEHMLILRI TDPEGNKTYV TGAFPSACGK TNLAMLIPTI PGWKVETIGD DIAWMRFGKD GRLYAINPEA GFFGVAPGTS MDSNPNAMHT IKKNTIFTNV ALTDDGDVWW EGIGTEPPAH LIDWQGKDWT PDSGTLAAHP NGRFTAPASQ CPVIAPEWED PEGVPISAIL IGGRRPNTIP LVHESFDWNH GVFMGSIMGS EITAAAISNK IGQVRRDPFA MLPFIGYNVN DYLQHWLNMG TKTDPSKLPK IFYVNWFRKD SNGKWLWPGY GENSRVLKWI VERVNGKGKA VKTPIGYMPT VDAIDTTGLD VSKEDMEELL SVNKEQWLQE VESIKEHYKS YGEKLPKELW AQLEALEQRL KEYNG
|
| |
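The gene and protein sequences above can be spot-checked against each other by translating codons. The following is a minimal sketch rather than the database's own method. It builds the codon table from the compact amino-acid string used for the standard code, on the assumption (true for NCBI translation table 11 as far as codon-to-residue assignments go) that table 11 differs from the standard table only in its permitted start codons. The helper `translate` is a hypothetical name, and only the first and last blocks of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... paired with the amino acids
# of the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 18 bases of the gene sequence in the record.
first_block = "ATGACATCAA CAAACATGAC AAAAAACAAA"
last_block = "AAAGAGTATA ACGGTTAA"

print(translate(first_block))  # MTSTNMTKNK
print(translate(last_block))   # KEYNG
```

Both results match the start (MTSTNMTKNK) and end (KEYNG) of the listed protein sequence. The last block is in frame because the 1800 bases preceding it are a multiple of three.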