Gene Cthe_2874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2874 
Symbol 
ID4809154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3395284 
End bp3397101 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content45% 
IMG OID640108293 
Productphosphoenolpyruvate carboxykinase 
Protein accessionYP_001039265 
Protein GI125975355 
COG category[C] Energy production and conversion 
COG ID[COG1274] Phosphoenolpyruvate carboxykinase (GTP) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00158332 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACATCAA CAAACATGAC AAAAAACAAA AAACTGCTGG ATTGGGTTAA GGAAATGGCT 
GAAATGTGTC AGCCTGATGA AATTTATTGG TGCGATGGTT CGGAGGAAGA AAATGAGCGC
TTGATAAAGT TGATGGTGGA TTCAGGTTTG GCTACGCCTT TGAATCCTGA AAAGCGACCT
GGATGTTATC TCTTCCGCAG CGATCCGTCC GACGTTGCCC GTGTTGAGGA CAGAACTTTT
ATTGCATCCA AAACCAAAGA AGATGCAGGA CCTACAAACA ACTGGATAGA TCCGGTTGAG
CTCAAGGCAA CTATGAAAGA GTTGTACAAG GGTTGTATGA AGGGAAGAAC AATGTATGTT
ATTCCTTTCT CCATGGGACC TATCGGTTCA CCCATTTCAA AAATCGGCGT TGAATTGACC
GACAGCCCTT ATGTTGTTGT TAACATGCGC ATTATGACTC GCATAGGCAA GGCTGTGTTG
GATCAGCTCG GAGAAGACGG AGATTTTGTA CCTTGTCTCC ACTCAGTCGG TGCTCCGCTC
AAAGAGGGAG AAAAGGATAA AGGTTGGCCA TGCGCACCAA TCGAAAAGAA ATACATAAGC
CACTTCCCGG AAGAAAGGAC TATATGGTCA TATGGTTCCG GATACGGTGG AAATGCGCTT
TTAGGAAAGA AATGCTTTGC ACTTCGTATT GCATCTGTTA TGGCACGTGA CGAAGGTTGG
CTTGCTGAAC ACATGCTTAT CCTTCGCATA ACAGACCCTG AAGGAAACAA GACATATGTT
ACAGGTGCTT TCCCAAGCGC ATGCGGAAAG ACGAACCTGG CTATGCTTAT TCCTACAATT
CCCGGATGGA AAGTTGAAAC AATCGGTGAC GATATTGCAT GGATGAGATT TGGAAAAGAC
GGCCGTTTGT ATGCTATCAA CCCTGAAGCA GGATTCTTTG GTGTTGCTCC GGGTACATCC
ATGGATTCAA ATCCGAACGC AATGCATACA ATTAAGAAAA ATACTATATT TACAAACGTT
GCATTGACTG ATGACGGCGA TGTTTGGTGG GAAGGCATCG GAACTGAACC GCCGGCTCAT
CTCATAGACT GGCAGGGTAA AGACTGGACT CCTGATTCCG GAACTTTGGC AGCACATCCC
AACGGACGTT TTACAGCACC TGCAAGTCAG TGCCCTGTAA TTGCTCCTGA ATGGGAGGAT
CCGGAAGGTG TGCCGATTTC AGCAATCCTT ATCGGTGGAC GCCGTCCGAA CACCATTCCG
CTTGTTCATG AAAGCTTTGA CTGGAACCAT GGTGTATTCA TGGGTTCAAT CATGGGTTCT
GAAATTACGG CTGCCGCAAT TTCAAACAAA ATCGGACAGG TACGCCGTGA CCCGTTTGCT
ATGCTGCCTT TCATAGGCTA CAACGTAAAT GACTATTTGC AGCACTGGTT GAACATGGGT
ACCAAGACTG ACCCAAGCAA GCTTCCCAAG ATATTCTATG TAAACTGGTT CCGCAAGGAC
AGCAACGGTA AATGGTTGTG GCCTGGATAC GGTGAAAACA GCCGTGTTCT CAAGTGGATT
GTTGAAAGAG TCAACGGAAA AGGTAAAGCA GTAAAGACAC CTATAGGATA TATGCCTACA
GTTGACGCTA TCGACACAAC CGGCCTTGAT GTAAGCAAAG AGGATATGGA AGAACTCTTG
AGCGTTAACA AAGAACAGTG GCTCCAGGAA GTTGAGTCAA TAAAAGAACA TTATAAGTCA
TACGGAGAAA AACTGCCGAA AGAATTGTGG GCACAATTGG AGGCTCTTGA ACAACGTTTG
AAAGAGTATA ACGGTTAA
 
Protein sequence
MTSTNMTKNK KLLDWVKEMA EMCQPDEIYW CDGSEEENER LIKLMVDSGL ATPLNPEKRP 
GCYLFRSDPS DVARVEDRTF IASKTKEDAG PTNNWIDPVE LKATMKELYK GCMKGRTMYV
IPFSMGPIGS PISKIGVELT DSPYVVVNMR IMTRIGKAVL DQLGEDGDFV PCLHSVGAPL
KEGEKDKGWP CAPIEKKYIS HFPEERTIWS YGSGYGGNAL LGKKCFALRI ASVMARDEGW
LAEHMLILRI TDPEGNKTYV TGAFPSACGK TNLAMLIPTI PGWKVETIGD DIAWMRFGKD
GRLYAINPEA GFFGVAPGTS MDSNPNAMHT IKKNTIFTNV ALTDDGDVWW EGIGTEPPAH
LIDWQGKDWT PDSGTLAAHP NGRFTAPASQ CPVIAPEWED PEGVPISAIL IGGRRPNTIP
LVHESFDWNH GVFMGSIMGS EITAAAISNK IGQVRRDPFA MLPFIGYNVN DYLQHWLNMG
TKTDPSKLPK IFYVNWFRKD SNGKWLWPGY GENSRVLKWI VERVNGKGKA VKTPIGYMPT
VDAIDTTGLD VSKEDMEELL SVNKEQWLQE VESIKEHYKS YGEKLPKELW AQLEALEQRL
KEYNG