Gene Cthe_3157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3157 
Symbol 
ID4809607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3729405 
End bp3730784 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content44% 
IMG OID640108590 
Productpyruvate carboxyltransferase 
Protein accessionYP_001039545 
Protein GI125975635 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000266472 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAGAGT TTAACAAGAA AACCAACACA TTAGAACAGG TTCAATATAA ATACACGCTC 
CAGGATGTTT CAGAACCAAA CCTGTACAGG GATATTTTCA GTTATGATGA AATACCCAAG
TGCACATTCA ATCACAGAAA AGTGCCTATG GCACCTCCGG ACGAGATATG GATAACGGAT
ACCACATTCA GGGACGGTCA GCAGTCAAGG GCACCTTACA CTGTGGAACA GATTGTGCAT
CTTTACGATC TTTTACATAA ACTGGGCGGA CCTAAAGGTA TTATCAGACA GTGTGAGTTT
TTCCTTTACA GTGACAGGGA CAAACAGGCT GTCTACAAAT GTTTGGAGAG AGGATACAAA
TATCCTGAGG TTACAAGCTG GATAAGGGCG ACAAAGAGCG ATTTCCAGCT GGCAAAGGAC
ATGGGAATGA AGGAAAGCGG TATTCTTGTA AGCTGTTCCG ACTATCATAT ATTTAAGAAG
CTCAACATGA CGAGAAAACA GGCGCTGGAG CACTATATGA GCATTGTAAA GAGCGCCATA
GAAGTGGGAA TAAGACCAAG ATGCCATTTT GAGGATATTA CAAGGGCTGA TTTTTACGGC
TTTGTTGTGC CTTTTGCCAT AGAACTTAGA AAACTCATGG AAGAAAGCGG AGTGCCCATC
AAGATTCGCG CATGTGACAC CCTGGGCTAT GGAGTTTCAT ATCCGGGTGC CGCACTGCCA
AGAAGTGTTC CGGGAATAAT CTATGGACTC AGACACTATG CAGGTTTCCC GAGCGAGCTT
ATAGAATGGC ACGGTCACAA TGACTTCTAC AAAGCTGTAT GCAATGCGGC AACTGCATGG
CTCTACGGTG CTTCTGCCGT AAACTGCTCG CTTCTCGGCA TAGGGGAAAG AACAGGAAAC
ACGCCTCTTG AAGCAATGGT TATTGAGTAT GCTCAGCTCA GGGGAACTAC GGACAGTATG
GATACAACCG TAATAACCGA GATTGCGGAA TACTATGAAA AAGAACTGGG TTATCAGATA
CCTCCGAGAA CTCCTTTCGT CGGAAAGCAC TTTAACGTCA CCCAGGCGGG AATACATGCC
GACGGGCTTT TAAAAGATGA AGAAATATAC AATATATTTG ATACGGCAAA ACTTTTAAAC
AGGCCTGTAG GTGTTGCAAT TAACCAGACT TCCGGTCTTG CCGGAATTGC TCATTGGATA
AACAGCCACT TTGGACTTGA AGGAGCCAAA AGAATTGACA AAAGGGATGA AAGAATAGTA
AAAATTAAAG AGTGGGTTGA TGAACAGTAT AAAGCCGGAC GTGTAACTTC AATAGGTGAT
GACGAGCTTG AAGAGGTTAT AAGAAAACTG GCGCCGGAAA TATTTGATCT GGCACTTTAA
 
Protein sequence
MIEFNKKTNT LEQVQYKYTL QDVSEPNLYR DIFSYDEIPK CTFNHRKVPM APPDEIWITD 
TTFRDGQQSR APYTVEQIVH LYDLLHKLGG PKGIIRQCEF FLYSDRDKQA VYKCLERGYK
YPEVTSWIRA TKSDFQLAKD MGMKESGILV SCSDYHIFKK LNMTRKQALE HYMSIVKSAI
EVGIRPRCHF EDITRADFYG FVVPFAIELR KLMEESGVPI KIRACDTLGY GVSYPGAALP
RSVPGIIYGL RHYAGFPSEL IEWHGHNDFY KAVCNAATAW LYGASAVNCS LLGIGERTGN
TPLEAMVIEY AQLRGTTDSM DTTVITEIAE YYEKELGYQI PPRTPFVGKH FNVTQAGIHA
DGLLKDEEIY NIFDTAKLLN RPVGVAINQT SGLAGIAHWI NSHFGLEGAK RIDKRDERIV
KIKEWVDEQY KAGRVTSIGD DELEEVIRKL APEIFDLAL