Gene Athe_2226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2226 
Symbol 
ID7407645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2359000 
End bp2360358 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content40% 
IMG OID643716592 
Productpyruvate carboxyltransferase 
Protein accessionYP_002574071 
Protein GI222530189 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGTAA AATTTAATCA AAAGACTCAC ATTCTTGAGA TTGAGTATCA GTTCAGGGAT 
GTTGAAGAGC CAAATCTTTT CAGGAACATC TATCCTTACA ATGAGGTACC AAGGCTTGTT
TTCAACCACA GAATTGTGCC AATGAATGTT CCTGAGCGGC TTTACATCAC AGACACAACT
TTCAGAGACG GCCAGCAATC ACGCTCACCG TACACTGTTG ATCAAATTTG TAGAATTTAT
GACTACTTAC ATGAACTTGA CAATGGGAGT GGCGTTATTC TTCACACAGA GTTTTTTGTG
TACTCAAAAC AGGATAAAGA AGCAGTTTTA AAGTGTTTAG AAAAAGGTTA TGATTTTCCA
AAAGTCACCG CTTGGATTCG GGCAAAAAAA GAGGATTTTG AGATTGTAAA GAACCTTGGT
ATCAAGGAAA CAGGAATTTT GGTTTCATGT TCTGACTATC ATATCTTCAA AAAATTAAAG
ATGACAAGAA GTCAAGCAAT GAAGCAATAC TTAGAAATTG TGTCGGCAGC TTTAGAGGCA
GGAATCATAC CACGCTGCCA TTTTGAAGAC ATCACAAGAG CTGATTTTTA CGGATTTGTC
TTGCCGTTTA TTAATGAGCT TATGAAACTT TCAAAAGAGG CAAATATGCC TGTGAAAATC
AGAGCCTGTG ACACGCTGGG GCTTGGATCA CCAATACCGG GTGTGGCTCT CCCAAGAAGT
GTGCCTCAAA TAATCTATGG TATTGTCAAC TATGGTGAAG TGCCATCTGA GTGGCTTGAG
TGGCACGGTC ACAACGATTT TTACAAGGCA GTCATAAACT CAACAATGGC ATGGCTGTAC
GGTGCTTCAA TGGTAAACAC ATCACTTTTG GGAATAGGTG AGCGCACGGG TAACACCCCA
TTAGAGGCAA TGGTGATGGA ATACATTCAG ATAAGAGGCA GCGCAGATGG CATGAATGTT
GCTGTAATAT CTGAGATTGC TGAGTATTTC AAAAAAGAAA TCGGGTATGA AATTCCTCCA
ATGACGCCTT TTGTTGGTGA GAACTTTAAC GCAACACGGG CTGGAATTCA TGCAGATGGA
CTTATGAAAG ATGAGGAGAT TTACAATATA TTTGATACAG GAAAGATTCT GGGGAAGCCT
CCTAAGGTGA TTATAGATGC ATACTCAGGC ATTGCAGGAA TTGTAATCTG GATAAATAGG
TATTTCAAAG ATAGCGGCAT TGACATTCAG GTTGACAAAA AAGACCCAAG GGTTCAGAAG
GTCAAAGAGT GGGTAGACAG TCAATATGAA AATGGAAGAA ATACATCAAT AAGTGATGAA
GAGCTAAAAG AGGTTGTAAC CAGAATATTT GGACTGTAA
 
Protein sequence
MNVKFNQKTH ILEIEYQFRD VEEPNLFRNI YPYNEVPRLV FNHRIVPMNV PERLYITDTT 
FRDGQQSRSP YTVDQICRIY DYLHELDNGS GVILHTEFFV YSKQDKEAVL KCLEKGYDFP
KVTAWIRAKK EDFEIVKNLG IKETGILVSC SDYHIFKKLK MTRSQAMKQY LEIVSAALEA
GIIPRCHFED ITRADFYGFV LPFINELMKL SKEANMPVKI RACDTLGLGS PIPGVALPRS
VPQIIYGIVN YGEVPSEWLE WHGHNDFYKA VINSTMAWLY GASMVNTSLL GIGERTGNTP
LEAMVMEYIQ IRGSADGMNV AVISEIAEYF KKEIGYEIPP MTPFVGENFN ATRAGIHADG
LMKDEEIYNI FDTGKILGKP PKVIIDAYSG IAGIVIWINR YFKDSGIDIQ VDKKDPRVQK
VKEWVDSQYE NGRNTSISDE ELKEVVTRIF GL