Gene GM21_2289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2289 
Symbol 
ID8137629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2669190 
End bp2670251 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content61% 
IMG OID644869904 
ProductThreonine aldolase 
Protein accessionYP_003022096 
Protein GI253700907 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2008] Threonine aldolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.000000548444 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCAAGA AAAAGCAAAG CGGACACAAC GAGCTGAAGC ACCATTTCGC CAGTGACAAC 
TATGCGGGGA TCTGCAACGA AGCCTGGGCA GCGATGGCGG AGGCCAACCG CGGCATGGCC
AGCTCCTATG GCGACGATTA CTGGACCGCG GAGGCCTGCG AAAAGATCCG CGAACTGTTC
GAGACCGACT GCGAGGTTTT CTTCGTCTTC AACGGCACCG CAGCCAACTC GCTCGCCCTC
GCCTCCTTGT GCCAGTCCTA CCACTCCATA GTCTGCCACG AGATGGCGCA CATAGAGACG
GACGAGTGCG GCGCCTCCGA GTTCTTCTCA AACGGCACCA AGGTGCTGCT GGTACCGGGA
GAAAACGGGA AAATCGACCT CGATGCGGTG GAGCACACCA TCCACAAGCG CAGCGACATC
CACTACCCGA AGCCGAAAGC GCTCAGCATC ACCCAGGCCA CCGAACTGGG GACGCTTTAC
AGCTTGCAGG AGTTGCAGGC GATTGGGGAA CTGGCCAAGA AACACTCGCT GCGGGTGCAT
ATGGACGGCG CGCGCTTCGC GAACGCGGTA GCATCCCTGA ACGTCGCCCC GAAAGAGATC
AGCTGGCAGG CCGGGGTCGA CGTCCTCACC TTCGGGGGAA CCAAGAACGG TTTTGCCGTG
GGCGAGGCGG TGGTCTTCTT CAACAAGGAA CTCGCCTTCG AGTTCGACTA CCGCTGCAAG
CAGGCGGGAC AGCTCGCCTC CAAGATGCGC TTTCTCACCG CCCCCTGGAT CGGCATGCTG
GAGAGCGGCG CCTGGCTTAA AAACGCCGCC CACGCCAACA ACTGCGCGAG GCTTTTGGAA
AGCGAGATCA AGAAGATACC GCAGGTGCGG ATCATGTTCC CGAGCCAGGC CAACTCCGTG
TTTCTGGAAA TGGCGCCCGA TGCGCTGGAG GCGCTGCGAG CTCGCGGCTG GCACTTCTAC
ACCTTCATAG GATCCGGCGG AGCCAGGTTC ATGTGCTCCT GGGACACCGA TACAGCCGAA
GTAGCCAACC TGGTAGCCGA CATCAAGGCA TCGGTGGCAT AA
 
Protein sequence
MSKKKQSGHN ELKHHFASDN YAGICNEAWA AMAEANRGMA SSYGDDYWTA EACEKIRELF 
ETDCEVFFVF NGTAANSLAL ASLCQSYHSI VCHEMAHIET DECGASEFFS NGTKVLLVPG
ENGKIDLDAV EHTIHKRSDI HYPKPKALSI TQATELGTLY SLQELQAIGE LAKKHSLRVH
MDGARFANAV ASLNVAPKEI SWQAGVDVLT FGGTKNGFAV GEAVVFFNKE LAFEFDYRCK
QAGQLASKMR FLTAPWIGML ESGAWLKNAA HANNCARLLE SEIKKIPQVR IMFPSQANSV
FLEMAPDALE ALRARGWHFY TFIGSGGARF MCSWDTDTAE VANLVADIKA SVA