Gene GM21_3480 details

Gene Information

Locus tag: GM21_3480
Symbol:
ID: 8138852
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4026172
End bp: 4027191
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 67%
IMG OID: 644871100
Product: Threonine aldolase
Protein accession: YP_003023260
Protein GI: 253702071
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2008] Threonine aldolase
TIGRFAM ID:
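
The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. Neither assumption is stated in the record itself.

```python
# Values copied from the Gene Information table.
start_bp = 4026172
end_bp = 4027191
gene_length_bp = 1020
protein_length_aa = 339

# Assumption: Start/End are 1-based, inclusive coordinates on the replicon.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted as a residue.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates match the stated length
print(gene_length_bp % 3 == 0)                   # whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop equals protein length
```

Under these assumptions all three checks agree with the table: 4027191 - 4026172 + 1 = 1020 bp, and 1020 / 3 - 1 = 339 aa.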


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.000000277913
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGACGG TAGATCTGAG AAGCGACACG GTGACATCGC CGTCGCAGGC AATGCGCCGG 
GAAATGGCGA ATGCCCCGGT CGGAGACGAC GTCTACGGGG AGGACCCGAC GGTGAACCGG
TTGGAGTCCA TGGCGGCGAA GTTGCTGGGG AAGGAGGCGG CGCTCTTCGT CCCCACTGGG
ACCATGGGGA ACCTGATCGC CCTTTTGTCG CACTGCGGCC GCGGCGACGA ATACATCGCG
GGGCAGGAAG CGCACATCTA CCGGTGGGAG GGAGGAGGCG GGGCCGTCTT CGGCGGGATC
CAGCCGCAGC CGGTCGAATT CGAAGAGAAC GGAACGCTCG ACCTCGACAA GGTGCGGCGC
GTCGTGAAGC CGGCGGATTA CCATCACCCC GTCACCAGAC TCCTCTGCCT TGAGAACACG
CAAGGGGGGA AGGTGTTGCC GCTCGACTAT CTGGCAAAGG CTGCGGAGAC GGCCCAAGGT
CTCGGCCTTT CCCTGCATCT CGACGGCGCC CGGGTCTTCA ACGCGGCCGT GTACCTGGGG
GTACCCGTCG CCACCATCGC CGCCCATTTC GACTCGGTCT CGGTCTGCCT CTCCAAGGGG
CTCGGCGCCC CGGCCGGCAC GGTACTTTGC GCCAGCCGCG AGCTCATCGG CCGCGCGCGC
CGCTGGCGCA AGGTGGCCGG CGGCGGCATG CGCCAGGCCG GCATCTTGGC CGCGGCAGGC
ATTTACGCTC TGGAGAACAA CGTAGAGCGG CTCGCCGAGG ACCACGAGAA CGCGGAACTC
CTTTCCGCCG GGCTTGGCCA CATCGAGGAA CTCCTGGTGA GCCAGGCCCG CACCAACATC
CTCTTCGTCA CCCCCCCGGC CGGTAGCGCC GACCGGCTGC GCAAGACTCT CGCCGCCGAG
GGGATACTCC TTGGAGGAGG CGACCAGATA CGCCTTGTCA CCCACCTGGA CGTAACCAGC
GCCGACGTCG AGCGCACCGT CGCCGCCTTC AAACGCTTCT TTGCGGTACG GGGCAACTGA
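
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before it can be analysed. The following is a minimal sketch of that clean-up plus a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used here, so the result is not expected to match the whole-gene GC content figure given in the table.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for the 10-base block layout."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence only; paste all rows to analyse the full gene.
first_row = "ATGAAGACGG TAGATCTGAG AAGCGACACG GTGACATCGC CGTCGCAGGC AATGCGCCGG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```

Pasting all rows of the gene sequence into `first_row` would let the computed fraction be compared with the GC content reported in the table.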
 
Protein sequence
MKTVDLRSDT VTSPSQAMRR EMANAPVGDD VYGEDPTVNR LESMAAKLLG KEAALFVPTG 
TMGNLIALLS HCGRGDEYIA GQEAHIYRWE GGGGAVFGGI QPQPVEFEEN GTLDLDKVRR
VVKPADYHHP VTRLLCLENT QGGKVLPLDY LAKAAETAQG LGLSLHLDGA RVFNAAVYLG
VPVATIAAHF DSVSVCLSKG LGAPAGTVLC ASRELIGRAR RWRKVAGGGM RQAGILAAAG
IYALENNVER LAEDHENAEL LSAGLGHIEE LLVSQARTNI LFVTPPAGSA DRLRKTLAAE
GILLGGGDQI RLVTHLDVTS ADVERTVAAF KRFFAVRGN
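
The protein sequence can be checked against the gene sequence by translating codons in frame from the first base. The sketch below builds a codon table from the conventional TCAG ordering. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in their permitted start codons). The function name `translate` is illustrative, and only the first 30 bases and the last 9 bases of the gene are used as examples.

```python
from itertools import product

# Amino acids listed in the conventional TCAG codon order.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGAAGACGG TAGATCTGAG AAGCGACACG"))  # MKTVDLRSDT
# Last nine bases of the gene sequence, ending in the TGA stop codon.
print(translate("GGCAACTGA"))  # GN
```

The first output matches the first ten residues of the protein sequence, and the second matches its final two residues.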