Gene GM21_2146 details

Gene Information

Locus tag: GM21_2146
Symbol: aksA
ID: 8137482
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2504905
End bp: 2506053
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 64%
IMG OID: 644869761
Product: trans-homoaconitate synthase
Protein accession: YP_003021956
Protein GI: 253700767
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR02660] homocitrate synthase NifV
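
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the last codon of the gene is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2504905
end_bp = 2506053

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# One codon is three bases; the final codon is assumed to be the stop
# codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1149 0 382
```

Under those assumptions the result agrees with the reported Gene Length (1149 bp) and Protein Length (382 aa).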


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 82
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGGAT TTCCCAAGGC CCAGGTTTTC ATCGCAGACA CCACGCTGCG AGACGGCGAA 
CAGACGGCGG GGGTGGTCTT CACAGCGAAG GAAAAAATCT CCATCGCGAG GCAGTTGGAC
GCCATGGGGG TCCACGAACT GGAATGCGGG ATTCCCGCCA TGGGCGAGGA GGAGCGCGAC
TCGATCCGGG CACTTGTGGC GTTGGGCCTT TCGGCCCGGC TCGTCACCTG GAACCGCGCG
CTGGTGTCGG ACATCGAGGC GAGCATCGCC TGCGGCATCA AGGCCGTGGA CATCTCGCTC
TGCGTCTCGG ACATCATGAT CGAACACAAG ATCAATAAGA GCAGGGCATT TGTGAAGGAA
CAGCTGAAGC GGGCGCTATG CTTCGCCAAG GATAAGGGGC TCTACGTCTG CGTCGGGGGC
GAGGACGCCA GCCGCGCCGA CGGCGATTTC CTGATCGAGC TGATGCAGAT CGCCCAGGCA
AACGGCGCCG AGCGCTTCCG GTTCTGCGAC ACGCTCGGCA TCCTCGACCC CTTTGCCATG
TTCGAAAAGG TGGGGCGCCT GAGAGCCGCG GTCCCCGGTC TCGACATCGA GGTGCACACC
CACAACGACC TCGGGCTTGC CACGGCGAAC GCCCTGGCAG GGGTGAGGGG AGGGGCTTCC
TACATCAGCA CCACGGTCAA CGGCCTCGGC GAGCGGGCGG GGAACGCCGC GCTGGAAGAG
GTGGTCATGG CGCTGAAGGT CGCCTGCGGC ATCGATGCCG GCATCGACAC CAGGCGTTTT
AAGTCGGTGT CCCGGCTGGT GGGACGCGCC TCCAACCGCG AGGTCCCCCC CTGGAAGGCC
GTCGTGGGAG AGAGGGTCTT CTCGCACGAA TCCGGGCTGC ATGCGGACGG CGTTCTGAAG
GACCCGAGGA ACTACGAGGG GTTCACCCCT GAGGAAGTGG GGCTCAAAAG GCATATCGTC
GCGGGGAAAC ATTCCGGGAC CAACGGGATC GTGGAAAGCT ACCGTCAGAT CGGCATCCCC
ATTTCCAGGG AGGAGGCGCA GGAGCTGATG GACAAGGTGA GGAGCACGGC TCAGCGCATC
AAGGGCGCGC TGGCCCCGGT GGACCTGCTC AAACTGCACC AGGGGAGAGG GGTTTCGCTG
GCTGCTTAG
 
Protein sequence
MAGFPKAQVF IADTTLRDGE QTAGVVFTAK EKISIARQLD AMGVHELECG IPAMGEEERD 
SIRALVALGL SARLVTWNRA LVSDIEASIA CGIKAVDISL CVSDIMIEHK INKSRAFVKE
QLKRALCFAK DKGLYVCVGG EDASRADGDF LIELMQIAQA NGAERFRFCD TLGILDPFAM
FEKVGRLRAA VPGLDIEVHT HNDLGLATAN ALAGVRGGAS YISTTVNGLG ERAGNAALEE
VVMALKVACG IDAGIDTRRF KSVSRLVGRA SNREVPPWKA VVGERVFSHE SGLHADGVLK
DPRNYEGFTP EEVGLKRHIV AGKHSGTNGI VESYRQIGIP ISREEAQELM DKVRSTAQRI
KGALAPVDLL KLHQGRGVSL AA
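
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA codon by codon. The sketch below is illustrative only: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and it uses the amino acid assignments of the standard genetic code, which translation table 11 shares. It does not handle the alternative start codons that table 11 permits; this gene starts with ATG, so that does not matter here.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-base grouping."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_block = "ATGGCTGGAT TTCCCAAGGC CCAGGTTTTC ATCGCAGACA CCACGCTGCG AGACGGCGAA"
last_block = "GCTGCTTAG"

print(translate(first_block))  # MAGFPKAQVFIADTTLRDGE
print(translate(last_block))   # AA
```

The two printed strings match the first 20 and last 2 residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content reported in the record.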