Gene GM21_3018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3018 
Symbol 
ID8138364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3505119 
End bp3506705 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content63% 
IMG OID644870619 
Productputative alpha-isopropylmalate/homocitrate synthase family transferase 
Protein accessionYP_003022805 
Protein GI253701616 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones107 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTGG TGAAACTGTA CGATACGACG CTTAGGGACG GAACGCAGGC AGAAGACATC 
TCTTTCCTGG TGGAGGACAA GATCCGCATC GCCCACAAAC TGGACGAGAC CGGCATCGCT
TACATAGAAG GAGGGTGGCC CGGCAGCAAC CCCAAGGACG TCGCCTTCTT CAAGGACATC
AAGAAAGAGA AGCTCTCCCA GGCGAAGATC GCGGCTTTCG GCTCCACCAG GCGCGCCAAG
ATCACCCCGG ACAAGGATCA GAACATCCGC ACCCTGGTGC AGTCCGAGGC GGACGCGGTC
ACCATCTTCG GCAAGAGCTG GGACTTCCAG GTCCACGAGG CGCTCAGGAT CCCGCTCGAG
GAGAACCTGG AGCTGATCTT CGACTCGCTG GAGTACCTGA AGGCGCGCAT GCCCGAGGTG
TTCTACGACG CCGAGCACTT CTTCGACGGC TACAAGGCCA ATCCTGAGTA CGCCATCAAG
ACGCTGCTGG CGGCACAGCA GGCGGGTGCG GACTGCATCA TCCTTTGCGA CACCAACGGC
GGCACCATGC CCTTCGAGAT CGCCAGCATC GTGGCCGAGG TGCAAAAGGC GGTCTCCACC
CCGCTCGGCA TCCACACCCA CAACGACGGC GAGTGCGCCG TCGCCAACTC GATAGTCGCG
GTGCAAAGCG GCATCGTCCA GGTGCAGGGG ACCATCAACG GGTTCGGCGA GCGCTGCGGC
AACGCGAACC TCTGCTCCGT CATCCCGGCC CTCAAGGTGA AGATGAATAT GGGGTGCGTG
AGCGACCAGC AGATGCGCCA GTTGCGCGAC CTCTCCCGCT ACGTCTACGA ACTGGCGAAC
CTGGCGCCCA ACAAGCACCA GGCCTACGTC GGGAACTCCG CCTTCGCCCA CAAGGGTGGG
GTGCACGTCT CCGCCATCCA GCGCCACCCC GAAACCTACG AGCACATGAG GCCGGAGTTG
GTGGGAAACA GCACCCGCGT CCTCGTCTCC GACCTCTCCG GTCGCGCCAA CATCCTCGCC
AAGGCGACCG AATTCAACAT CAACCTGGAC AGCAAGGATC CGGTGACCCT GGAGATCCTG
GAAGACATCA AGGCGATGGA GAACCGCGGC TACCAGTTCG AGGGGGCGGA GGCGTCATTC
GAGCTCCTGA TGAAGCGCGC GCTCGGCACG CACCGCAAGT TCTTCTCCGT GATCGGCTTC
AGGGTCATCG ACGAGAAGCG CCATGAGGAC GAGCAGCCGA TCTCCGAGGC CACCATCAAG
GTGAAGGTGG GGGGGAAGAT CGAGCACACG GCGGCGGAAG GGTCCGGCCC TGTCAACGCG
CTCGACAACG CGCTCAGGAA GGCGCTGGAG AAGTTCTATC CCAAGCTTCG GGACGTGAAG
CTGCACGACT ACAAGGTAAG GGTGCTCCCG GCAGGGCAGG GGACGGCCTC CTCGATCCGG
GTGTTGATCG AGTCCGGCGA CAAGGAAGGG CGCTGGGGGA CCGTCGGTGT CTCCTCCAAC
GTCATCGAGG CCTCCTACCA GGCGCTGGTC GACGCCATAG AATTCAAGCT CCACAAGGAA
GAGGAGGCGG CGGCGCCGAA ACAGTGA
 
Protein sequence
MSLVKLYDTT LRDGTQAEDI SFLVEDKIRI AHKLDETGIA YIEGGWPGSN PKDVAFFKDI 
KKEKLSQAKI AAFGSTRRAK ITPDKDQNIR TLVQSEADAV TIFGKSWDFQ VHEALRIPLE
ENLELIFDSL EYLKARMPEV FYDAEHFFDG YKANPEYAIK TLLAAQQAGA DCIILCDTNG
GTMPFEIASI VAEVQKAVST PLGIHTHNDG ECAVANSIVA VQSGIVQVQG TINGFGERCG
NANLCSVIPA LKVKMNMGCV SDQQMRQLRD LSRYVYELAN LAPNKHQAYV GNSAFAHKGG
VHVSAIQRHP ETYEHMRPEL VGNSTRVLVS DLSGRANILA KATEFNINLD SKDPVTLEIL
EDIKAMENRG YQFEGAEASF ELLMKRALGT HRKFFSVIGF RVIDEKRHED EQPISEATIK
VKVGGKIEHT AAEGSGPVNA LDNALRKALE KFYPKLRDVK LHDYKVRVLP AGQGTASSIR
VLIESGDKEG RWGTVGVSSN VIEASYQALV DAIEFKLHKE EEAAAPKQ