Gene Information |
Locus tag | GM21_3018 |
Symbol | |
ID | 8138364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3505119 |
End bp | 3506705 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644870619 |
Product | putative alpha-isopropylmalate/homocitrate synthase family transferase |
Protein accession | YP_003022805 |
Protein GI | 253701616 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein |
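
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the reported gene length includes the stop codon (the gene sequence in this record ends in TGA). Both assumptions are consistent with the listed values but are not stated by the record itself. The function and constant names are illustrative only.

```python
# Values copied from the record; all names here are hypothetical.
START_BP = 3505119
END_BP = 3506705
GENE_LENGTH_BP = 1587
PROTEIN_LENGTH_AA = 528


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: one per codon, minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the computed values against the fields reported in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Because the gene lies on the minus strand, the listed gene sequence would be the reverse complement of the replicon between these coordinates. The length check does not depend on strand.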
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 107 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTGG TGAAACTGTA CGATACGACG CTTAGGGACG GAACGCAGGC AGAAGACATC TCTTTCCTGG TGGAGGACAA GATCCGCATC GCCCACAAAC TGGACGAGAC CGGCATCGCT TACATAGAAG GAGGGTGGCC CGGCAGCAAC CCCAAGGACG TCGCCTTCTT CAAGGACATC AAGAAAGAGA AGCTCTCCCA GGCGAAGATC GCGGCTTTCG GCTCCACCAG GCGCGCCAAG ATCACCCCGG ACAAGGATCA GAACATCCGC ACCCTGGTGC AGTCCGAGGC GGACGCGGTC ACCATCTTCG GCAAGAGCTG GGACTTCCAG GTCCACGAGG CGCTCAGGAT CCCGCTCGAG GAGAACCTGG AGCTGATCTT CGACTCGCTG GAGTACCTGA AGGCGCGCAT GCCCGAGGTG TTCTACGACG CCGAGCACTT CTTCGACGGC TACAAGGCCA ATCCTGAGTA CGCCATCAAG ACGCTGCTGG CGGCACAGCA GGCGGGTGCG GACTGCATCA TCCTTTGCGA CACCAACGGC GGCACCATGC CCTTCGAGAT CGCCAGCATC GTGGCCGAGG TGCAAAAGGC GGTCTCCACC CCGCTCGGCA TCCACACCCA CAACGACGGC GAGTGCGCCG TCGCCAACTC GATAGTCGCG GTGCAAAGCG GCATCGTCCA GGTGCAGGGG ACCATCAACG GGTTCGGCGA GCGCTGCGGC AACGCGAACC TCTGCTCCGT CATCCCGGCC CTCAAGGTGA AGATGAATAT GGGGTGCGTG AGCGACCAGC AGATGCGCCA GTTGCGCGAC CTCTCCCGCT ACGTCTACGA ACTGGCGAAC CTGGCGCCCA ACAAGCACCA GGCCTACGTC GGGAACTCCG CCTTCGCCCA CAAGGGTGGG GTGCACGTCT CCGCCATCCA GCGCCACCCC GAAACCTACG AGCACATGAG GCCGGAGTTG GTGGGAAACA GCACCCGCGT CCTCGTCTCC GACCTCTCCG GTCGCGCCAA CATCCTCGCC AAGGCGACCG AATTCAACAT CAACCTGGAC AGCAAGGATC CGGTGACCCT GGAGATCCTG GAAGACATCA AGGCGATGGA GAACCGCGGC TACCAGTTCG AGGGGGCGGA GGCGTCATTC GAGCTCCTGA TGAAGCGCGC GCTCGGCACG CACCGCAAGT TCTTCTCCGT GATCGGCTTC AGGGTCATCG ACGAGAAGCG CCATGAGGAC GAGCAGCCGA TCTCCGAGGC CACCATCAAG GTGAAGGTGG GGGGGAAGAT CGAGCACACG GCGGCGGAAG GGTCCGGCCC TGTCAACGCG CTCGACAACG CGCTCAGGAA GGCGCTGGAG AAGTTCTATC CCAAGCTTCG GGACGTGAAG CTGCACGACT ACAAGGTAAG GGTGCTCCCG GCAGGGCAGG GGACGGCCTC CTCGATCCGG GTGTTGATCG AGTCCGGCGA CAAGGAAGGG CGCTGGGGGA CCGTCGGTGT CTCCTCCAAC GTCATCGAGG CCTCCTACCA GGCGCTGGTC GACGCCATAG AATTCAAGCT CCACAAGGAA GAGGAGGCGG CGGCGCCGAA ACAGTGA
|
Protein sequence | MSLVKLYDTT LRDGTQAEDI SFLVEDKIRI AHKLDETGIA YIEGGWPGSN PKDVAFFKDI KKEKLSQAKI AAFGSTRRAK ITPDKDQNIR TLVQSEADAV TIFGKSWDFQ VHEALRIPLE ENLELIFDSL EYLKARMPEV FYDAEHFFDG YKANPEYAIK TLLAAQQAGA DCIILCDTNG GTMPFEIASI VAEVQKAVST PLGIHTHNDG ECAVANSIVA VQSGIVQVQG TINGFGERCG NANLCSVIPA LKVKMNMGCV SDQQMRQLRD LSRYVYELAN LAPNKHQAYV GNSAFAHKGG VHVSAIQRHP ETYEHMRPEL VGNSTRVLVS DLSGRANILA KATEFNINLD SKDPVTLEIL EDIKAMENRG YQFEGAEASF ELLMKRALGT HRKFFSVIGF RVIDEKRHED EQPISEATIK VKVGGKIEHT AAEGSGPVNA LDNALRKALE KFYPKLRDVK LHDYKVRVLP AGQGTASSIR VLIESGDKEG RWGTVGVSSN VIEASYQALV DAIEFKLHKE EEAAAPKQ