Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2050 |
Symbol | |
ID | 8137386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2373324 |
End bp | 2376128 |
Gene Length | 2805 bp |
Protein Length | 934 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644869665 |
Product | aconitate hydratase |
Protein accession | YP_003021860 |
Protein GI | 253700671 |
COG category | [C] Energy production and conversion |
COG ID | [COG1048] Aconitase A |
TIGRFAM ID | [TIGR01341] aconitate hydratase 1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 1.5045300000000002e-23 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGCGGCA CGACATCCAA CAGCTACAAC ACCCTGAGCA CGCTCACAGC GGGAGGCAAG AGCTACCGCT ACCACTCGTT GCAGGCGTTC GAACAAAATT CCGGCTTGGA CATCTCCAGG CTTCCATATT CCCTCAAGAT CCTCCTGGAG AACCTGCTGA GACGGGAGGA CGGGGTGGTG GTCAAAAGGG AGGACATCGA GGCCGTGGCG CGCTGGGACG CCGCCGCCGA ACCGGACAAG GAGCTGCAGT TCATGCCCGC CCGCATACTG CTCCAGGATT TTACCGGGGT GCCCGCCGTG GCGGATCTGG CGGCGATGCG ATCGGCCCTG GCGCGGCTAG GGGGAAGCCC TTCCCACATC AACCCGATGC AGCCCGCCGA CCTGATCATC GACCACTCGG TGCAGGTGGA CCTCTACGGC ACCACGGGCG CCCTCTGGGG GAACTCCTCC ATCGAGTTCG AGCGCAACCA CGAGCGCTAC CAGTTCCTGC GCTGGGGGCA GTCCGCCTTC AGGAACTTCA GCGTGGTCCC GCCCGCCACC GGCATCTGCC ACCAGGTAAA CCTCGAGTAC CTGTCGCAGG TGGCCATGGT GGCGAAAACC GGGGATGAGG ATTGGGTCTT TCCCGACACG CTCGTCGGGA CCGATTCCCA CACCACCATG ATCAACGGCC TGGGCGTCGT CGGCTGGGGA GTAGGGGGGA TCGAGGCGGA AGCGGCCATG CTGGGCCAGC CCTGCTCCAT GCTCATTCCC CAGGTGGTCG GCTTTCGCCT GACCGGCGCG CTCGCCCCAG GCGCCACGGC GACGGATCTC GTGCTTACCG TGACCCAGAT GCTGCGCAAG AAGGGTGTGG TCGGCAAGTT CGTGGAATTC TTCGGCCCCG GCGCCGCCTC CCTCACCATC GCCGACCGGG CCACCATCGG CAACATGGCG CCGGAATATG GCGCAACCAT CGGGGTCTTC CCCGTCGACG GGCAGACTAC CGAATACCTG CGCCTGACCG GCAGGGGCGA CATCGTCCCG CTGGTGGAGG CGTATTACCA GCAGCAGAGG CTCTGGCACG ATGTGAACCA GCCGGAGCCC TTATTCAGCG ACATCCTGGA ATTGGACTTG GCCCAGGTCG AGCCGTCCCT CGCCGGGCCG ATGCGCCCCA TGGACCGGGT CAACCTGAAG GAGGTACGGG CGTCGTTCCG CAAACAGCTG ATCACCCTGA GGTCCCACGA CGCCGCCCGC GTCGACAAGG AAACCATGAG CCGGTGGTTA GGGGAGGGGG GAGCGCCGGT GACCGTCGCG CCCGAGCTGA TGGAGCACCC CGACAAGTCG CTGGGAGCTT TAGGGCACTG CGTCCCGGTG CGCCAGCCCG ACGGCACCGC CTACAATCTC TGCCATGGCT CAGTGGTCAT CGCCGCCATC ACCAGTTGCA CCAATACCTC CAACCCCTCG GTGATGATTG GGGCGGGTCT CCTGGCGCGC AACGCGGTGC GGCGCGGCTT GCAGGTGCGG CCTTGGGTGA AGACGAGCCT CGCCCCCGGC TCCAAGGTGG TCACCGACTA CTTGACCGCC GCAGGGCTCA CGCCTTACCT TGAAGCGCTG CGCTTCCACC TGGTAGGCTA CGGCTGCACC ACCTGCATCG GCAACAGCGG CCCCTTGGCG GAGCACATAT CCGGCGCCAT CGTCAAGGGG GATCTCGCCG TCGCAGCCGT TCTCTCCGGC AATCGCAACT TCGAGGGGCG CATAAACAGC CATGTCCGGG CAAACTACCT CGCCTCGCCG CCGCTCGTCG TGGCCTATGC GCTGGCGGGA AACGTCAGCA TCGACCTCAC CCTGGACGCC ATCGGCAGCG ACCCCAATGG CAACCCGGTC TATTTGAAAG ACATCTGGCC TAACGAGCAG GAAGTCGCGC AGCTGGTGCA AAGCTGCGTC CATGCCGAAT CCTTCGCCCG CAACTACGCC GACGTCTTCC ACGGAGACGA GCAGTGGATG GCTCTGCAGG TGCCGACCGG CGAGCTGTAC CAGTGGCAGG AGGGTTCCTC CTACATCAAG GAGCCCCCCT TCTTCGCCGA TCTTGCAAGG GAACCGCAGC CTGTCCGGGA CGTGAAGGAG GCGCGGGTCC TGGCCCTTTT GGGGGATTCC ATCACCACCG ACCACATCTC TCCGGCAGGC TCGATCGGGA AAGAGTCTCC GGCCGGGCGC TATCTCATCT CGCTGGGGAT ACCCCCAAAG GAGTTCAACT CCTATGGCGC GCGCCGCGGC AACCACGAGG TAATGGTGCG GGGGACCTTT GCCAACACCC GCATCAGGAA CAAGTTGGTC CCGGGAGTGG AAGGTGGCCT GACCCAGTGC TTTGTGGGGG AGGACGCCGG AGGCGGCCGG ATGTCCATTT ACGACGCCGC CGAGAAGTAC CGTGCAGCCG GCGTCCCGCT GGTCGTGATC GCCGGCAAGG AGTACGGCAC CGGTTCCTCC CGCGACTGGG CCGCCAAAGG GACCAAGCTC CTCGGGGTGC GGGCCGTCAT CGCCGAGAGT TTCGAGAGGA TCCACCGTTC CAACCTGGTT GGGATGGGCG TTCTTCCCCT GCAGTTTCTT CCCGGCGAGA ACCCTGCCAC GCTCGGGCTC GACGGCACGG AAAACTTCGA CCTGGAAGGG CTGGCCGAGC TCGCGCCGGG GCAAAAACTT AAGGTCAACT ACCGGAAGTC CGACGGCGCT GCCGGAACGT TCACGGTACA GGTGCGGATC GACACCACCA ACGAGCTCGA CTACTACCGC CACGGCGGCA TCCTCCCCTT CGTGCTACGG CAGTTCTTGA AGTAA
|
Protein sequence | MSGTTSNSYN TLSTLTAGGK SYRYHSLQAF EQNSGLDISR LPYSLKILLE NLLRREDGVV VKREDIEAVA RWDAAAEPDK ELQFMPARIL LQDFTGVPAV ADLAAMRSAL ARLGGSPSHI NPMQPADLII DHSVQVDLYG TTGALWGNSS IEFERNHERY QFLRWGQSAF RNFSVVPPAT GICHQVNLEY LSQVAMVAKT GDEDWVFPDT LVGTDSHTTM INGLGVVGWG VGGIEAEAAM LGQPCSMLIP QVVGFRLTGA LAPGATATDL VLTVTQMLRK KGVVGKFVEF FGPGAASLTI ADRATIGNMA PEYGATIGVF PVDGQTTEYL RLTGRGDIVP LVEAYYQQQR LWHDVNQPEP LFSDILELDL AQVEPSLAGP MRPMDRVNLK EVRASFRKQL ITLRSHDAAR VDKETMSRWL GEGGAPVTVA PELMEHPDKS LGALGHCVPV RQPDGTAYNL CHGSVVIAAI TSCTNTSNPS VMIGAGLLAR NAVRRGLQVR PWVKTSLAPG SKVVTDYLTA AGLTPYLEAL RFHLVGYGCT TCIGNSGPLA EHISGAIVKG DLAVAAVLSG NRNFEGRINS HVRANYLASP PLVVAYALAG NVSIDLTLDA IGSDPNGNPV YLKDIWPNEQ EVAQLVQSCV HAESFARNYA DVFHGDEQWM ALQVPTGELY QWQEGSSYIK EPPFFADLAR EPQPVRDVKE ARVLALLGDS ITTDHISPAG SIGKESPAGR YLISLGIPPK EFNSYGARRG NHEVMVRGTF ANTRIRNKLV PGVEGGLTQC FVGEDAGGGR MSIYDAAEKY RAAGVPLVVI AGKEYGTGSS RDWAAKGTKL LGVRAVIAES FERIHRSNLV GMGVLPLQFL PGENPATLGL DGTENFDLEG LAELAPGQKL KVNYRKSDGA AGTFTVQVRI DTTNELDYYR HGGILPFVLR QFLK
|
| |