Gene Information |
Locus tag | GM21_3479 |
Symbol | pgi |
ID | 8138851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 4024563 |
End bp | 4026155 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644871099 |
Product | glucose-6-phosphate isomerase |
Protein accession | YP_003023259 |
Protein GI | 253702070 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0166] Glucose-6-phosphate isomerase |
TIGRFAM ID | |
|
|
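The coordinates and lengths in the record above agree with each other if Start bp and End bp are read as 1-based, inclusive positions and the CDS is taken to end with a single stop codon. A minimal sketch of that check follows. The function names are illustrative and not part of the source database, and both conventions are assumptions.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for locus tag GM21_3479.
length = gene_length_bp(4024563, 4026155)
print(length)                     # 1593
print(protein_length_aa(length))  # 530
```

Both results match the Gene Length (1593 bp) and Protein Length (530 aa) fields.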
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.0000274515 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGAAAA AGCAGCTTTG GGAGCGGTAC AAGAACCTGC TCTACCATGA CGCCGAGCTA GAGCTGAGCG TCGACACCAG CCGTATCGAC TTCCCGGAGG GGTTCCTGGA GGAGATGGAA CCGCGACTGC AGCAGGCGTA CCAGGAGATG GAAGCGCTGG AAAAAGGAGC CGTCGCCAAC CCGGACGAAA ACCGGATGGT GGGGCACTAC TGGCTCAGGG CTCCCGAACA GGCACCGGAG GCGGCCCTTT CCGAGGAGAT AACGTCGACG CTGGCCGCGA TCAAGGCGTT TGCCTCCTCA GTACACGGGG GGAATATCGC CGCCCCCGAC GGCCACCGCT TCACCGATCT TTTGATCATC GGCATCGGAG GCTCCGCGTT GGGGCCCCAG TTCGTCGCCG ACAGCCTGGG AGGGCCCGGG GACCGCTTGC GGATCTGGTT CTTCGACAAC ACCGACCCGG ACGGCATGGA CAAGGTCCTC TCCAGGATCG GCACAGCTTT GAAACAGACC CTGGTGGTCG TCATATCCAA GAGCGGCGGC ACCAAAGAGA CCCGCAACGG GATGCTGGAG GCGCGCCGGG CGTTCGAGCG CGCCGGGCTG CATTTCGCCG CGCACGCCGT CGCCGTAACC GGCAGCGGCA GCGAACTGGA CGGGACCGCC TCCTGGGAAA GCTGGCTCGG GGTCTTCCCG ATGTGGGACT GGGTCGGCGG GCGCACCTCG GTCACGTCGG CGGTGGGGCT TCTCCCCGCC GCCCTGCAAG GGATCGACGT GGAGCGGTTG CTGGCCGGAG CGCGGGCCTG CGACCAAAAA ACCCGCTCCA GGGTGACACG GGAAAACCCC GCCGCCCTCC TGGCCCTCTC CTGGTTCCAT GCCACCCGTG GCAAAGGCGC CCGCGACATG GTGCTGCTCC CCTACAAGGA CAGGCTGCTT TTGTTTTCCC GCTACCTGCA GCAGCTCATC ATGGAGTCCC TCGGCAAGGA GTTGGACCGC GACGGAAACA GGGTGCTGCA GGGGATCGCC GTCTACGGCA ACAAGGGCTC CACCGATCAG CACGCCTACG TGCAGCAGCT GCGGGAAGGG GTGCACAACT TCTTCGTCAC CTTCATCGAG GTGCTCAAGG ACCGGGAGGG CCCGTCGCTG GAAGTGGAGC CGGGTGCGAC CTCCGGCGAT TATCTCTCGG GCTTTTTCCA GGGGACGCGA GCAGCCCTCT ATGAAAAGGG ACGGGAATCG GTTACTATCA CCGTCCGCGA GCTTTCCCCG GTGAGCATCG GGGCACTCAT CGCCCTTTAC GAGCGCGCGG TTGGGCTGTA CGCCTCGCTC GTCAACGTGA ACGCCTACCA CCAGCCGGGA GTCGAGGCGG GGAAGAAGGC CGCAGGCGCC GTGCTCAAGC TGCAGGGGGA GATCGTCGAG ATGCTTAGGC GTCAGCCCAA CCGCGAATTC ACCGGCGAGG AGATGGCACT GGCCCTCGCG CGCCCGGAAG AGGTGGAAAC CGTCTTCATG ATCTTGAGGC ACCTCGCCGC CAACGGCGAC CACGGGGTGA GCGTGACCGT AAAGGACAAG ATCTGGGAAA ACAAGTACCG CAGCAAAGGT TAA
|
Protein sequence | MQKKQLWERY KNLLYHDAEL ELSVDTSRID FPEGFLEEME PRLQQAYQEM EALEKGAVAN PDENRMVGHY WLRAPEQAPE AALSEEITST LAAIKAFASS VHGGNIAAPD GHRFTDLLII GIGGSALGPQ FVADSLGGPG DRLRIWFFDN TDPDGMDKVL SRIGTALKQT LVVVISKSGG TKETRNGMLE ARRAFERAGL HFAAHAVAVT GSGSELDGTA SWESWLGVFP MWDWVGGRTS VTSAVGLLPA ALQGIDVERL LAGARACDQK TRSRVTRENP AALLALSWFH ATRGKGARDM VLLPYKDRLL LFSRYLQQLI MESLGKELDR DGNRVLQGIA VYGNKGSTDQ HAYVQQLREG VHNFFVTFIE VLKDREGPSL EVEPGATSGD YLSGFFQGTR AALYEKGRES VTITVRELSP VSIGALIALY ERAVGLYASL VNVNAYHQPG VEAGKKAAGA VLKLQGEIVE MLRRQPNREF TGEEMALALA RPEEVETVFM ILRHLAANGD HGVSVTVKDK IWENKYRSKG
|
| |
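The gene and protein sequences above are shown in blocks of ten characters separated by spaces. The sketch below shows one way to strip that spacing, compute GC content, and translate the gene sequence. It assumes that the sense-codon assignments of translation table 11 are the same as those of the standard code, and it does not handle alternative start codons (this gene begins with ATG, so none is needed). All names are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Sense-codon assignments
# are the same for translation table 11 and the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace used for ten-character display blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence from the record.
print(translate("ATGCAGAAAA AGCAGCTTTG GGAGCGGTAC"))  # MQKKQLWERY
```

The ten residues printed match the first block of the protein sequence in the record. Passing the full gene sequence to `gc_percent` and `translate` gives values that can be compared with the GC content and Protein sequence fields.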