Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2112 |
Symbol | |
ID | 8137448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2455485 |
End bp | 2457149 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644869727 |
Product | PfaD family protein |
Protein accession | YP_003021922 |
Protein GI | 253700733 |
COG category | [R] General function prediction only |
COG ID | [COG2070] Dioxygenases related to 2-nitropropane dioxygenase |
TIGRFAM ID | [TIGR02814] PfaD family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 101 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGATCCAT TTTCACTACA GGGTGAACAT ACCGCTCGCT CTGCAAACCT GGAGAACCTG GGATCGTGGC ACCCCGCCTC GAACGCCCCC CCTCAGAAGG CCGCCAACCT GAGAGACGCC CTTCGCTACG TACGCCAGCC GCTGTATCTC GTGGAAAAGG AAGGGACCAT GGTCCCGAGG CTGGGAGGCG TCGGCCGCCT CGGCGCCGTC AACCCGGGCG CGCTGCCGAT CGCAGCCTAT GCCCCTGCCT GCTTTCCCGA AAACCTGGGG GATCCTTCTT TTTGCCGCGA ACTCGGCATC CGCTACCCCT ACGTCGGCGG TTCCATGGCC AAGGGGATCA GCTCCGCGGC CATGGCCGAG GAGCTGGGTC GCGCCGGGAT GCTCGGCTTC TTCGGCGCCG CCGGCCTTCC GCTCGCCACC GTCTCGGAAA CCGCCGACCG CCTCAAGGCT TCCCTGGGCG ACATCCCCTA CGGCTTCAAC CTGATCCACT CTCCGCACGA GCCCGAGCTG GAGAGCGAGC TCGCCGAGCT GTACATCAGG AAAGGGATCC GCATCATCGA GGCCTCGGCG TTCCTGGCCC TGACGCTGCC GCTGGTCCGG TACCGCCTGC ACGGCATCAA GCGCGCCGCC GACGGGACCA TCGTGACCCC CAACCGCATC ATCGCCAAGG TCTCCCGCGA GGAACTGGCG ACTAAGTTCT TCGCACCGGC TCCCGAGAAG CTCCTGCGCG CTCTGGTCGC CAACGGCTCC ATCACCGCCG AGCAGGCCGA ACTGGCCGCG CTGGTCCCGC TGGCGCAGGA CGTGACGGCC GAGGCGGACT CCGGCGGCCA TACCGACAAC CGACCCGCCC TCGCCCTCTT CCCGACCATC AACGCGCTGG CGGCGAAGCT CAAGCGCCAG TACGGCTACA GCTGCCGCCT GCGGGTAGGG CTTGGCGGCG GCGTCTCGAC GCCGGCCTCC GCCGCAGCCG CCTTCTCCAT GGGGGCCGCC TACCTCGTGA CCGGGTCGGT GAACCAGGCC TGCGTGGAAT CCGGCACCTC CGACACCGTG CGCGGCATGC TCGCCGGCAC CCGCCAGGCC GACGTGACCA TGGCACCCGC CGCCGACATG TTCGAGATGG GGGTCACCGT GCAGGTCTTG AAGCGCGGCA CCATGTTCCC CATGCGCGCA CAAAAGCTCT ACGAGATCTA CCGCGCCTGC AAAAGCCTCG ACGACATCCC CGCCGCCGAG CGCGAGAAGC TGGAGAAGAC CATGTTCCAG GCGCCGCTCG CCGACATCTG GCGCGACACC CGCGCCTTCT TCTTAAAGCG CGACCCCTCC CAGGTCGAGC GCGCCGAGCG CGACCCGAAG CACCTGATGG CGTTGGTCTT CCGCTGGTAC CTCGGCATGG CCGCGCACTG GGCGAAAGAC GGCCTCGAGC CCCGGCGCAT GGACTACCAG GTCTGGTGCG GCCCCGCCAT GGGAGCCTTC AACGAATGGG CCTCCGGCTC CTTCCTTGAT ACCCCGGGCA ATCGCACGGT CGAAGCCGTG GCCCTCAACA TCCTGCACGG AGCGGCCGCA CTTAACCGCG CCAACTTCCT GAGCAGCCAG GGCATCGAAC TCAGGATGGA TGAAATCGCA CCGCAACCTC TCGAAATCGC ACAAATCAAG GAGTACCTTT GTTGA
|
Protein sequence | MDPFSLQGEH TARSANLENL GSWHPASNAP PQKAANLRDA LRYVRQPLYL VEKEGTMVPR LGGVGRLGAV NPGALPIAAY APACFPENLG DPSFCRELGI RYPYVGGSMA KGISSAAMAE ELGRAGMLGF FGAAGLPLAT VSETADRLKA SLGDIPYGFN LIHSPHEPEL ESELAELYIR KGIRIIEASA FLALTLPLVR YRLHGIKRAA DGTIVTPNRI IAKVSREELA TKFFAPAPEK LLRALVANGS ITAEQAELAA LVPLAQDVTA EADSGGHTDN RPALALFPTI NALAAKLKRQ YGYSCRLRVG LGGGVSTPAS AAAAFSMGAA YLVTGSVNQA CVESGTSDTV RGMLAGTRQA DVTMAPAADM FEMGVTVQVL KRGTMFPMRA QKLYEIYRAC KSLDDIPAAE REKLEKTMFQ APLADIWRDT RAFFLKRDPS QVERAERDPK HLMALVFRWY LGMAAHWAKD GLEPRRMDYQ VWCGPAMGAF NEWASGSFLD TPGNRTVEAV ALNILHGAAA LNRANFLSSQ GIELRMDEIA PQPLEIAQIK EYLC
|
| |