Gene GM21_2112 details

Gene Information

Locus tag: GM21_2112
Symbol:
ID: 8137448
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2455485
End bp: 2457149
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 67%
IMG OID: 644869727
Product: PfaD family protein
Protein accession: YP_003021922
Protein GI: 253700733
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID: [TIGR02814] PfaD family protein
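
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


if __name__ == "__main__":
    length = gene_length_bp(2455485, 2457149)
    print(length, protein_length_aa(length))  # prints: 1665 554
```

Under these assumptions, 2457149 - 2455485 + 1 gives 1665 bp, and 1665 / 3 - 1 gives 554 aa, matching the reported values.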


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 101
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGATCCAT TTTCACTACA GGGTGAACAT ACCGCTCGCT CTGCAAACCT GGAGAACCTG 
GGATCGTGGC ACCCCGCCTC GAACGCCCCC CCTCAGAAGG CCGCCAACCT GAGAGACGCC
CTTCGCTACG TACGCCAGCC GCTGTATCTC GTGGAAAAGG AAGGGACCAT GGTCCCGAGG
CTGGGAGGCG TCGGCCGCCT CGGCGCCGTC AACCCGGGCG CGCTGCCGAT CGCAGCCTAT
GCCCCTGCCT GCTTTCCCGA AAACCTGGGG GATCCTTCTT TTTGCCGCGA ACTCGGCATC
CGCTACCCCT ACGTCGGCGG TTCCATGGCC AAGGGGATCA GCTCCGCGGC CATGGCCGAG
GAGCTGGGTC GCGCCGGGAT GCTCGGCTTC TTCGGCGCCG CCGGCCTTCC GCTCGCCACC
GTCTCGGAAA CCGCCGACCG CCTCAAGGCT TCCCTGGGCG ACATCCCCTA CGGCTTCAAC
CTGATCCACT CTCCGCACGA GCCCGAGCTG GAGAGCGAGC TCGCCGAGCT GTACATCAGG
AAAGGGATCC GCATCATCGA GGCCTCGGCG TTCCTGGCCC TGACGCTGCC GCTGGTCCGG
TACCGCCTGC ACGGCATCAA GCGCGCCGCC GACGGGACCA TCGTGACCCC CAACCGCATC
ATCGCCAAGG TCTCCCGCGA GGAACTGGCG ACTAAGTTCT TCGCACCGGC TCCCGAGAAG
CTCCTGCGCG CTCTGGTCGC CAACGGCTCC ATCACCGCCG AGCAGGCCGA ACTGGCCGCG
CTGGTCCCGC TGGCGCAGGA CGTGACGGCC GAGGCGGACT CCGGCGGCCA TACCGACAAC
CGACCCGCCC TCGCCCTCTT CCCGACCATC AACGCGCTGG CGGCGAAGCT CAAGCGCCAG
TACGGCTACA GCTGCCGCCT GCGGGTAGGG CTTGGCGGCG GCGTCTCGAC GCCGGCCTCC
GCCGCAGCCG CCTTCTCCAT GGGGGCCGCC TACCTCGTGA CCGGGTCGGT GAACCAGGCC
TGCGTGGAAT CCGGCACCTC CGACACCGTG CGCGGCATGC TCGCCGGCAC CCGCCAGGCC
GACGTGACCA TGGCACCCGC CGCCGACATG TTCGAGATGG GGGTCACCGT GCAGGTCTTG
AAGCGCGGCA CCATGTTCCC CATGCGCGCA CAAAAGCTCT ACGAGATCTA CCGCGCCTGC
AAAAGCCTCG ACGACATCCC CGCCGCCGAG CGCGAGAAGC TGGAGAAGAC CATGTTCCAG
GCGCCGCTCG CCGACATCTG GCGCGACACC CGCGCCTTCT TCTTAAAGCG CGACCCCTCC
CAGGTCGAGC GCGCCGAGCG CGACCCGAAG CACCTGATGG CGTTGGTCTT CCGCTGGTAC
CTCGGCATGG CCGCGCACTG GGCGAAAGAC GGCCTCGAGC CCCGGCGCAT GGACTACCAG
GTCTGGTGCG GCCCCGCCAT GGGAGCCTTC AACGAATGGG CCTCCGGCTC CTTCCTTGAT
ACCCCGGGCA ATCGCACGGT CGAAGCCGTG GCCCTCAACA TCCTGCACGG AGCGGCCGCA
CTTAACCGCG CCAACTTCCT GAGCAGCCAG GGCATCGAAC TCAGGATGGA TGAAATCGCA
CCGCAACCTC TCGAAATCGC ACAAATCAAG GAGTACCTTT GTTGA
 
Protein sequence
MDPFSLQGEH TARSANLENL GSWHPASNAP PQKAANLRDA LRYVRQPLYL VEKEGTMVPR 
LGGVGRLGAV NPGALPIAAY APACFPENLG DPSFCRELGI RYPYVGGSMA KGISSAAMAE
ELGRAGMLGF FGAAGLPLAT VSETADRLKA SLGDIPYGFN LIHSPHEPEL ESELAELYIR
KGIRIIEASA FLALTLPLVR YRLHGIKRAA DGTIVTPNRI IAKVSREELA TKFFAPAPEK
LLRALVANGS ITAEQAELAA LVPLAQDVTA EADSGGHTDN RPALALFPTI NALAAKLKRQ
YGYSCRLRVG LGGGVSTPAS AAAAFSMGAA YLVTGSVNQA CVESGTSDTV RGMLAGTRQA
DVTMAPAADM FEMGVTVQVL KRGTMFPMRA QKLYEIYRAC KSLDDIPAAE REKLEKTMFQ
APLADIWRDT RAFFLKRDPS QVERAERDPK HLMALVFRWY LGMAAHWAKD GLEPRRMDYQ
VWCGPAMGAF NEWASGSFLD TPGNRTVEAV ALNILHGAAA LNRANFLSSQ GIELRMDEIA
PQPLEIAQIK EYLC
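
The gene sequence begins with TTG rather than ATG, yet the protein begins with M. This is expected under translation table 11, where TTG is one of the permitted initiation codons and an initiating codon is read as methionine. The following is a minimal sketch of that translation, not the pipeline that produced this record. The helper name `translate_cds` is hypothetical, and the start-codon set is taken from the NCBI definition of table 11.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; table 11 uses the same
# codon-to-residue assignments as the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate_cds("TTGGATCCAT TTTCACTACA GGGTGAACAT"))  # MDPFSLQGEH
```

Applied to the first 30 bases, the sketch reproduces the first ten residues of the protein sequence, MDPFSLQGEH. It raises a KeyError on ambiguous bases such as N, which a more careful implementation would need to handle.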