Gene Gbem_2106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGbem_2106 
Symbol 
ID6782100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter bemidjiensis Bem 
KingdomBacteria 
Replicon accessionNC_011146 
Strand
Start bp2443629 
End bp2445293 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content67% 
IMG OID642768101 
ProductPfaD family protein 
Protein accessionYP_002138915 
Protein GI197118488 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID[TIGR02814] PfaD family protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGATCCAT TTTCACTACA GGGTGACCAT ACCGCTCGCT CTGCAAACCT GGAAAACCTG 
GGTTCGTGGC ACCCCGCCTC GAACGCCCCC CCTCAAAAGG CCGCCAACCT GAGAGACGCC
CTCCGCTACG TACGCCAGCC GCTGTATCTC GTGGAAAAGG AAAGGACCAT GGTCCCGAGG
CTGGGAGGGA TCGGCCGGCT CGGCGCCGTC AACCCGGGCG CGCTGCCTAT CGCCGCTTAC
GCCCCTCCCT GCTTTCCGGA AAACCTGGGG GATCCTTCTT TTTGCCGCGA ACTTGGCATC
CGCTACCCCT ACGTCGGCGG TTCCATGGCC AAGGGGATCA GTTCCGCGGC CATGGCCGAG
GAGTTGGGCC GCGCCGGGAT GCTCGGCTTC TTCGGCGCCG CCGGCCTTCC GCTTGCCACC
GTCTCCGAGA CCGCCGACCG CCTCAAGGCC TCCCTCGGCG ATATCCCCTA CGGTTTCAAC
CTGATCCACT CCCCGCACGA GCCCGAGTTG GAGCGCGAGC TCGCCGAGCT GTACATAAAG
AAGGGGATCC GCACAATCGA GGCCTCGGCC TTCCTGGCCC TGACGCTACC CTTGGTCAGG
TACCGGCTGC ACGGCATCAA GCGCGCCGCC GACGGGTCCA TCGTCACCCC CAACCGCATC
ATCGCCAAGG TCTCCCGCGA GGAACTGGCG GCGAAGTTCT TCGCACCGGC TCCCGAGAAG
CTCCTGCGCG CGCTGGTCGC CAACGGCTCC ATCACCGCCG AGCAGGCCGA ACTGGCCGCG
CTGGTACCGC TGGCGCAGGA CGTGACGGCC GAGGCTGATT CCGGCGGCCA TACCGACAAC
CGCCCCGCCC TCGCCCTCTT CCCGACCATC AACGCGCTGG CGGCGAAGCT GCAGCGGCAG
TACGGCTACA GCTGCCGCCT GCGGGTGGGG CTTGGCGGCG GAGTCTCGAC GCCGGCCTCA
GCGGCAGCCG CCTTCTCCAT GGGCGCCGCC TACCTCGTGA CCGGGTCGGT GAATCAGGCC
TGCGTCGAGT CCGGCACCTC CGACACCGTG CGCGGCATGC TCGCCGGCAC CCGCCAGGCT
GACGTGACCA TGGCCCCCGC CGCCGACATG TTCGAGATGG GGGTCACCGT GCAGGTCCTA
AAGCGCGGCA CCATGTTCCC CATGCGCGCA CAGAAGCTCT ACGAGATCTA CCGCGCCTGC
AGCAGCCTCG ACGACATCCC CGCCGCCGAG CGCGAGAAGC TGGAGAAGAC CATGTTCCAG
GCGTCGCTCG CCGACATCTG GCACGACACC CGCGCCTTCT TCGCCAAGCG CGACCCCTCC
CAGGTCGAGC GTGCCGAGCG CGACCCGAAG CACCTGATGG CGTTGGTCTT CCGCTGGTAT
CTCGGCATGG CCGCGCACTG GGCCAAAGAC GGAGCGGAAG AGCGGCGCAT GGACTACCAG
GTCTGGTGCG GCCCCGCCAT GGGAGCCTTC AACGAATGGG CCTCAGGTTC CTTCCTCGAC
GCCCCGGGCA ATCGCACGGT CGAAGCCGTG GCCCTAAACA TCCTGCACGG AGCGGCCGCA
CTTAACCGCG CCAACTTCCT GAGCAGCCAG GGCATCGAAC TCAGGATGGA TGAAATCGCA
CCGCAACCTC TCGAAATCGC ACAAATCAAG GAGTACCTTT GTTGA
 
Protein sequence
MDPFSLQGDH TARSANLENL GSWHPASNAP PQKAANLRDA LRYVRQPLYL VEKERTMVPR 
LGGIGRLGAV NPGALPIAAY APPCFPENLG DPSFCRELGI RYPYVGGSMA KGISSAAMAE
ELGRAGMLGF FGAAGLPLAT VSETADRLKA SLGDIPYGFN LIHSPHEPEL ERELAELYIK
KGIRTIEASA FLALTLPLVR YRLHGIKRAA DGSIVTPNRI IAKVSREELA AKFFAPAPEK
LLRALVANGS ITAEQAELAA LVPLAQDVTA EADSGGHTDN RPALALFPTI NALAAKLQRQ
YGYSCRLRVG LGGGVSTPAS AAAAFSMGAA YLVTGSVNQA CVESGTSDTV RGMLAGTRQA
DVTMAPAADM FEMGVTVQVL KRGTMFPMRA QKLYEIYRAC SSLDDIPAAE REKLEKTMFQ
ASLADIWHDT RAFFAKRDPS QVERAERDPK HLMALVFRWY LGMAAHWAKD GAEERRMDYQ
VWCGPAMGAF NEWASGSFLD APGNRTVEAV ALNILHGAAA LNRANFLSSQ GIELRMDEIA
PQPLEIAQIK EYLC