Gene GM21_2197 details

Gene Information

Locus tag: GM21_2197
Symbol:
ID: 8137533
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2567110
End bp: 2568309
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 61%
IMG OID: 644869812
Product: FAD dependent oxidoreductase
Protein accession: YP_003022007
Protein GI: 253700818
COG category: [R] General function prediction only
COG ID: [COG0579] Predicted dehydrogenase
TIGRFAM ID:
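
The reported gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below assumes that the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Coordinates taken from the record above.
length = gene_length_bp(2567110, 2568309)
print(length, protein_length_aa(length))  # 1200 399
```

Under these assumptions the result agrees with the listed values of 1200 bp and 399 aa.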


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value: 0.000547368
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCTTCG ACAACGCCGG AATACTTATC GTCGGGGGGG GCATTATCGG CCTCACCATC 
GCCCGCGAAC TGGTGAAGCA GGGACACGGC GACATCGTCA TCATCGAGAA GGAAACGGAG
CTGGGCGTCC ACGCCTCGGG CCGCAACAGC GGCGTGCTCC ATGCCGGCAT CTACTATTCC
CCGGATAGCC TCAAGGCCAA ATCCTGCTTG AACGGCAACT TCCTGATGCG GGAATACTGC
AAGGAGAAGG GGCTTCCGCT TCTGGAGAGC GGCAAGGTCA TCGTCACCCG CACCGCGGCC
GAACTCCCGG TCTTGGACGA ACTGCACCGG CGAGCGACGG CAAACGGCGC CAAGGTGGAG
ATGATCGACG AGCGGCAACT GGCTGCCATA GAGCCGAACG CCCGGACGGT GGAGCGCGCG
CTCTTCTCGC ACTACACCGC GGTGGTTGAC CCTAAAGCGG TGCTGAAGAG CCTCAAAAAG
GACCTGGAAC AGACCGGACG GGTGAAGCTT CATCTGGGCT GCAAAATGAC CGGCCTCAAG
GGAAGCTCCA CGGCGGTGAC CAACAAGGGG GATATAAGCT TCGAAAGGTT CATCAACGCC
GCCGGCGCCT ACTGCAACAA GGTGGCGGGC TTCTTCGGGG TGGGTGCCAA ATACCGGCTG
ATCCCCTTCA AGGGGGTGTA CCGACTGCTG AAAAAGGATG CCCCCTTTAC CGTCAATTCC
AACATCTACC CGGTGCCCGA CATCCGGAAC CCCTTTCTGG GGATCCACTT CACCCGCAGC
GTCCACGGCG ACGTCTACCT GGGCCCCACT GCCATCCCCG CTTTCGGGCG GGAGAACTAC
GGCATCCTCT CGGGCATCGA CGCCGAAGCC TTCAGCATTG CCTGGCAGGA CCTGGTCCTA
TTTCTCGTCA ACCGGCCTTT CCGCAATGTC GCTCTCTCGG AGCCGCTCAA GTATTTTCCC
TCTTACTTCT TCCGCGACGC AGCGAAGCTG GTGAAGGAGT TGGCCCCCTC CGACGTGGTG
CATGCTTCCA AGGTGGGGAT ACGTCCGCAG TTGGTCGACT GGGAGAAGAA GGAGCTGGTG
ATGGATTTCC TGGTGGTGGC CGATGGGTCG TCGCTCCACG TGCTGAACCC GATCTCCCCC
GCTTTCACCT CGTCGATGGA TCTGGCGCAG GGGATGGTGG CGGAGCATTT CTCGTCCTGA
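
The gene sequence is printed in blocks of 10 bases separated by spaces. To recompute a figure such as the GC content, the blocks first have to be joined. The following is a minimal sketch, with hypothetical helper names, that strips the whitespace and computes the GC percentage; only the first line of the sequence is used here for brevity, so the printed percentage is not the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, in its blocked layout.
first_line = "ATGAGCTTCG ACAACGCCGG AATACTTATC GTCGGGGGGG GCATTATCGG CCTCACCATC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 1200 bp sequence to `clean_sequence` and `gc_percent` gives the whole-gene value, which can be compared with the GC content listed in the record.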
 
Protein sequence
MSFDNAGILI VGGGIIGLTI ARELVKQGHG DIVIIEKETE LGVHASGRNS GVLHAGIYYS 
PDSLKAKSCL NGNFLMREYC KEKGLPLLES GKVIVTRTAA ELPVLDELHR RATANGAKVE
MIDERQLAAI EPNARTVERA LFSHYTAVVD PKAVLKSLKK DLEQTGRVKL HLGCKMTGLK
GSSTAVTNKG DISFERFINA AGAYCNKVAG FFGVGAKYRL IPFKGVYRLL KKDAPFTVNS
NIYPVPDIRN PFLGIHFTRS VHGDVYLGPT AIPAFGRENY GILSGIDAEA FSIAWQDLVL
FLVNRPFRNV ALSEPLKYFP SYFFRDAAKL VKELAPSDVV HASKVGIRPQ LVDWEKKELV
MDFLVVADGS SLHVLNPISP AFTSSMDLAQ GMVAEHFSS
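
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard compact amino-acid string, which translation table 11 shares with the standard code. It is an illustration under simplifying assumptions: it does not handle the alternative start codons that table 11 allows, which does not matter for this gene because it starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence from its first base up to the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCTTCG ACAACGCCGG AATACTTATC"))  # MSFDNAGILI
```

The ten residues obtained from the first 30 bases match the first block of the protein sequence listed above.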