Gene GM21_2049 details

Gene Information

Locus tag: GM21_2049
Symbol:
ID: 8137385
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2371782
End bp: 2373116
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 63%
IMG OID: 644869664
Product: pyridine nucleotide-disulphide oxidoreductase dimerisation region
Protein accession: YP_003021859
Protein GI: 253700670
COG category: [R] General function prediction only
COG ID: [COG0446] Uncharacterized NAD(FAD)-dependent dehydrogenases
TIGRFAM ID:
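
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not translated; the function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose last codon is an untranslated stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields above.
length_bp = gene_length(2371782, 2373116)
length_aa = protein_length(length_bp)
print(length_bp, "bp")
print(length_aa, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.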


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 5.49451e-22
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGACTG TGATCATCGG CGGGGTCGCG GCAGGACTGT CGGCGGCAAG CCAGGCAAAG 
CGGCTGTCCC CCGAATCGGA AGTGGTAGTG CTGGAGAAAA CCGGCGACGT ATCCTATGCC
GCGTGCGGGA TGCCTTACAA CCTTTTCTTC AAGGAGAAGC CGGTCGAAAA GCTCTACGCG
CTGTCGCTTG AGACCATTCG CAAGGAGCGT GGAATAGACT ATAGGCTGCG GCAGGAGGTC
ACCGGCATCG ACCCGGTCGG CAAGGTGGTG AGCGTGACGG ATCTCGCCAC AGGCAAGAGC
TACGAAGAGC GCTACGACTT CCTGGTCTAC GCCACCGGCA ACAGCGCCAT CAGGCTCACC
GCACCCGGCT TCGACGACGG CGACGTCTTT TGTTTCAAGA CGCTCGACGA CACCCGCCAC
GTCAAGCAGT TCATCTACGA CAAGGCGCCG AAGCGGGCGG TCTTGGTCGG CGCCGGCTAC
ACCAACCTGG AGGTCGCCGA CGTACTCACC AACATGAAGA TCAAGCCGGT CATCCTGGAG
AAAGCCCCCA CCATACTCCC TTCCTTCTGC GAGGAGGCGA GGGAGAAGGT AATGGAGAAG
GTGAAGGAGA GGGGGGTCGA GCTTATAACC GGTGTCGATA TCGCCGAGAA GGCGGGGGGC
GAGGTCCGGT CCTCGGCAGG CGTTTTCCCC GCCGACCTCG TGGTGGTCGC CGTCGGCACC
CGCCCCAACA CCGCCCTTTT CGCCGCTGCT GGAGGCGAAT TGGGGACGGC GGGGGCGGCC
AAGGTCGACC GTTACCTGCG CACCAATCTC GACTCCGTCT TCGCCGGGGG GGACTGCGCC
GAGCATTATG TCCGGCAACT GGGAATGAAC TCCTACTTCC CGCTTGGCCC TGCGGCCAAC
AAGCACGGGC GCGTCATAGG GAGCAACGTC TCCAACCCCG ACCATATGAT GGAATTCTGG
GGAATCGATC AGACCGCGGT CTTCAAGTTC TTCGAGCTGA GCGTCGCCAC CACCGGTCTG
AACGAGAGGC AACTGCTCGC GCTCGGCAAG GATTTCGTCA AGGTCGCCGT GGACAACCCC
ACCCGCGGCG AATTCCCCGG TGGAAGCACC ATGCGCGTGA TCCTTTTCTG CCAGAAGGGG
GACGGGCTTC TCCTCGGCGC GCAGATCGTC GGCGAGGACG TGGTGGCCAA GAGGCTCGAC
GTGCTGGCGA CGGCGATCTA CAAGCAGATG ACAGTCTTCG AGATCGCCGA ACTGGATCTC
GCCTACGCCC CTCCCTACTC GCCGGTATGG GACCCGATCC TCGTCGCCGC CAACGTCGCC
GTCAAGAAGG TCTAA
 
Protein sequence
MKTVIIGGVA AGLSAASQAK RLSPESEVVV LEKTGDVSYA ACGMPYNLFF KEKPVEKLYA 
LSLETIRKER GIDYRLRQEV TGIDPVGKVV SVTDLATGKS YEERYDFLVY ATGNSAIRLT
APGFDDGDVF CFKTLDDTRH VKQFIYDKAP KRAVLVGAGY TNLEVADVLT NMKIKPVILE
KAPTILPSFC EEAREKVMEK VKERGVELIT GVDIAEKAGG EVRSSAGVFP ADLVVVAVGT
RPNTALFAAA GGELGTAGAA KVDRYLRTNL DSVFAGGDCA EHYVRQLGMN SYFPLGPAAN
KHGRVIGSNV SNPDHMMEFW GIDQTAVFKF FELSVATTGL NERQLLALGK DFVKVAVDNP
TRGEFPGGST MRVILFCQKG DGLLLGAQIV GEDVVAKRLD VLATAIYKQM TVFEIAELDL
AYAPPYSPVW DPILVAANVA VKKV