Gene GM21_0029 details


Gene Information

Locus tag: GM21_0029
Symbol:
ID: 8135328
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 36584
End bp: 38581
Gene Length: 1998 bp
Protein Length: 665 aa
Translation table: 11
GC content: 64%
IMG OID: 644867646
Product: 4Fe-4S ferredoxin iron-sulfur binding domain protein
Protein accession: YP_003019874
Protein GI: 253698685
COG category: [C] Energy production and conversion
COG ID: [COG1148] Heterodisulfide reductase, subunit A and related polyferredoxins
TIGRFAM ID:
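
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene ends with a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 36584
END_BP = 38581
GENE_LENGTH_BP = 1998
PROTEIN_LENGTH_AA = 665


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for the values in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 38581 - 36584 + 1 = 1998 bp, and 1998 / 3 - 1 = 665 aa, which agrees with the listed gene and protein lengths.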


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 92
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAGAA TCGGTGTTTT TGTGTGCCAC TGCGGCGAGA ATATCTCCCG CACGGTGGAT 
GTGGAGCAGG TGGCTAGGTC CGCGGGGGAG ATCCCCGGCG TGGCTTACGC CTGCGATTAC
AAGTACATGT GTTCCGACCC GGGGCAGAAC CTTTTGAAGA AGGCCGTCGC CGAGCACAAG
CTCGACGGCG TGGTGGTGGC GGCGTGCAGC CCGCGCATGC ACGAGAAGAC CTTCAGGAAG
GCGGCTTCGG CCGCAGGTCT CAACCCATAC CTCTGCGACA TGGCCAACAT CCGCGAGCAC
TGCTCCTGGG TCCACGAGGA CAAGAAACTG GCGACGGCCA AGGCCGGCGA CATCGTGAAG
CTCATGGTGG AGCGGGTCAA GAAGGGTAAG TCGCTCGCGC CCATCACCGT GCCGGTCACC
AAGCGGGCGC TGGTCATCGG CGGGGGGATC GCCGGGATCC AGGCGGCTCT CGACATAGCG
GACGCCGGGC ACCAGGTGCT CTTGGTCGAG CGCGAGCCCT CTATCGGCGG GCACATGGCG
CAGCTCTCCG AGACCTTCCC GACCTTGGAC TGCTCCCAGT GCATCATGAC CCCCAAGATG
GTCGACGTGG CCAACCACCC GAACATCACC CTGCACACCT TTTCCGAAGT CGAGAAGGTC
GAGGGGTACA TCGGCAACTT CCAGGTCACG CTGAAGCACA AGGCGCGCTC GGTGGATCAG
TCCAAGTGCA CCGGCTGCGG CATCTGCATG ACCAAGTGCC CCAAGAAGAA GATCCCCAAC
GAGTTCGACC AAGGGCACGG GATGCGCACC GCCATCTACG TCCCATTTCC CCAGGCGGTT
CCCAACACCC CCGTCATCGA CCGCGAGAAC TGCACCATGT TCCAAAGCGG CAAGTGCGGC
GTCTGCGCCA AGGTCTGCGG TCCGGGAGCG GTCGACTTCC AGCAGGAGGA TCGTTTCTCC
GTCGAAGCCG TGGGCGCGGT AGTCGTCGCC ACCGGGTTCA AGCTCTACTC CATCGACAGA
AAGCCCGAGG GAAGCCCGAT CCAGGGGTAC GGCGAGTTCG GCTACGGCAC CATCCCCGAC
GTCATCGACG GCATGACCTT CGAGCGCCTC GCCTCCGCCT CAGGTCCTAC CGGCGGCAAG
ATCCTCAGAC CCTCCGACGG GAAAGAGCCG AAGCAGGTGG TCTTCATCCA GTGCGTCGGC
TCCCGCGCCA GGGAGAAGGG GATCTCCTAC TGCTCCAAGG TCTGCTGCAT GTACACCGCC
AAGCACACCA TGCTCTACCA CCACAAGGTG CACGACGGCC AGGCGTACGT CTTCTTCATG
GACGCGAGGA CCCCGGGGAA GGGATACGAC GAGTTCTGGA GGCGCGCCGT CGAGGAAGAG
GAGGCTGTCT ACATCAGGGG GATGGTTTCC CGCATGTACC AGAAGGGCGA GAAGATCGTG
GTGATGGGGA GCGACATACA GGTCGGCGTG CAGGTCGAGA TAGAGGCCGA CCTGGTGGTC
CTCGCCACCG CGGTGCAGGC CCAGGACGGG GCCGACCTCC TGGCCCAGAA GCTCGGCATT
TCCTACGACA AGTACAACTT CTACTCCGAG GCCCACGCGA AGCTCAAGCC CGTGGAGTGC
GCCACCGCCG GGATCTATCT CGCCGGCGCC TGCCAGGGGC CCAAAGACAT CCCCGACACC
GTGTCCCAGG CCTCCGCCGC CGCGGCCAAG GTGATGACCC TCTTCTCCAA GGACCAGCTG
GAGCGCGACC CGGTCGTCGC CAAGGTCAAC GAGAAGTACT GCGTCGGCTG TCTCGCCTGC
AAGAAGGTCT GCCCCTACGG GGCCGTCGAG GAGAAAGAGA TCAGGGACCG CCAGGGGAAC
CTGGTCAAGG TCGTCGCCTA CGTGAACCCC GGCGTCTGCG GCGGCTGCGG CACCTGCCAG
GCCACCTGTC CGTCCAAGAG CGTCGAGCTC GACGGCTACA CCGACGAGCA GATCATGGCG
ATGATCGAAT CTCTGTAA
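
The GC content listed for this gene could in principle be recomputed from the sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks shown here; `gc_content` is a hypothetical helper name, and only the first row of the gene is used as sample input.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq.

    Whitespace and line breaks (as in the blocked layout of this record)
    are ignored. Other characters, such as N, are skipped as well, which
    is a simplifying assumption.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# First row of the gene sequence, copied from this record.
first_row = (
    "ATGTCCAGAA TCGGTGTTTT TGTGTGCCAC "
    "TGCGGCGAGA ATATCTCCCG CACGGTGGAT"
)
print(f"{gc_content(first_row):.0%}")
```

Passing the full 1998 bp sequence instead of the first row gives the figure to compare against the GC content field.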
 
Protein sequence
MSRIGVFVCH CGENISRTVD VEQVARSAGE IPGVAYACDY KYMCSDPGQN LLKKAVAEHK 
LDGVVVAACS PRMHEKTFRK AASAAGLNPY LCDMANIREH CSWVHEDKKL ATAKAGDIVK
LMVERVKKGK SLAPITVPVT KRALVIGGGI AGIQAALDIA DAGHQVLLVE REPSIGGHMA
QLSETFPTLD CSQCIMTPKM VDVANHPNIT LHTFSEVEKV EGYIGNFQVT LKHKARSVDQ
SKCTGCGICM TKCPKKKIPN EFDQGHGMRT AIYVPFPQAV PNTPVIDREN CTMFQSGKCG
VCAKVCGPGA VDFQQEDRFS VEAVGAVVVA TGFKLYSIDR KPEGSPIQGY GEFGYGTIPD
VIDGMTFERL ASASGPTGGK ILRPSDGKEP KQVVFIQCVG SRAREKGISY CSKVCCMYTA
KHTMLYHHKV HDGQAYVFFM DARTPGKGYD EFWRRAVEEE EAVYIRGMVS RMYQKGEKIV
VMGSDIQVGV QVEIEADLVV LATAVQAQDG ADLLAQKLGI SYDKYNFYSE AHAKLKPVEC
ATAGIYLAGA CQGPKDIPDT VSQASAAAAK VMTLFSKDQL ERDPVVAKVN EKYCVGCLAC
KKVCPYGAVE EKEIRDRQGN LVKVVAYVNP GVCGGCGTCQ ATCPSKSVEL DGYTDEQIMA
MIESL
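
The relationship between the gene and protein sequences can be illustrated with a short translation sketch. It assumes the amino-acid assignments of NCBI translation table 11, which match the standard code for all 64 codons (the tables differ in permitted start codons, which this sketch does not model). `translate` is a hypothetical helper, not the tool used to produce this record.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments shared by
# NCBI translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocked layout used in
    this record can be pasted directly.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from this record.
print(translate("ATGTCCAGAA TCGGTGTTTT TGTGTGCCAC TGCGGCGAGA ATATCTCCCG CACGGTGGAT"))
print(translate("ATGATCGAAT CTCTGTAA"))
```

The first row translates to the first 20 residues of the protein sequence (MSRIGVFVCH CGENISRTVD), and the last row, which ends in the TAA stop codon, yields the final five residues (MIESL).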