Gene GM21_2807 details

Gene Information

Locus tag: GM21_2807
Symbol:
ID: 8138150
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3263701
End bp: 3266739
Gene Length: 3039 bp
Protein Length: 1012 aa
Translation table: 11
GC content: 63%
IMG OID: 644870409
Product: 4Fe-4S ferredoxin iron-sulfur binding domain protein
Protein accession: YP_003022598
Protein GI: 253701409
COG category: [C] Energy production and conversion
COG ID: [COG1148] Heterodisulfide reductase, subunit A and related polyferredoxins
TIGRFAM ID:
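
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts one terminal stop codon that is not represented in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 3263701
end_bp = 3266739
gene_length_bp = 3039
protein_length_aa = 1012

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the unspliced CDS ends with a single stop codon that is not
# counted in the protein length.
codons, remainder = divmod(gene_length_bp, 3)
residues = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("codon count matches protein length:",
      remainder == 0 and residues == protein_length_aa)
```

Under these assumptions both checks agree with the record: 3266739 - 3263701 + 1 = 3039 bp, and 3039 / 3 - 1 = 1012 aa.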


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 74
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATACCG CAGAGAGAAA TATTGGTGAC GTACTCATTG TCGGTGGAGG GATCAGCGGT 
ATTCAGGCCG CCCTGGATCT CGCCAACTCC GGGTTCAGGG TCTTCCTGGT GGACAAGTCT
CCCGCCCTGG GCGGGAAGAT GTCCCAGTTG GACAAGACCT TCCCCACCAA CGACTGCTCC
ATGTGCATCG AGAGCCCGAA ATTCATCGAG TGCTCCCGAA ACCCGAACAT CGAGATCATC
ACCTACACCG AGGTGGATCG GGTGGAAGGG GAGGCCGGCG ACTTCAAGGT GACCCTCACC
AAGAAGCCCA GGTACATCTC GGAGGAGAAG TGCACCGGGT GCAACATCTG CGTGGACTAC
TGCCCGGTCA AGATCCCGGA CCCGTTCAAC CAGAACCTCT CCGAGAACAA GGCGGTCCAC
ATCTACTTCT CCCAGGCGGT GCCGCTGGTC ACCTACGTCG ACCCGGAGAC CTGCCTGTAT
CTCAAGGAAG GGAAGTGCCA GATCTGCGTC GGCGCCTGCA AGACCAACGC GATAGACCTG
CACCAGAAGC CGGAGACCTT CCAGGTCGAG GTGGGCGCCA TCATCCTCTC CCCGGGCTAC
TCCACCTTCG ACCCGAAGCT GAGAAACGAC TTCGGCTACG GCAAGATGCA GAACGTGGTG
ACGAGCCTCG ACTTCGAGCG CATCCTCTGC GCCACCGGCC CCTACGAGGG GGAGGTGCTG
CGCCCCTCCG ACAAGAAGCA CCCGCACAAG ATCGCCTGGA TCCAGTGCGT GGGGTCGCGC
CAGGTGATCC CGGGCGGCAA CAACTACTGC TCGGCCGTCT GCTGCGCCTA CTCGCAAAAG
CAGGTGATAC TGGCGAAAGA CCACAGCCCG ACCCTTGAGG CCACCATCTT CCACAACGAC
GTCCGCGCCT ACGGAAAAGA CTTCGAGAGG TTCCACCAAA GGGCCGAGAA GCTCCCGGAT
GTCCGCTTCA TCAGGAGCTA CGTCACGGTG GGCAGGGAAA TCGAGAGCTC GAAGAACGTC
ACCATCAGGT ACTCCACCGT CGACCAGGGG GTCAAGGAGG AGGAATTCGA CATGGTGGTC
CTCTCGGTGG GCTTGAATCC CCCGAAGGAC GTGGAGGCGC TGGCCGAAAA GTTCGGCATC
GAGCTGACCG ACCAGAAGTT CTGCAAGAAC AACCCCTACA ACCCGATCGA GACCTCGAAG
AAGGGGATCT TCGTCTCCGG CGCCTTCCAG GGGCCGCTCG ACATCCCCGA ATCGGTCCTG
ACCGCCAGCG GCGCAGACGC TCTCTGCGGA GAGCTTCTTT CCTACCGGCG CCACAAGCTG
GAGCGGGAGA GGGTCTACCC CGACGAGAAG GACGTATCCA AGGAGGAGCT GAAGATCGGG
GTGGTGGTCT GCTACTGCGG CGCCAACATC GGCCGCGTGG TCAACATCCC CGAGGTGATC
GAGTACGCGG CTACGCTCCC GAACGTCTCC TGGGCCGGCG AGAACCTCTT CGCCTGCTCC
ACCGAGAACG CGAAGCAGAT CTCCGACGCC ATCGTGGCCA AGGGTCTGAA CCGCGTGGTC
CTCGCCGCCT GCACCCCTAG GACCCACGAG CCGCTCTTCA GGGACACCTG CCGCGAGGCG
GGGCTGAACC AGTACTTCTT CGAGTTCGCC AACATCCGCG AGCACTGCTC CTGGGTCCAC
TCCCGCGAGA AGGAGAACGC GACCGAGAAG GCGAAGGAGA TCGTCAGGAT GGCCGTGGCC
CGAGCCGCGC ACCTGGAGCC CCTGCAGGAA TTCCAGCTCC CGGTCGACCA CACGGCGATG
GTCGTTGGCG GCGGCGTCGC CGGCATGACC GCCGCGCTCA ACATGGCCGA GCAGGGTTTC
GAGATCTACC TGATCGAGAA GGAAGACGAC CTGGGGGGGA TGGCGAGAAG GCTGCACTAC
ACCCTGGAGG GGACCGAGAT CCAGCCCTTC CTGGAGGATC TGGTCAAGAA GGTGTACCGG
CACCCGTCGA TCCACGTCTG GACCGACTCC ACCATCTTGG ACGTGACCGG TTACGTGGGT
AACTTCACCA CGCAAGTGGA GAGCCAGGGT CGGGTGCGCG AGGTGAAGCA CGGGGTATCG
CTTCTGGCCA CCGGCGCCGC CGAGTACAAG CCGACCGAGT ACCTCTACGG CGAGAACGAG
AACGTGCTGA CCCAGTTGGA GCTGGAAGGG GAGATCGTCG CGGAAAGCGA AAGGGTGAAG
GCGGCCCAAA GCGTGGTGAT GATCCAGTGC GTGGGATGCC GGCAGACGGA CAGAAACTAC
TGCAGCCGGG TCTGCTGCAG CCAGGCGATC AAGAACGCGC TGAAGCTGAA GAAGATCAAC
CCGGAGATCG AGATCCAGAT CATCTTCCGC GACATGCGCA CCTACGGCCT GAAGGAGGTC
TACTACCGCG AGGCGGCGAA CCAGAACGTG AAGTTCATCC GCTTCGAGGC GGACAAAAAG
CCGGTGGTCG AGGCGCAAGG GGAAGGCTTC AAGGTAACCG TACCCGACCC GGTCCTGGGG
CAGTTGATGG AGCTTGAGGC GGACCTCGTG GTGCTCGCCG CCGCGGTGAT ACCGTCGGAG
GCAAGCCAGG AGGCGGGGAA ACTCTTCAAG GTCTCCAACA ACCCGGACGG CTTCTTCCAG
GAGGCCCACG TGAAGCTGAG GCCGGTCGAC TTCAGCGCGG ACGGCGTCTT CCTCTGCGGC
ACGGCCCACT ATCCCAAGCA CCTGACCGAG ACCATCAGCC AGGCCTTGGG GGCCGCGGGA
CGCGCCGTGG CGATCCTCTC CAAGGAGACC GTCACCGCCT CCGGCTCGGT CTGCGACGTC
AACGAGAATA ACTGCGTATC CTGCGGGGCC TGCATCACCG CCTGCAAGTA CGGCGCGATC
AGCTTCGTCG ACACCCCCAA GGGGAAAAAG GCGCGTGTGG AACCGATCCT TTGCAAAGGG
GACGGCCTGT GCAACGCCAA GTGCCCGACC CAGGCGATCT ACCTGAAGCA CTACACCGAC
GACGCCATAT TCGCCCAGAT CGACGCAGCG CTTCACTAA
 
Protein sequence
MNTAERNIGD VLIVGGGISG IQAALDLANS GFRVFLVDKS PALGGKMSQL DKTFPTNDCS 
MCIESPKFIE CSRNPNIEII TYTEVDRVEG EAGDFKVTLT KKPRYISEEK CTGCNICVDY
CPVKIPDPFN QNLSENKAVH IYFSQAVPLV TYVDPETCLY LKEGKCQICV GACKTNAIDL
HQKPETFQVE VGAIILSPGY STFDPKLRND FGYGKMQNVV TSLDFERILC ATGPYEGEVL
RPSDKKHPHK IAWIQCVGSR QVIPGGNNYC SAVCCAYSQK QVILAKDHSP TLEATIFHND
VRAYGKDFER FHQRAEKLPD VRFIRSYVTV GREIESSKNV TIRYSTVDQG VKEEEFDMVV
LSVGLNPPKD VEALAEKFGI ELTDQKFCKN NPYNPIETSK KGIFVSGAFQ GPLDIPESVL
TASGADALCG ELLSYRRHKL ERERVYPDEK DVSKEELKIG VVVCYCGANI GRVVNIPEVI
EYAATLPNVS WAGENLFACS TENAKQISDA IVAKGLNRVV LAACTPRTHE PLFRDTCREA
GLNQYFFEFA NIREHCSWVH SREKENATEK AKEIVRMAVA RAAHLEPLQE FQLPVDHTAM
VVGGGVAGMT AALNMAEQGF EIYLIEKEDD LGGMARRLHY TLEGTEIQPF LEDLVKKVYR
HPSIHVWTDS TILDVTGYVG NFTTQVESQG RVREVKHGVS LLATGAAEYK PTEYLYGENE
NVLTQLELEG EIVAESERVK AAQSVVMIQC VGCRQTDRNY CSRVCCSQAI KNALKLKKIN
PEIEIQIIFR DMRTYGLKEV YYREAANQNV KFIRFEADKK PVVEAQGEGF KVTVPDPVLG
QLMELEADLV VLAAAVIPSE ASQEAGKLFK VSNNPDGFFQ EAHVKLRPVD FSADGVFLCG
TAHYPKHLTE TISQALGAAG RAVAILSKET VTASGSVCDV NENNCVSCGA CITACKYGAI
SFVDTPKGKK ARVEPILCKG DGLCNAKCPT QAIYLKHYTD DAIFAQIDAA LH
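
The relationship between the gene sequence and the protein sequence above can be checked by translating the DNA codon by codon. The sketch below is an illustration under stated assumptions, not the method used to produce this record. It builds the standard codon assignments, which translation table 11 shares for internal codons. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAATACCG CAGAGAGAAA TATTGGTGAC GTACTCATTG TCGGTGGAGG GATCAGCGGT"
last_line = "GACGCCATAT TCGCCCAGAT CGACGCAGCG CTTCACTAA"

print(translate(first_line))
print(translate(last_line))
```

Each full line of the gene sequence holds 60 bases, so every line starts in frame and can be translated on its own. Passing the complete gene sequence to `translate` and `gc_percent` would allow a comparison with the protein sequence and the GC content fields of the record.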