Gene GM21_2033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2033 
Symbol 
ID8137369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2356527 
End bp2358281 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content63% 
IMG OID644869648 
Productindolepyruvate ferredoxin oxidoreductase, alpha subunit 
Protein accessionYP_003021843 
Protein GI253700654 
COG category[C] Energy production and conversion 
COG ID[COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits 
TIGRFAM ID[TIGR03336] indolepyruvate ferredoxin oxidoreductase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0000000000299655 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGGAAA TACTTTCCGG CAACGAAGCC ATTGCCAGGG GTGCTTACGA GGCAGGGGTG 
AAAGTTGCCT GTGCTTACCC TGGCACCCCT TCCACCGAAA TCCTGGAAAA CACCATCCGG
TACCAGGAGA TAGACTCCTC CTGGGCCACC AACGAAAAGG TCGCGCTGGA GGTCGGCATC
GGCGCCTCTT TTGTCGGCGC CCGCTCCCTG GTCACCATGA AGCACGTCGG CGTGAACGTG
GCCGCCGATC CCCTTTTCAC CCTTTCCTAT ACCGGCGTGA ACGGCGGGCT CTTGCTGATC
TGCGCCGACG ACCCGGAACT GCACTCCTCC CAGAACGAGC AGGACAGCCG CAACTACGCC
AAGTTCGCCA AGATCCCGAT GCTGGAGCCG GCGGATTCGC AGGAATGCCT TGAGTTCACC
AAGCTCGCCT ACGAGATCTC CGAGCGCCAC GACACCCCGG TCATGCTCCG CACCACCACC
CGCATCTCGC ACAGCAAGTC GGTCGTGACG CTGGGCGAGA GGGTCGCCGC CGTCGCCGAG
CCGAAGCTCG CCAGGAACGC CGCCAAGTTC GTCATGCTCC CGGGGAACGC GCGCGGGCGG
CACTACGTCG TCGAGGACCG GATCACCACG CTCTCCAAGG AAGGGTGTTC CATGCCCATC
AACAGGATCG AGCTGCGCGA CAAGAAGATC GGCGTCATCA CCGCCGGAAT CAGCTACCAG
CACGTGCGCG AGGCGCTCCC CGAGGCATCG GTGTTGAAGC TCGGCATGGT GTTCCCGCTT
CCCTTCGACC TGATCCGCGA GTTCGCCTCC AAGGTGGACA AGCTCTACGT GGTCGAGGAA
CTCGACGCCT TCATCGAGGA CCAGGTGAAG GCCATCGGCA TCCCGGTGAC CGGCAAGGAG
ATCATCTCAC TTTGCGGTGA GCTGACCCCC GGCCGCGTCA GGAAAGCCTT CGGGCTTCCC
GAGAACGCAC AGGGCACGGT GGAGAAGCTC CCGGGGCGCC CCCCCAACAT GTGCCCCGGC
TGCCCGCACC GCGGCGTGTT CTACACACTG AAACAACTGA ATGCCTACGT CTCCGGCGAT
ATCGGCTGCT ATACCCTGGG CTTCATGCCG CCTCTTTCCG CCATGGATAC CTGCGTCTGC
ATGGGCGCCT CCATCGGCAT GGCGACCGGC GCAGTAAAGG TGCTCTCCCC GGAGGAGAGG
AAGAAGGTTG TGGCCGTGAT CGGCGACTCG ACCTTCCTCC ACACCGGCAT CAACGGCCTC
ATGGACATGG TCTACAACAA GGGGGCCGCG ACGGTCATCA TCCTCGACAA CAGGATCACC
GCCATGACCG GCCGCCAGGA AAACCCTGGT TCCGGCCACA CCCTGATGAA CGAGCCTACC
AACGCCATCG ATTTCCCGAT GCTTTGCCAG GCGATCGGCG TGAAGAACAT CCGCACCATC
AACCCGCTGG ACCTGGACGA ATGCCGGAGG GTGATCGCAG AGGAGATGGA GCGCCCCGAG
ACCTCGGTCA TCATCACCGA CAAGCCGTGC GTACTCATCA AGAAGGAAGG GGTCTTCACC
CCGGGCAAGC CGCTTGCCGT CGTCGAGGAT AGCTGCACCG GCTGCCGCGC CTGCCTGAAG
ATCGGCTGCC CGGCCATCGA GTGGGTCCCC TCCAGCGGCA AAAAGGGCCA AGCCAAGATC
GATCCGCTTT TGTGCAACGG TTGCGACGTC TGCAGGCAGC TGTGCAAGTT CTCTGCGATT
CAGGAGGCGA AATGA
 
Protein sequence
MKEILSGNEA IARGAYEAGV KVACAYPGTP STEILENTIR YQEIDSSWAT NEKVALEVGI 
GASFVGARSL VTMKHVGVNV AADPLFTLSY TGVNGGLLLI CADDPELHSS QNEQDSRNYA
KFAKIPMLEP ADSQECLEFT KLAYEISERH DTPVMLRTTT RISHSKSVVT LGERVAAVAE
PKLARNAAKF VMLPGNARGR HYVVEDRITT LSKEGCSMPI NRIELRDKKI GVITAGISYQ
HVREALPEAS VLKLGMVFPL PFDLIREFAS KVDKLYVVEE LDAFIEDQVK AIGIPVTGKE
IISLCGELTP GRVRKAFGLP ENAQGTVEKL PGRPPNMCPG CPHRGVFYTL KQLNAYVSGD
IGCYTLGFMP PLSAMDTCVC MGASIGMATG AVKVLSPEER KKVVAVIGDS TFLHTGINGL
MDMVYNKGAA TVIILDNRIT AMTGRQENPG SGHTLMNEPT NAIDFPMLCQ AIGVKNIRTI
NPLDLDECRR VIAEEMERPE TSVIITDKPC VLIKKEGVFT PGKPLAVVED SCTGCRACLK
IGCPAIEWVP SSGKKGQAKI DPLLCNGCDV CRQLCKFSAI QEAK