Gene GM21_3050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3050 
Symbol 
ID8138396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3538723 
End bp3540021 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content65% 
IMG OID644870650 
Product4Fe-4S ferredoxin iron-sulfur binding domain protein 
Protein accessionYP_003022836 
Protein GI253701647 
COG category[C] Energy production and conversion 
COG ID[COG1143] Formate hydrogenlyase subunit 6/NADH:ubiquinone oxidoreductase 23 kD subunit (chain I) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones134 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGAGC GAGCGCAGGA GAACGGCTAT CACCGGTTGA TGCAGCGCGT GAACAAGTTC 
CCGCAGGGGG CGCCTCCCTC GGAGCTGCTG CTCAAGATCT ACGCCCTTTT GTGCAGCGAG
GAGGAGGCGA GGCTTTTGAG CCTGCTGCCG CTGCGCCCGT TTTCGGCGGC CAAGGCGGCG
CGGGCGTGGC GCATGGAGAA GGAGGCGGCA CGGGGGAGAT TGGAGCAGAT GGCGTCGCGG
TCGCTGCTTC TGGACATCGA GCGGGACGGC AAGATGGTGT ACGTCCTCCC TCCCCCCATG
GCCGGGTTCT TCGAGTTCTC GCTGATGAGG CTGCGGCCGG ATCTGGATCA GAAGCAGTTG
GCCAGCCTCT TCTTCCGCTA CATCAACGTG GAGGACGACT TCATTCGGGA TCTTTTCGCC
GGGGGGGAGA CGCCGCTTGG GAGGGTGCTG GTGAACGAGG AGGCGATACC GGAGGAGAAG
GTCTGCCAGG TGCTGGATTA CGAGCGCGCG AGCGAGGTGA TCGGGAGCGC GACCCACATA
GCGGTCGGCC TTTGCTACTG CAGGCACAAG ATGCTGCAGG TGGGAAAGGG GTGCAAGGCG
CCCTTGGACA TCTGCATGAC GCTGAATCTC GCGGCCCAGT CCCTGATCAG GCGCGGCGCC
GCCCGCAGCG TGGACGCGGT CGAAGGGCTG GACCTCTTGC AAAAGGCGCG GGATTTGAAC
CTCGTGCAGT GCGCCGACAA CGTGCAGCGG CAGGTCAACT TCATCTGCCA CTGCTGCGGC
TGCTGCTGCG AGGGGATGAT CGCCACCCGC CGCCTCGCCA TTCCCAACGC CATGTACACC
ACCAACTTCA CCCAGGCCAC CGACCCCGCC CGTTGCGACG GCTGCGGCAG GTGCGTCGCC
ATCTGCCCGG TCGACGCCAT CTCGCTTGTG CGCGAGCCCG AGGGAAGCGG CATGCCGGCA
AAGGCCAGGT TGAACTCCGA GCTTTGCCTT GGGTGCGGGG TCTGCGCGCG CAACTGCCAC
ACCAAGGCGG TGCGGCTTGA GGCGAGGGAG AAGCGGATCC TGACCCCTGT CAACACCGCC
CACCGGCTGG TATTGATGGC CCTGGAGCGG GGGAAGCTGC AGAACCTGAT CTTCGACAAC
CAGGCCTATT TGAGCCACCG CGCCCTGGCC GCGATACTGG GGGCGATACT GAGGCTTTCC
CCGGTGAAAC GGCTCACGGC CAGCCGGCAG CTCAAGTCGC ACTACCTGGA GCGATTGATG
GCGGACGTCG ACGTCACCAG ATTTTCCAGG TTCGAGTAA
 
Protein sequence
MAERAQENGY HRLMQRVNKF PQGAPPSELL LKIYALLCSE EEARLLSLLP LRPFSAAKAA 
RAWRMEKEAA RGRLEQMASR SLLLDIERDG KMVYVLPPPM AGFFEFSLMR LRPDLDQKQL
ASLFFRYINV EDDFIRDLFA GGETPLGRVL VNEEAIPEEK VCQVLDYERA SEVIGSATHI
AVGLCYCRHK MLQVGKGCKA PLDICMTLNL AAQSLIRRGA ARSVDAVEGL DLLQKARDLN
LVQCADNVQR QVNFICHCCG CCCEGMIATR RLAIPNAMYT TNFTQATDPA RCDGCGRCVA
ICPVDAISLV REPEGSGMPA KARLNSELCL GCGVCARNCH TKAVRLEARE KRILTPVNTA
HRLVLMALER GKLQNLIFDN QAYLSHRALA AILGAILRLS PVKRLTASRQ LKSHYLERLM
ADVDVTRFSR FE