Gene Mmc1_2494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmc1_2494 
Symbol 
ID4484164 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMagnetococcus sp. MC-1 
KingdomBacteria 
Replicon accessionNC_008576 
Strand
Start bp3181177 
End bp3182184 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content63% 
IMG OID639723242 
ProductNADH ubiquinone oxidoreductase, 20 kDa subunit 
Protein accessionYP_866400 
Protein GI117925783 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA) 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.014165 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.608581 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAAGG AGAGCTTACA GCTCATGTGG TTGCAGTCCG GGGGTTGTGG GGGGTGTAGC 
CTATCGCTGC TGTGCGCGGA GAATCCGGAT CTCTACACCG CTTTGGCCAG TCATGGTATT
GAGTTGCTCT GGCATCCGGC CCTGAGCGTG GGGCCGGAGG ATGGTTTTTT GGCCCTGTTG
CAGCGGATTG TGCGCGGTGA GCAGCGGCTG GATATCCTCT GTCTGGAAGG GGCGGTGATG
ACCGGCCCCA ATGGCACGGG GGCCTTTCAT CGGTTATCGG GCAGCGGTCG CCCGATGATG
GCGGTGGTCA AAGAGCTGGC TGCACTGGCG GGGCAGGTGG TGGCGGTGGG CAGTTGCGCG
GCCTATGGTG GGGTAACGGC GGGGGGTGGC AACCACACCC AGGCGGTGGG GCTGCACTAT
GTGGGGCGCA AACGGGGGGG GCTGTTGGGG GCCGCTTTTC GCGCACGCCA AGGGCTGCCG
GTGATCAATG TGGCGGGTTG TCCCACCCAC CCCAATTGGG TGCTGGAGTG TTTGGCTGAG
TTGGCCGATG GCAGCTTGAC CGAGGCCCAG TTGGATCCTC TGGGCAGACC CCGTGCCTAT
ACCGATACGT TGGTCCACCA TGGTTGCCCG CGCAATGAGT TTTATGAGTA CAAAGCCAGT
GCGGAAAAGC TGGGGCAGCT GGGCTGCTTG ATGGAGCATT TGGGCTGTTT GGGTACTCAA
GCCCATGCCG ACTGTAATCT GCGTCTGTGG AATGGCGAGG GCTCCTGTCT GCGCGGGGGT
TATCCCTGCA TCAACTGTAC AGCGCCGGGT TTTCAGGAGC CGGGCCATGC CTTTACTGCC
ACGCCCAAGG TGGCGGGGAT CCCTGTGGGT TTGCCCACCG ATATGCCCAA GGCGTGGTTT
GTGGCGCTGG CCTCCCTCTC CAAAGCGGCC ACCCCCAAAC GGTTGCGGGA TAATGCCTTG
GCCGAGCATA TTGTGACCCC GCCGGCCACC CCTGTGGAGA GACCATGA
 
Protein sequence
MAKESLQLMW LQSGGCGGCS LSLLCAENPD LYTALASHGI ELLWHPALSV GPEDGFLALL 
QRIVRGEQRL DILCLEGAVM TGPNGTGAFH RLSGSGRPMM AVVKELAALA GQVVAVGSCA
AYGGVTAGGG NHTQAVGLHY VGRKRGGLLG AAFRARQGLP VINVAGCPTH PNWVLECLAE
LADGSLTEAQ LDPLGRPRAY TDTLVHHGCP RNEFYEYKAS AEKLGQLGCL MEHLGCLGTQ
AHADCNLRLW NGEGSCLRGG YPCINCTAPG FQEPGHAFTA TPKVAGIPVG LPTDMPKAWF
VALASLSKAA TPKRLRDNAL AEHIVTPPAT PVERP