Gene Emin_0477 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0477 
Symbol 
ID6262658 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp509761 
End bp511197 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content42% 
IMG OID642610948 
Producthydrogenase large subunit 
Protein accessionYP_001875371 
Protein GI187250889 
COG category[R] General function prediction only 
COG ID[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAAAAA GAAAGATTAA CAATTCGGAA CACTTGAAAA GAGAAGTACT TGTAGAAATA 
GCAGCAATGT TTTTTCACGG TAACCTTGAA AAAGATATAC ACAAAATTCC TTATAACATA
ATACCAACAG GCTCGGAAGC GCAATTCAGA TGCTGCGTTT ATAAAGAACG CTCAATATTG
AAATTCCGCG TTTTGGCCGC GTTGGGCTAC TCCGTAAACG AAGTTGACGA AGCTGAACAT
TTAAATAACT TTGTAGAGCA GAAAGATTTA AACAATAAAG AAGGTAAACT TTTAACCGTT
TTAGATTCAG CCTGCAAAGC TTGTATGAGA AGTAATTATA TGGTGACGGA AGTCTGCCAG
GGCTGCGTGG CCAGGCAGTG TATTTACGAC TGTCCTTTTA ACGCTATAAG CATGCAAAAC
GGACGTGCTT ACATAGAGCC GGCAAAATGC AAAAACTGCG GTAAATGTAA GTCAGCCTGT
CCTTACGGAG CTATTTTAAA ATTAAACGTG CCTTGTGAGG AAGCTTGTCC CGTTAACGCT
ATTAAAAAAG ACCAAAAAGG CCGCGCTATA ATTGACCACA GCATGTGTAT AAGCTGCGGC
AGATGTATGA AGGTTTGCCC TTTTGGCGCT ATTATGGAGC GCAGCCAAAT ACTTAATGTT
TTAAAAGCCT TTAACAGTGA TAAAAAAGTT GTGGCAATGG TGGCCCCTGC TATAGCAGGC
CAGTTTGACG CAAGTATGGG CGTGCTTACA ACAGCTTTAA AAAAAATCGG GTTTGACTAC
GTCTATGAAG TCGCAAAAGG CGCGGAAGTT ACCGCTTCTA ACGAAGCCGC TGAGTTTAAA
GAGCGCGTTA TTGAAAAAAG GCAAAAGTTT ATGACAAGCT CCTGCTGTTT CGCTTACACA
AAGCTTGTGC AAAAGCATGT GCCGGAACTT CAGCAGTATA TTTCGCATAC CAAAACGCCT
ATGCACTACA CCGCTGAAAT AGTAAGACGT GAACTGCCTG GCGCGGTTAC GGTGTTTATA
GGCCCTTGTT TATCAAAAAG AAAAGAAGGC CAGCAAAGCG GTTTGGTTGA CTTTGTTCTT
AATTTTGAAG AACTTTACGC CATTTTAACC GCCAAAGGTA TAAATTTGCT TCAGTGCGAG
GAAGAAAAGC TTGAAAACAG GCCCAGCGGC GCGGCAATGA GATTTCCTTT AGCGGGCGGC
GTTACAAAGG CTGTAAGAGC CGCGTCTAAA GAAGATCTTG GCATTAAGGC GGAGCTTATT
AACGGCTTAG ACCAAAAAGT TATTTACAAA CTTAAAGCTT ACTGCAACGG CAACTGTCCG
CATAACTTTT TAGAAGTTAT GACTTGTTTG GGCGGCTGCG TAGGCGGGCC CGACGCTATA
AGAGATAAAA TTAAAGCCGC TGTAGACGTT GAAAAATACT CGGCCCAAAA CGATTAA
 
Protein sequence
MIKRKINNSE HLKREVLVEI AAMFFHGNLE KDIHKIPYNI IPTGSEAQFR CCVYKERSIL 
KFRVLAALGY SVNEVDEAEH LNNFVEQKDL NNKEGKLLTV LDSACKACMR SNYMVTEVCQ
GCVARQCIYD CPFNAISMQN GRAYIEPAKC KNCGKCKSAC PYGAILKLNV PCEEACPVNA
IKKDQKGRAI IDHSMCISCG RCMKVCPFGA IMERSQILNV LKAFNSDKKV VAMVAPAIAG
QFDASMGVLT TALKKIGFDY VYEVAKGAEV TASNEAAEFK ERVIEKRQKF MTSSCCFAYT
KLVQKHVPEL QQYISHTKTP MHYTAEIVRR ELPGAVTVFI GPCLSKRKEG QQSGLVDFVL
NFEELYAILT AKGINLLQCE EEKLENRPSG AAMRFPLAGG VTKAVRAASK EDLGIKAELI
NGLDQKVIYK LKAYCNGNCP HNFLEVMTCL GGCVGGPDAI RDKIKAAVDV EKYSAQND