Gene Emin_0533 details

Gene Information

Locus tag: Emin_0533
Symbol:
ID: 6262724
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 583121
End bp: 584869
Gene Length: 1749 bp
Protein Length: 582 aa
Translation table: 11
GC content: 43%
IMG OID: 642611003
Product: hydrogenase, Fe-only
Protein accession: YP_001875425
Protein GI: 187250943
COG category: [R] General function prediction only
COG ID: [COG4624] Iron only hydrogenase large subunit, C-terminal domain
TIGRFAM ID: [TIGR02512] hydrogenases, Fe-only
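
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the terminal stop codon is not counted in the protein length; the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(583121, 584869)
print(length)                     # 1749
print(protein_length_aa(length))  # 582
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.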


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0310904
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.240554
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTAAAAG CTATTATTAA CGGTACGGAA ATTCAGGTAA AAGAAGGAAC CACGATATTA 
GAGGCCGCCA GGCTTGTTAA TATAAATATT CCTACTCTTT GCAAACATCC CGATTTAGTC
GCTGACGCGG GCTGTGGCAT TTGTGTGGTA AGGGTACAGG GCACAGGCAA AATGTTAAGA
GCATGCTGCA CTGCTTTGGA AGAAGGTATG AAAATTACCA CCCACGATCC CGAAATTGTT
AAAGTGCGCA AAAACGTGCT TGAACTTATT TTATCTAACC ACCCGAAGGA TTGTTTAATA
TGCGCCAGAA ACAATGACTG TGAGCTTCAA AGGCTTTCTT CCGAGTTTGG CATAAGAGAC
GCCTACTATC CTTTAATTGT CGGCAGAAAG AAACATAAGC ATGATGAATC AACCAAAACT
ATAGATATTG AAGGTTCAAA ATGTATTTTG TGCGGACGCT GTGTGCAGGT CTGCCAGAAA
AACCAAAATG TTTGGGCGTT ATCTTTCTTA GGCCGCGGTA TTAATACGGT GCTTTCCCCC
GCGGGCGAAA TTGAACTTAA TGATTCACCC TGCGTGAAAT GCGGGCAGTG TTCCAACCAT
TGTCCCGTGG GAGCTATAGT AGAACATGAC GAAACCCAAA AAGTTTGGGA CGCGTTAAGC
AACCCCGATC TTTTTCCCGT GGTGCAAATC GCGCCTGCGG TACGCGTTTC AATAGGGGAA
GCATTCGGCT ACCCTATAGG AACAAATCTT ACGGGTAAAC TTATAAGCTC TTTAAAGAAA
CTTGGGTTTA AAGGCGTTTT TGACACAAAC ATGGGCGCCG ACATGACTAT TATGGAAGAA
GGCAACGAGT TTGTGCATCG CTTTAAGAAA AAAGACAATA TGCCTTTGAT AACATCTTGT
TGCCCTGCCT GGGTTGACTT TTTGGAAAAG TTCCACTCCG ATATGCTTGA TAACTTTTCA
ACATGCAAAA GCCCGCATGA AATAATAGGA GTTTTGTCCA AAACATATTA CGCTAAAAAG
CATAATGTTG ACCCTTCTAA AATTTTTATG GTTTCAATTA TGCCTTGCAC GGCTAAAAAA
TATGAAATCC ACAGAAGTGA GGAAATGTTT GCTTCAGGAC ACCAGGATAT TGACATTTCG
CTCACAACGC GCGAACTTGC CCGCATGATT AAACAAAGCG GTATTGATTT TAAAAATATT
GAGGACCAGA AAGCCGATTC AATACTTGGC GCCTATTCCG GGGCAGGCAC CATTTTTGGC
GCTACCGGCG GAGTTATGGA AGCCGCTTTG AGAACCGCCT ACCATGTTAT TACCGGCAAA
GAACTCTCTA AAGTGGAGTT TAAGCAAGTA CGTGGCCTTA AAGGTATTAA AGAAGCCAAT
ATTGATATAG ATGGTACAAC AGTAAGAGTT GCCATAGCGC ACGGTCTTGC CAATGTTGAC
CATCTTTTAA AAGAGATTGA AAAAACCAAA GCAGAAGGAA AGCCTTCGCC ATATGATTTC
GTTGAAGTTA TGGCTTGCGA GGGCGGCTGC GTAGGCGGCG GCGGGCAGCC TTACGGCGTT
ACGGACGAGC TTCGCAAAAA ACGAGCGGCG GGTTTATATA AAGATGACGA AAGCTCCAAA
GTGCGTTGTT CTCATTTAAA TCCCGCGGTC ATACAGGTTT ATAAAGAGTT TGTAGGCGAA
CCTTTAGGCC CGCAGGCTCA TAAGCTGTTC CATAACAAAT ACACTAAGAG AAAGACTTAC
AAAAAATAA
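
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence is used. The sketch below is one possible way to do that and to run basic checks (length, GC fraction, terminal stop codon). The helper names are hypothetical, and the stop codon set TAA/TAG/TGA is assumed for translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumed stop codons for translation table 11


def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of three and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First row of the gene sequence as printed above: six blocks of ten bases.
first_row = "ATGGTAAAAG CTATTATTAA CGGTACGGAA ATTCAGGTAA AAGAAGGAAC CACGATATTA"
seq = clean_sequence(first_row)
print(len(seq))  # 60
```

Passing the full sequence text through `clean_sequence` would allow its length and `gc_fraction` to be compared with the Gene Length and GC content fields of the record.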
 
Protein sequence
MVKAIINGTE IQVKEGTTIL EAARLVNINI PTLCKHPDLV ADAGCGICVV RVQGTGKMLR 
ACCTALEEGM KITTHDPEIV KVRKNVLELI LSNHPKDCLI CARNNDCELQ RLSSEFGIRD
AYYPLIVGRK KHKHDESTKT IDIEGSKCIL CGRCVQVCQK NQNVWALSFL GRGINTVLSP
AGEIELNDSP CVKCGQCSNH CPVGAIVEHD ETQKVWDALS NPDLFPVVQI APAVRVSIGE
AFGYPIGTNL TGKLISSLKK LGFKGVFDTN MGADMTIMEE GNEFVHRFKK KDNMPLITSC
CPAWVDFLEK FHSDMLDNFS TCKSPHEIIG VLSKTYYAKK HNVDPSKIFM VSIMPCTAKK
YEIHRSEEMF ASGHQDIDIS LTTRELARMI KQSGIDFKNI EDQKADSILG AYSGAGTIFG
ATGGVMEAAL RTAYHVITGK ELSKVEFKQV RGLKGIKEAN IDIDGTTVRV AIAHGLANVD
HLLKEIEKTK AEGKPSPYDF VEVMACEGGC VGGGGQPYGV TDELRKKRAA GLYKDDESSK
VRCSHLNPAV IQVYKEFVGE PLGPQAHKLF HNKYTKRKTY KK