Gene Emin_0389 details

Gene Information

Locus tag: Emin_0389
Symbol:
ID: 6262473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 414916
End bp: 415956
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 40%
IMG OID: 642610855
Product: hydrogenase formation HypD protein
Protein accession: YP_001875283
Protein GI: 187250801
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0409] Hydrogenase maturation factor
TIGRFAM ID: [TIGR00075] hydrogenase expression/formation protein HypD
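
The coordinates and lengths listed above are mutually consistent, which a short Python check can confirm. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 414916
end_bp = 415956

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should contain a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = codons - 1

print(gene_length)     # 1041
print(remainder)       # 0
print(protein_length)  # 346
```

Both results match the Gene Length (1041 bp) and Protein Length (346 aa) fields.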


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.483899
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0000000115047
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGATAAAAA AATTAAATGA TTTAGCTAAA AGGCTGTCGC GTAAGGTTAA CATAATGGAA 
GTTTGCGGCA CGCATACAAA CTCAATAGCT AAAAACGGTT TAAAAAGCCT CTTAAATGAA
AATATTAATT TAATTTCAGG CCCCGGCTGT CCTGTTTGCG TAAGCGCGGA CGGTGATATC
GAGGCCGCTA TAGATTTAGC CTTAAAAAAG GATAATATAA TTTTTACTTT TGCCGATATG
CTCCGCGTGC CGGGGCGTAA CGGCAGTTTG CAGGAAGCTA AAGCTTCGGG TGCGGATGTC
AGAGTTATTT ACAGCCCTTT AGACGCTTTT TTGGAAACAG GAAAAACAAA TAAAACCGTA
ATTCTTTTAG CAAGCGGTTT TGAGACGACG GCCCCTTTAA TAGCCGTTTG TTTAAAAAAA
GCGAAAGAAG CGGGGTTTAA AAACTTTTTT GTTTTTCCCG TTTTAAAACT TATTAACCCC
GCCATAACAG CGCTTTTAAG TGAAGAAAAT AAAATAGACG GGTTTTTGTT GCCCGGGCAT
GTCAGTTTGG TTATAGGCAA AAAACCTTAC AGTTTTATAA GCAAAAAATT TAATAAACCG
GGTGTTATAG GCGGTTTTGA AGCGGAGGAA ATTGTTGCCG CTCTTATAGA AATAGTTAAA
CAGCTTTTAG AAGGCAAAGC CCAAATACAA AATGCCTACC CCGCAATAAA AGAAGAAGGC
AACCAAACCG CCCTAAAAAT GATAGAGGAT GTTTTTGAGC CTTACGACGC GGTTTGGAGG
GGATTTGGCG TAATTCCTTC CTCAGGGCTT AAAATAAAAA AAGAATTTAG AGAGTTTGAC
GCTTTAATAA AATTTAACAT TAAACCCTGT TACGGCGGTT CTTTAAATAA AGCATGCAAA
TGCGCGGAAG TTTTAAAGGG TAAAATAAGC CCCGTAAAAT GCCCGCTTTT TGGCAAAAAA
TGCGCGCCCG GCAAACCTTT AGGGCCGTGT ATGGTATCAA GCGAGGGGGC ATGCAACGCC
TTATACAATT ATGAAAAATA A
 
Protein sequence
MIKKLNDLAK RLSRKVNIME VCGTHTNSIA KNGLKSLLNE NINLISGPGC PVCVSADGDI 
EAAIDLALKK DNIIFTFADM LRVPGRNGSL QEAKASGADV RVIYSPLDAF LETGKTNKTV
ILLASGFETT APLIAVCLKK AKEAGFKNFF VFPVLKLINP AITALLSEEN KIDGFLLPGH
VSLVIGKKPY SFISKKFNKP GVIGGFEAEE IVAALIEIVK QLLEGKAQIQ NAYPAIKEEG
NQTALKMIED VFEPYDAVWR GFGVIPSSGL KIKKEFREFD ALIKFNIKPC YGGSLNKACK
CAEVLKGKIS PVKCPLFGKK CAPGKPLGPC MVSSEGACNA LYNYEK
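
As a sanity check on the two sequences above, the following Python sketch translates a nucleotide string and computes its GC fraction. It uses the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs only in which codons may act as start codons, and that is not modeled here). The names `translate` and `gc_fraction` are hypothetical helpers, not part of any database API, and the example only exercises the first and last codons of the gene rather than the full sequence.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout of genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence.
print(translate("ATGATAAAAA AATTAAATGA TTTAGCTAAA"))  # MIKKLNDLAK
# Last 21 bases of the gene sequence, including the TAA stop codon.
print(translate("TTATACAATT ATGAAAAATA A"))           # LYNYEK
```

Passing the complete gene sequence to `translate` should give the protein sequence listed above, and `gc_fraction` on the same input can be compared with the GC content field.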