Gene Emin_0391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0391 
Symbol 
ID6262543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp416947 
End bp419157 
Gene Length2211 bp 
Protein Length736 aa 
Translation table11 
GC content40% 
IMG OID642610857 
Product(NiFe) hydrogenase maturation protein HypF 
Protein accessionYP_001875285 
Protein GI187250803 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0068] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00143] [NiFe] hydrogenase maturation protein HypF 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000910038 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.000000117936 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTAAGAA AAAGGATTTT AATAAACGGC GTTGTGCAGG GCGTGGGTTT TAGGCCTTTT 
ATTTATAAAA GCGCTTTAAA ATTTAGGCTT AGCGGCTTTG TGCAAAATAC TTCGGCGGGT
GTTATAACTG AAGCTCAGGG GAGTAAAGAA AATATAAATA AATTTATACT TTTTATAAAA
CAAAACTGCC CGCCGCAGGC TAAGATAGAA GATCTTAAAA TAACGGATAT AAAAGAAAAA
AAAGAAAAAG GTTTTTTAAT TATCGCAAGT AAAAAAGACA ATATTCAAAC CGTTTTTTTA
CCGCCTGACT TAGCTATGTG TAAAGATTGC GGCAAGGAAA TACTTAACCC CAAAGACCGC
CGTTTTTTAT ACCCGCTTAC CAACTGCACC AACTGCGGCC CCAGGCTCAG TATTACAAAA
ACTTTGCCCT ATGACCGCAA GGCAACCACA ATGTCAAAGT TTAAAATGTG CCCCGCCTGC
CAAAAGGAGT ATGAAAGCCC AAACTCCCGC CGTTTTCACG CTCAGCCTAA CGCCTGCCCA
GTCTGCGGGC CGCAAGTATT TTTCAAATTT AAAAATAAAA ATTTTAAAGG TTTAAAAGCC
TTAAAAACCG CAGCCGTTTT TTTAGATAAA GGTAAAATCA CGGCTGTAAA AAGTATAGGC
GGTTATCATT TAGCCTGCGA CGCTAAAAAT CCTAAAGCTG TCACTTCTCT GAGACTTAGC
AAAAAACGCC CCTACAAACC TTTAGCTATA ATGTTTAAAG ATTTAAAAAC CGCCAAAAAA
TATTGTTTTA TAAATAAAAT TGAGGCCCGC GCCCTTTTGT CGCCCGCCGC GCCTATAGTA
ATTGTAAAAA AGAAAAAAGA GCTTAAGCTT ATTTCGGATA ATCTTAATAA TTATGGTATT
ATGCTGCCTT ATACGCCGCT TCATAAAATA TTATTTTCCC TCTTAAAAAC GGATGTTTTG
GTCATGACTT CGGGCAACGC CAAAGGCGGC GCGCTTTGCA GCGGCGATGG GGAAGCTTTT
AAAAAACTTT CCAAAATAGC GGATGGCTTT TTATACCACG AGCGTGAAAT TTATAATAAG
GTTGATGACA GTATTATGTT TGAAGCCTTG GGCAAAATGC GTTTTATAAG GCGGGCCAGA
GGCTTTGCCC CGCTGCCCGC CACGCTTGCT AAAAAAGCTA AAAAAAGCAT TTTAGCTTTG
GGGGCTGATA AAACGGCCGC TTTTTGTTTA GTTAAAGAAA ATAAAGCTTA TCTAAGTCAG
TATATAGGCG ATTTGAATAA AAAAGAAAAC GGCGGGTTTT ATTTAACCGC GCTTGAAAAA
AATAAGAGGC TTTTAGGGAT AAACCCCCAA GTTACAATTG CCGATTTGCA CCCCGGTTAT
TTTACAAACA AACTGCCTTT TAAGAATGTA AATAAAATAC AGCATCACGC TGCGCACGCT
TTAAGCGTGG CGGCGGAGCA TAATTTAAAA GGCCGTTTTT TAGCGGTAAA TCTTGACGGT
AGCGGACTGG GAAGCGACGG CGCCATATGG GGCGGGGAGT TTTTGGCTTT TAATAATAAA
AATTGGAAAA GGGCCGCTTA TTTTGAGCCT TTGCCCTTAC CCGCCGGGGA TGAAAGCGTA
AGCGAAATTT GGCTCCTTAC ATTAGGCGCT ATTAAAAAAA TTTATGGCAA GGATTGGACT
AACTATAAAT ATCTTTTTAA AAATGTGCCG CGGCAAAAAT TTAGGCTGGC CTTAAAACTT
ATAGACAACG GTATTAACGT TTATAACTCT AGCAGCGCGG GCCGTGTTTT TGATATTGTA
AGCCATATCG CTTTAGGCAT TACCAAAACC ACATACCAGG CCCAGGCGGC CATGGAGCTT
GAGGCAAAAT GCCTTAGGCT TAAAAGCCCT TATAAAGTTG TTATGGAAAA ACAAAATGGT
TGTTACATTA TAAAAACAGG GGAGATGTTA AAAGAAATTT TATCTTATTC GAGGAGCGCG
GAAGAAATAT CGGCAAGATT TCACGCTTAC ATGGTTTACT CTGTTTTGGA AACGGCAAAA
AAACTTAAAC TTAAAAATAT TTGTTTAAGC GGAGGCGTAT TTCAAAATAA AGTTTTATTG
AAAGGCACGG TAAATGTTTT AAAACGCGCA GGATTTAATG TGTATTTAAA TGAACAAACG
CCTTGTAACG ACGGGGGGTT GGCTCTCGGA CAGGCGTGGA ATATGGTTTA A
 
Protein sequence
MLRKRILING VVQGVGFRPF IYKSALKFRL SGFVQNTSAG VITEAQGSKE NINKFILFIK 
QNCPPQAKIE DLKITDIKEK KEKGFLIIAS KKDNIQTVFL PPDLAMCKDC GKEILNPKDR
RFLYPLTNCT NCGPRLSITK TLPYDRKATT MSKFKMCPAC QKEYESPNSR RFHAQPNACP
VCGPQVFFKF KNKNFKGLKA LKTAAVFLDK GKITAVKSIG GYHLACDAKN PKAVTSLRLS
KKRPYKPLAI MFKDLKTAKK YCFINKIEAR ALLSPAAPIV IVKKKKELKL ISDNLNNYGI
MLPYTPLHKI LFSLLKTDVL VMTSGNAKGG ALCSGDGEAF KKLSKIADGF LYHEREIYNK
VDDSIMFEAL GKMRFIRRAR GFAPLPATLA KKAKKSILAL GADKTAAFCL VKENKAYLSQ
YIGDLNKKEN GGFYLTALEK NKRLLGINPQ VTIADLHPGY FTNKLPFKNV NKIQHHAAHA
LSVAAEHNLK GRFLAVNLDG SGLGSDGAIW GGEFLAFNNK NWKRAAYFEP LPLPAGDESV
SEIWLLTLGA IKKIYGKDWT NYKYLFKNVP RQKFRLALKL IDNGINVYNS SSAGRVFDIV
SHIALGITKT TYQAQAAMEL EAKCLRLKSP YKVVMEKQNG CYIIKTGEML KEILSYSRSA
EEISARFHAY MVYSVLETAK KLKLKNICLS GGVFQNKVLL KGTVNVLKRA GFNVYLNEQT
PCNDGGLALG QAWNMV