Gene Emin_0752 details


Gene Information

Locus tag: Emin_0752
Symbol:
ID: 6263388
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 827983
End bp: 829662
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 40%
IMG OID: 642611227
Product: putative PAS/PAC sensor protein
Protein accession: YP_001875644
Protein GI: 187251162
COG category: [R] General function prediction only
COG ID: [COG4624] Iron only hydrogenase large subunit, C-terminal domain
TIGRFAM ID:
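
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in Protein Length. The function name `check_cds_lengths` is hypothetical.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check CDS coordinates against the reported lengths.

    Assumptions (not stated in the record itself):
      * coordinates are 1-based and inclusive
      * the last codon is a stop codon, excluded from the protein length
    """
    span_bp = end_bp - start_bp + 1      # inclusive span on the replicon
    codons, remainder = divmod(span_bp, 3)
    return {
        "span_bp": span_bp,
        "span_matches_gene_length": span_bp == gene_length_bp,
        "in_frame": remainder == 0,      # a complete CDS is a multiple of 3
        "residues": codons - 1,          # drop the terminal stop codon
        "residues_match_protein_length": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields for Emin_0752.
report = check_cds_lengths(827983, 829662, 1680, 559)
print(report)
```

For this record the span is 1680 bp, which is 560 codons, or 559 residues plus one stop codon, so the three fields agree under the stated assumptions.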


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.801139
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000000202467
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGGGAAA TTTTAAAATT AAAAAAATCA AATTGCAAAA ACTGTTACAA ATGCATAAGG 
CATTGCCCCG TTAAGTCTAT AAGAGTGTCC GGCAACCAAG CCCATATCAT TAATGATGAA
TGTATTTTGT GCGGACAGTG CTTTGTTGTT TGCCCGCAGG GCGCAAAACA AATTGAATCA
GATATTGAAA AAGTAAAGGT AATGCTTTAC GGTAAAGAAC CTGTAATAGC AAGCCTTGCT
CCTTCATTTA TTGCAAATTA CGGCGGCGCG GGTATAAAGT CTTTAAGTAA AGCGCTTAAA
AAGCTAGGGT TCCATTCTGC TGAGGAAACC GCCATAGGCG CCACAATGGT TAAAAACGAG
TATGAAAATA TTTTGGCTAA AAAAGAACAT GATATTTTAA TTTCCTCTTC CTGCCACTCG
GTTAATTTGC TTATTCAAAA GCATTTTCCC TCCGCGCTTA AGTATTTAGC CAATGTAATG
TCACCTATGC AGGCGCATTG CCGGGATATA AAAAAACGTT ATCCCGGCGC GAAAACAGTT
TTCATAGGAC CTTGCGTATC TAAAAAAGAC GAAGCCCAGC GTTACCCCGA TATTGTGGAC
GCCGTGCTTA CTTTTGAGGA ACTTACCCAA TGGTTCAAAG AGGAAAAAAT TACCGTTGAA
CCTGAAAAAA ATAAAACCGC GGAAGGCAAG GCAAGGTTTT TTCCAACTTC GGGAGGGATA
TTAAAAACCA TGAGGGGCAC CGCTAAAAAA GGGGAATATA AGTATATAGC GGTTGACGGT
ACGGAAGCGT GTATAGACGC TCTTAACGAT ATTAAAGATG GCAACATTTT CCGCTGTTTT
ATAGAAATGT CCGCCTGTAA GGGCAGCTGT ATAGGCGGCC CTATTATGGA AAAATACAAC
CGTTCGCCCA TAAAAGGCTA TATGCAGGTT GCCGCTTATG CGGGTGAAAA AGATTTTAAG
GTTGAAGATT ATTCCGGCAA GATAACACAA AAACGCAACC CGCTGCATTT TAATAAAAAA
GAAATTCCCC CCCATGAAAT AGAGGAAATT TTAAAGCAGA TGGGCAAAAC AAAACCCGAG
CATGAACTCA ACTGCGCTTC GTGCGGTTAT AATACCTGCA GGGAAAAGGC CGCCGCCGTT
TACCTGGGTA ACGCTAATAT ATCAATGTGC CTGCCGTTTT TAAAAGACAA GGCGGAAAGC
TTTTCTGACA ATATTATTAA TAACACGCCA AACGCTATTT TGGTTCTTAA TGAGAATTTA
GAAATTCAGC AAATTAACAG ATCGGCTTGC AAACTTATGA ATATAGCCCA CCATTCTTAT
GTTTTGGGCG AGCCTGTCGT ACGCATTTTA GATCCGCAGA TTTTTATGCA GGTGCGCGAC
AGCGGCGCAA GCGTGAGGGA AAGAATTACC TATCTTTCCG AATACAAAAA ATATGTCGAA
AAAACTGTTT TGTACGACAG GGACTTTCAT ATTCTTATTT GTATTATGAG AGATATTACC
GCCGAAAAAG AAGAAGAAAA GAAAAAAGAA GAACTTAGAA GGAAAACTAT AGAAACCGCC
GATAACGTTG CCGATAAACA AATGCGTATC GTGCAGGAAA TCGCCTCTTT ATTAGGGGAA
ACGGTGGCGG AAACAAAAAC CTCTCTCACA AAACTAAAGG AAAGCATTAA CGATGAATAA
 
Protein sequence
MREILKLKKS NCKNCYKCIR HCPVKSIRVS GNQAHIINDE CILCGQCFVV CPQGAKQIES 
DIEKVKVMLY GKEPVIASLA PSFIANYGGA GIKSLSKALK KLGFHSAEET AIGATMVKNE
YENILAKKEH DILISSSCHS VNLLIQKHFP SALKYLANVM SPMQAHCRDI KKRYPGAKTV
FIGPCVSKKD EAQRYPDIVD AVLTFEELTQ WFKEEKITVE PEKNKTAEGK ARFFPTSGGI
LKTMRGTAKK GEYKYIAVDG TEACIDALND IKDGNIFRCF IEMSACKGSC IGGPIMEKYN
RSPIKGYMQV AAYAGEKDFK VEDYSGKITQ KRNPLHFNKK EIPPHEIEEI LKQMGKTKPE
HELNCASCGY NTCREKAAAV YLGNANISMC LPFLKDKAES FSDNIINNTP NAILVLNENL
EIQQINRSAC KLMNIAHHSY VLGEPVVRIL DPQIFMQVRD SGASVRERIT YLSEYKKYVE
KTVLYDRDFH ILICIMRDIT AEKEEEKKKE ELRRKTIETA DNVADKQMRI VQEIASLLGE
TVAETKTSLT KLKESINDE
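
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates that relationship; it is an illustrative example, not the tool used to produce this record. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the tables differ in which start codons are permitted, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Codon assignments are identical in translation tables 1 and 11.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna, to_stop=True):
    """Translate a DNA string, ignoring the whitespace used for 10-base blocks."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk complete codons only; any trailing 1-2 bases are ignored.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
first_block = "ATGCGGGAAA TTTTAAAATT AAAAAAATCA"
print(translate(first_block))  # MREILKLKKS

# Last 60 bases of the gene sequence, ending in the TAA stop codon.
last_block = "ACGGTGGCGG AAACAAAAAC CTCTCTCACA AAACTAAAGG AAAGCATTAA CGATGAATAA"
print(translate(last_block))   # TVAETKTSLTKLKESINDE
```

Passing the full gene sequence to `translate` in the same way should reproduce the full protein sequence listed above.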