Gene Emin_0811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0811 
Symbol 
ID6262588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp888331 
End bp889602 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content41% 
IMG OID642611289 
Producthypothetical protein 
Protein accessionYP_001875703 
Protein GI187251221 
COG category[S] Function unknown 
COG ID[COG3681] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.000000789442 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACCTGC TTAAAGAAGT TTTAAAAAAC CAAGTTTACC CCGCTATGGG CTGTACGGAA 
CCTGTTTCCG TGGCCTTATG TGCCGCTTAC GCAGCTAAAG AATTGGGCAA ACCCGTGCAA
AAAGCGGTTT TTTATTTAGA CGCCGGCACA TTTAAAAACG GCCTTGCGGT ACGTATTCCC
AATACCAGCG GGGAAAGGGG TAATTTACTT GCCGGAACCG CCGGGCTTTT GATAGCAAAA
CCGCAGTTAA AAATGGAAAT TTTAAAAGCC GCCACACCGT CAATACTTAA GCGCGCCAAA
CAATTAATAG ACGATAAAAA AGCGTTTATC AAAGTAGCTC CCTGTAAAAA ACACTTTTAT
ATAAAAGTAG AGGTTGAAAA CGGTAAAGAC AAAGCCTCCT GTGTTATATC GGACAGCCAC
ACCACGGTCA GCAAATTAAC AAAAAACGGC AAAGTTATTT TTGAAAACAA ACCTTCCAAA
AAGAAAGAAG ATAATTATAA GCAGCTTTTG GGTAAAGCCA CATTAAAAGA TCTTATAGCA
CTTGCGGATA ACGCTGACAA CACGGATTTA AAATATATAA AAAAAGGCGT TGAAATGAAT
TTAAACGCCT GTAAAGAAGG CAAAAAACTA AAAAAAGTAG GCTTTTTTTT AGAAAGCACT
GTGGAAAAAA GTATTTTGCA AAAAAATCTT GTTACCGAAA CTAAAATAAT GGCCGCCCGC
GTGGCCGACG CAAGGATGGA CGGCATTGCC GTACCGGTAA TGAGCAGCGG CGAAAGCGGA
AATCAGGGCG TAGTGGCCAT TTTAGTGCCT TATAACGTGG GTAAAAAATC AAAGGTAAAA
GAAGAAAAGA TATTAAAAAG CATAGCTTTT TCTCATTTAC TTAACGGATA TGTTAAAGTT
TATACGGGAA GTCTTTCTCC TCTATGCGGC TGCGCCATAG CCGCGGGAGT CGGCGCCGCG
GGAGCTATTG TCTACCAGCA AAACGGCGAT TTAAAAAAAA TAACATTAGC CATAAATAAT
ATTATAAGCG ATATCGGCGG CATGTTATGC GACGGGGCAA AAAGCGGCTG CGCTTTAAAG
GTGGTAAGCT CCGTTGACAG CGCTATAAGA GCCGCCTATA TGGGCCTTAA CAATTACGGC
ATTACGGAAC TTGAAGGATT TATAGGTAAA ACAGCCGAAG AAACCATACA AAACCTGGGC
AATATATCAA TTACCGGAAT GTGCGACGTT GACGCTGTTA TAGTCGATAT AATGAAGAAA
AAGGTTAAAT AA
 
Protein sequence
MNLLKEVLKN QVYPAMGCTE PVSVALCAAY AAKELGKPVQ KAVFYLDAGT FKNGLAVRIP 
NTSGERGNLL AGTAGLLIAK PQLKMEILKA ATPSILKRAK QLIDDKKAFI KVAPCKKHFY
IKVEVENGKD KASCVISDSH TTVSKLTKNG KVIFENKPSK KKEDNYKQLL GKATLKDLIA
LADNADNTDL KYIKKGVEMN LNACKEGKKL KKVGFFLEST VEKSILQKNL VTETKIMAAR
VADARMDGIA VPVMSSGESG NQGVVAILVP YNVGKKSKVK EEKILKSIAF SHLLNGYVKV
YTGSLSPLCG CAIAAGVGAA GAIVYQQNGD LKKITLAINN IISDIGGMLC DGAKSGCALK
VVSSVDSAIR AAYMGLNNYG ITELEGFIGK TAEETIQNLG NISITGMCDV DAVIVDIMKK
KVK