Gene Emin_0161 details

Gene Information

Locus tag: Emin_0161
Symbol:
ID: 6262919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 172516
End bp: 174519
Gene Length: 2004 bp
Protein Length: 667 aa
Translation table: 11
GC content: 40%
IMG OID: 642610625
Product: organic solvent tolerance protein OstA-like protein
Protein accession: YP_001875063
Protein GI: 187250581
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1452] Organic solvent tolerance protein OstA
TIGRFAM ID:
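
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a terminal stop codon that is not counted in the protein length; the record does not state either convention explicitly, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends in one uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(172516, 174519)
print(length_bp)                  # 2004
print(protein_length(length_bp))  # 667
```

Both values agree with the Gene Length and Protein Length fields listed in the record.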


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGTTC CTTCGGGCGA TACTTACATA TATTTTAATG CTGATGAACT TGATTATGAC 
GGCAATACCA GAACAGCAGG ATTACTAGGG GATGTTAATG TAACCGCAGT GAACTCAGAT
TTTGATGAAA CACGCATATA TTCCGAAAAT TTATATATAA ATCAGGCCGA GCAAAAAGTT
TATAATGAAG GCGAGGTTAG AGTAGAGCAG GACGGTGGAG AATTAAGAGG GGAAAATCTT
TTTTACGATT ACCAAAACAG TCATTTGACT CTTCAAAATA TTTCGGCCGA ATATCCTCCG
ATAAGGCTTT TACACGCTGA CAGCGCGGAA TTTAAAAACG GAAGGCAGCG GTACAGAGGC
GCGCAGGTAA CCTGCTGTGA CCAGGAAGAT CCCCATTATC ATTTAAAAGC CGGCAGCGTT
ACAATGACTC CCGAAAAAAG AGTTTATGTT ACTAACGCGT TATTATATTT AAGCGATGTT
CCTGTTTTTT ATCTGCCGTT TTTTTGGCGT TCTTTAGATT CAAAAAAACC TTTTACTACC
TACGTTGATT TTACGCAAAG CGCAAGAACG GGTTACGGTC TTTTAACAAG CACTGTCTTC
TTTCCTACAA GAAATTTAAG AACAACCGTT AATCTTGACG GTTACACAAA AGCGGGTTTT
GGTTACGGCG TACAGCTTCT TGTTCAAAAT ACGGATAAAG TAATTGGTAA TTTGGAAGCC
TACGCCATTG ATGACCAAAA GGAAGAAGAT TACCGCTGGG GCGTAAGCGG CGGTTTTTGG
GCTCAGTTGT TTGACAATTC TGACCACTTA AACAGAGATG ACGGCGGCGC CATGTATACA
ATGCAGTCAC AGTTTAGAAG TGTGTCCGAC CCGTATTTTA ACGACACTTA TTTCCGTTCC
AACCCGTTTA AATTTATGCC TGACCAGGAT ATAAATGTGG CTTTTTCCAG GCAGAGCAGA
AGGTCTATTA CACGCATAAG CTATTCCCAA AAAGATGAAT ATAATTATGT TACCGAAGAG
TATGAAATAG TTGAAAAAAT TCTTCCTAAA TTTGAATATC ATCTTATGCC GTTTACCCTT
CCTTTGGGCA TAGTAAACCG TTTTAACGCT AATGTTTATA ATACCGAAGT AAGAGACGAA
GGTTTTAAAC AAACCGCGGG GGCTTTTTGG CACAGCAGCA GGTCTTTTAA TTTAAACAGA
ACTTTTACCC TTTTGCCGTA CGTGTCGCTT GATGAAAGAG TGATTTTCAG AGATAAAGAT
AATAATGAAG ACGCATATGT AACCCGTTTA GGTTCCGGCG TTAATTTAAG AGCGGAGCTG
CTTACCGGAT CTTTAGATAT ATCCTATGAT TATTTAAAAA GATTTTCCAC CGGCACTCTA
AATACTGACA ATGTTTCCGA TGATATGGGG GAAGAGCTTA ACAGAATCTA TATACAAAAC
CATTACCGTC CGTCGCAATG GCTATATTTC AGGCTTGGCA CGGGGTATGA CTTGCGGGAA
ACACAAGACA ATTGGAGCTT TGATAACAGA ATGCTGCCTA TTCTTGCCGA ACTGGGCGTT
AACACAATGA ACGGAGACCT TAATTTGTTC GCGCAGAATC TTTATGACGT TAAAGAAGGA
CAGGAAGCGT TTGTTTTACA AAGTGATTTT AGAATGTTTA AAAAAAGCCG TATGGTTTTT
GGCATGAACA ATTATTCTTT AGACCAAAAC AGCTATCTTT TTAACACTAA ATTTTGGTTC
AGGCCCGAAA ATATTACCTG GTACTTTGAC GTAGGCGTTG ATTTTGAGAT AAGGCAAGGT
TCTTTAAACG CGTATTCAAG AAGTTTTAAA GTTTACAAAG ATTTTCATGA CGCGCATATG
GAATTTGGCG TTGAGGACAG AAACAACAAC CTTTCTTTTG CTTTTAGGAT AGCTGTGCTT
TGCGGAAAAA AACACAGGGA CGACACCTTC AGTAAGGAAG ACCGTTACTG GTCCCCCTGG
CGTAATCCCG GCGATTTAAG ATAA
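
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. As a minimal sketch, the helpers below (hypothetical names, not part of the source database) strip the whitespace and compute the GC percentage that the GC content field reports. The demonstration uses only the first line of the sequence, so its result is not the whole-gene value.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only; paste the full sequence
# to compare against the GC content field of the record.
first_line = (
    "ATGCCGGTTC CTTCGGGCGA TACTTACATA "
    "TATTTTAATG CTGATGAACT TGATTATGAC"
)
print(len(clean_sequence(first_line)))
print(round(gc_content(first_line), 1))
```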
 
Protein sequence
MPVPSGDTYI YFNADELDYD GNTRTAGLLG DVNVTAVNSD FDETRIYSEN LYINQAEQKV 
YNEGEVRVEQ DGGELRGENL FYDYQNSHLT LQNISAEYPP IRLLHADSAE FKNGRQRYRG
AQVTCCDQED PHYHLKAGSV TMTPEKRVYV TNALLYLSDV PVFYLPFFWR SLDSKKPFTT
YVDFTQSART GYGLLTSTVF FPTRNLRTTV NLDGYTKAGF GYGVQLLVQN TDKVIGNLEA
YAIDDQKEED YRWGVSGGFW AQLFDNSDHL NRDDGGAMYT MQSQFRSVSD PYFNDTYFRS
NPFKFMPDQD INVAFSRQSR RSITRISYSQ KDEYNYVTEE YEIVEKILPK FEYHLMPFTL
PLGIVNRFNA NVYNTEVRDE GFKQTAGAFW HSSRSFNLNR TFTLLPYVSL DERVIFRDKD
NNEDAYVTRL GSGVNLRAEL LTGSLDISYD YLKRFSTGTL NTDNVSDDMG EELNRIYIQN
HYRPSQWLYF RLGTGYDLRE TQDNWSFDNR MLPILAELGV NTMNGDLNLF AQNLYDVKEG
QEAFVLQSDF RMFKKSRMVF GMNNYSLDQN SYLFNTKFWF RPENITWYFD VGVDFEIRQG
SLNAYSRSFK VYKDFHDAHM EFGVEDRNNN LSFAFRIAVL CGKKHRDDTF SKEDRYWSPW
RNPGDLR
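
The protein sequence follows from the gene sequence through translation table 11, whose codon-to-amino-acid assignments are the same as the standard code (the tables differ in permitted start codons). The sketch below is a plain-Python illustration under that assumption, not the database's own pipeline; `translate` and `CODON_TABLE` are hypothetical names.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored, a trailing partial codon is dropped, and
    codons with unrecognized bases are rendered as "X".
    """
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGCCGGTTC CTTCGGGCGA TACTTACATA"))  # MPVPSGDTYI
```

The ten residues produced match the start of the listed protein sequence.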