Gene Emin_0377 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0377 
Symbol 
ID6264034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp401621 
End bp403636 
Gene Length2016 bp 
Protein Length671 aa 
Translation table11 
GC content39% 
IMG OID642610843 
Productsulfatase 
Protein accessionYP_001875271 
Protein GI187250789 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000264889 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000000000148489 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTTTTT CTCAAAAACT AAGCAGGGTA CCAAACTGGG TAAAAGGGTA TACCGGCTTT 
TTCGCGGCTA CATGGTTAGC TTTTTTCTTA ATGCGTTTTG TTTTTTTATT TGTTTACCGC
GCCGCAATTA CTGCTGAAGT TAAAACATAT CTTTTACAAT CTTTTTACAT AGGCGCTAAA
TTTGACGCGC GTTTAGCCGC TTTTTTGGCT TTGCCGCTGG GACTTTATTT ATTTATAAAA
AGTATTTTTC CTAAAATACC CGCGGTTTTT AATAAAATCA TGGCTTCGTT GTACACTGTT
ATTTTAACGG GCGCGGGGCT TGTTTACGCC GGAGATTTTG GGCACTATTC TTATTTGGGC
CTTAGGGTAA ACGCTTCTTT ATTTAAATAT CTTGAAAACG CTTTTATCTC TTTTGAAATG
GTTTGGCAAA CATATCCTGT TGTATGGGCT TCTTTAGGTT TTATACTTTT AATGTTTTTG
GCCTATAAAT ACGCGATGTT TTTTATAAAG CTGGGCCAAC CCGCTTCAAA CGACGGGTGG
AAGAAAAAAA CATCTTGGTT CCTTATTCTT TTTGTCTTAA CGTTTGGCGT TTGCTATGGG
CAAATTAACC AATATCCGTT ACGTTGGAGC AATGCTTATT TTTCAAGCAA TAATTTTATT
TCTAATTTAA CATTAAACCC GGTCTTAAAT ATTTATGACA CATATAGGTT TGCTAAGGAG
GATTCTTATA ATATTGAAGC CGTAAAAAAA TATTACCCCA TTATGGCAGA ATATTTAGGC
GTTGATAACC CCGATATAAA CACGCTTAAT TTTAAGCGTG AAGTAAAAGG AAAAGACCTC
GGAAGGCAGT TTAACATAGT TGTAATATTT ATGGAAAGTT TTGCATGGAA CAAAAGTTCT
TTTTCAAACA CCGGTAAATT TGATACCACG CCTAATGCGA AGGCTTTGGC TGAGCAGTCT
ATACTATTTA CCCAGTTTTA TACGCCCACT TCGGCTACGG CGCGCGCGGT ATTTGCAGCT
TTGTCAAGCA TACCTGACGT GAGTTCCTTT AAAACAAGTT CGCGTAATCC GCTTATCGTT
AACCAAAATT TAATAGCCAA CGAGCTTGAA GGTTATGATA AATTTTTCTT TATAGGCGGC
AGCGCAAGCT GGGGTAATAT AAGAGGTATT CTTTCCCATA ACTTAGACGG GCTTCAACTT
TATGAAGAGG AAGATTACGA GTCAAAACGT GTTGACGTTT GGGGTATTTC TGACTGGGAT
TTGTTTATTG AAGCCGACAA AGTTTTAGCA AAACAGGAAA GGCCTTTCTT TGCGGTAATA
CAAACGGCAG GGTATCACAG GCCTTATACA ATACCTCCGC ATGATGAAGG TTTTAAACTT
GAAAAAGATA TAAGTTATGA AGATCTTGTT AACCACAGTT TCGGCAGCAT TGAAGAGTTT
AACTCCTTAA GATTTTCTGA CTACGCGTTG GGTGAATTTT TCAGACGCGC GAGGAAAAGC
CCTTACTATA AAGACACCGT TTTTGTTATT TTCGGCGATC ACGGTTTGGA CGCGCCCAAG
TCTGAAAATA TGCCGAGAGG TTATGTTGAA TATAATTTAA TTAACCACCA TGTTCCTCTT
ATAATCCACG CTCCCGCTTT ATCTAAAGGC CGGGTTGTAA ATAAAACAGC CAGCCAGGTT
GATATAATGC CCACGGTGGC CGGCCTTATA GGCGCGCCGT ATGAAACTGT GGCTTTGGGG
CGTGATGTTC TTGACCCTAA ATACAAAGAA AAAGAAGGTG CGCTTGTTTT CGGATGGTCT
AAATATCCAC CCACGATTTC TTTTGTGTCA GGAGAATATC TTTACCACGA CCAAACCACT
CAAAAAGGTC TTTACAAGTT TGGTGCAAAA GATTATAATA AAGACCTGGC GGAAGAAAAC
CCTGAACTTT ATAAAAAAAT GGAAGATTTG TCCGCAGGTA TTTATGAAAC ATCCAGATAT
ATGCTTTACA ATAATCCAAA GAAGGAGAAG AAATAG
 
Protein sequence
MSFSQKLSRV PNWVKGYTGF FAATWLAFFL MRFVFLFVYR AAITAEVKTY LLQSFYIGAK 
FDARLAAFLA LPLGLYLFIK SIFPKIPAVF NKIMASLYTV ILTGAGLVYA GDFGHYSYLG
LRVNASLFKY LENAFISFEM VWQTYPVVWA SLGFILLMFL AYKYAMFFIK LGQPASNDGW
KKKTSWFLIL FVLTFGVCYG QINQYPLRWS NAYFSSNNFI SNLTLNPVLN IYDTYRFAKE
DSYNIEAVKK YYPIMAEYLG VDNPDINTLN FKREVKGKDL GRQFNIVVIF MESFAWNKSS
FSNTGKFDTT PNAKALAEQS ILFTQFYTPT SATARAVFAA LSSIPDVSSF KTSSRNPLIV
NQNLIANELE GYDKFFFIGG SASWGNIRGI LSHNLDGLQL YEEEDYESKR VDVWGISDWD
LFIEADKVLA KQERPFFAVI QTAGYHRPYT IPPHDEGFKL EKDISYEDLV NHSFGSIEEF
NSLRFSDYAL GEFFRRARKS PYYKDTVFVI FGDHGLDAPK SENMPRGYVE YNLINHHVPL
IIHAPALSKG RVVNKTASQV DIMPTVAGLI GAPYETVALG RDVLDPKYKE KEGALVFGWS
KYPPTISFVS GEYLYHDQTT QKGLYKFGAK DYNKDLAEEN PELYKKMEDL SAGIYETSRY
MLYNNPKKEK K