Gene Emin_0352 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0352 
Symbol 
ID6263643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp376360 
End bp378162 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content37% 
IMG OID642610818 
Productsulfatase 
Protein accessionYP_001875248 
Protein GI187250766 
COG category[R] General function prediction only 
COG ID[COG3083] Predicted hydrolase of alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0469507 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATGA AACTTAAGAA CATTTTTATA TTTACTTTTA TAAACCTTGC GATTTGGTGT 
CTGCTGAGTT TGAAATACTA TTTTACAAGC GGATTTTATT GGGATTTTGC GGGACTGATT
TTTACTTTAA CATTTATTCC CGGGCATTTA TTACTTTTTG CTTTAGGGCT TTTTGTTATT
TTGTGCCTTG CCAGTATAAT AGGCCCTAGG TTTTGCAAAA ACTTCGCTAT TTTTGCGGGT
GCTTTTTTCA CTCTTTTCTT TTTAACTGAC ATAATAGTAT ATTCCCAATA CCGCTTTCAT
ATAAGCCTTT CTATGGCGGA GCTGTTTTTT GGACCCGCCG GGAGGGAAAT TTTTGTTTTT
CCTATAGGAA TGTATTTGCT TATGGCGCTA GGTGTTCTTG TTGTATTAAT AGTGCAGGGA
ATTGCGGTTG CGGTGTCGTG CAACATTAAG GTTCCGAACA GATTAGTAAT TTTAGGTTTT
ATAGCATTAG TTTTTTGTTT TATAGCGTTT AATTCTTTAT ACGCTTGGGC CAAGTTTGTT
TCCGTTCCAA GTATAACGGC GCAAATATCT TATTTGCCGT GGGCAAATCC GTTGAGTGTT
AATACAAGGC TTAAAAAAAT GGGTTTAAAT CCCAGTTCTG AGCCTTTAGT GGCCGCTAAA
GGAGAAATGC TTAATTATCC GTTAAATCCT TTAAAGTGCG AAAGCGTAAA TCCCAAACTT
AACGTTCTTT TTATTTTGGT TGACTCTTTA AGGTCGGACA TGTTTACGCG CGAAATAATG
CCTAAAACTT ACGCTAAATA TAAAAACAGC CGCAACGGAT TCCATTTTAA AAACCATGTA
AGCGGCGGCA ACGCCACGCA GGCGGGCGTA TTTGCTTTTT TCTACGGGCT TCCTTCAACA
TACTGGAACG CTTTTTCCTC ATATAATATG GAGCCTGTTT TTATGCAGGA AATGAGAACC
AGGGGATATG AATTTGGCAT ATTTTCAAGC GGTAAATTAA ACAGTCCGGA ATTTCATAAA
AATATCTTTT CAGGCATAGA TAACTTAAGG ATCGAGTCTA AGGGAGATAC TAAGTATGAA
AGAGATATAG ATATGCAGCG CGATTTTGAG GCGTTTTTAG ACAACAGAGA TAAAAAAAGA
CCGTTTTTCG CTTTTATGTT TTATGATTCA CCGCATGGCT TTGAATACCC TCCGTCATTT
AAAGAAAAAT TTAAACCCGC AAAAGAGTTA AATTATATTT CACTTACATC TTCTACGGAC
CCGAAACCCT ATCTTAACAA GTATAAAAAC TCTATTAACT TTATAGACGG CAAACTGGGC
GAAGTTTTTG ATATGCTTAA AGACAGAAAA ATAAATGCGG AAACCGTTGT TATTATTACA
GGCGACCACG GACAGGAAAT CAACGATACC GGCAATAATT TTTGGGGACA TAACAGTAAT
TTCGCCAAAT ATCAAACGCA TACCCCTCTA ATAATGCTTT GGCCCGATAA AAGAGGCAAA
GATATTGAAT ACAGAACAAC GCATTATGAT ATTGTTCCTA CGGTTATGAA AGAAATTTTG
GGTTGTGTGA ACCCGCCTTC CGATTATAGT ATAGGTTATA ATTTGTTTGA TGATACTCCC
AGACCGTACA GCCTTGTTAT AAGTTATACA AAAAAGGCGG TTATAGTTGA CGACAATGTT
TCTGTTATAG ATAATTACGG CGCTTTGGAG AATTATGACG ACCAGTACCG CCCGTTAAAA
GAAAGTGTTG ATTCAAAAGC GATATCGGCC GCGTTAAAGG ATTTATCAAC GTTCTATAAA
TAA
 
Protein sequence
MNMKLKNIFI FTFINLAIWC LLSLKYYFTS GFYWDFAGLI FTLTFIPGHL LLFALGLFVI 
LCLASIIGPR FCKNFAIFAG AFFTLFFLTD IIVYSQYRFH ISLSMAELFF GPAGREIFVF
PIGMYLLMAL GVLVVLIVQG IAVAVSCNIK VPNRLVILGF IALVFCFIAF NSLYAWAKFV
SVPSITAQIS YLPWANPLSV NTRLKKMGLN PSSEPLVAAK GEMLNYPLNP LKCESVNPKL
NVLFILVDSL RSDMFTREIM PKTYAKYKNS RNGFHFKNHV SGGNATQAGV FAFFYGLPST
YWNAFSSYNM EPVFMQEMRT RGYEFGIFSS GKLNSPEFHK NIFSGIDNLR IESKGDTKYE
RDIDMQRDFE AFLDNRDKKR PFFAFMFYDS PHGFEYPPSF KEKFKPAKEL NYISLTSSTD
PKPYLNKYKN SINFIDGKLG EVFDMLKDRK INAETVVIIT GDHGQEINDT GNNFWGHNSN
FAKYQTHTPL IMLWPDKRGK DIEYRTTHYD IVPTVMKEIL GCVNPPSDYS IGYNLFDDTP
RPYSLVISYT KKAVIVDDNV SVIDNYGALE NYDDQYRPLK ESVDSKAISA ALKDLSTFYK