Gene Emin_1069 details


Gene Information

Locus tag: Emin_1069
Symbol:
ID: 6263259
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1162579
End bp: 1164504
Gene Length: 1926 bp
Protein Length: 641 aa
Translation table: 11
GC content: 40%
IMG OID: 642611549
Product: sulfatase
Protein accession: YP_001875958
Protein GI: 187251476
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily
TIGRFAM ID:
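
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1162579
END_BP = 1164504
GENE_LENGTH_BP = 1926
PROTEIN_LENGTH_AA = 641


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions, 1164504 - 1162579 + 1 gives 1926 bp, and 1926 / 3 - 1 gives 641 aa, which agrees with the listed gene and protein lengths.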


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 90
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAAAA CACCGTCAAA AATTAAAATC TCCCTAATTA TAACGGCGGG ATTTGCGATT 
GTTTTTACTT TGGCGAGGCT GGCTCTTTTG CTTATTTATC CAGACTATTT TAAAGAACTT
ACCAAAACTG AAATTTTGTT TTCTTTTTTA AACGGGCTGC GCTTTGATTT CGGCATAATA
GCTTTATTCG CGGGGCCGCT TTTACTGCTT TTTAACCTGC CCGTAAAATC AAAGATTTTT
TTAAAAACAG TTTTATTCTT ACTCGCTTTT GTTTTTATCG TTGTTACAGG CCTTTTAGCG
GCTGATATTA TTTATTTCGG CTACTCTTTT AAGCATATTA CCGAAGAAAT TCTTATTTTA
GGCAATGACG CCGGGTTTAT ATTCCAATAT GTTTTTAGCA AAAAATTACT TTTGCTTGCA
ATAATTGTTT TCGCCGCCGC TTTAATTGCG GCGGTAAACA AAATGGCAGA TATATATTCC
CGTGCGGAAA CGCAAAATAT TTTTAAATCC GCCTTAACCT CCGCAGTAAT AATAATTTTA
ATTATCTTCG GAATACGCGG CAGGATTTTG GTAACTAATG AAAGCGGCAC AAAATTCGCC
CGTAAAGCCA TAGGCATTGC GGACGTTTAT TTATACGCCG CAAATTCCGC CGCCGCTAAT
CTTACTTTAA ACGGGGTATT TACCTCTTTC CACACCACAA GAAAGGGTAA GGTTGAACTT
GTAAATAACT TTCCTTTAAA CGAAGCTTTG GAAAACGCGC AAAACATTCT GTTTGAGCCG
AAAGACATTC TTGTAAATAA AGATTTCCCT TTAATGCGCA TAAGTCCCAA AAAAACAAAC
GCAGGCGAAT ATAATTTTTT TGTAGTCCTT TTGGAAGGCT GGAGCCCTTT TTATATTGAT
TCTTTGCAAG GCAAAAATTA CGGCGTTACG CCTAATTTTG ACAATATCGT AAAAAACGGC
GTTAATATGA CAAACGCTTA CTCCGCCGGG GCAAGAAGCA TATTCGGCTT TGGGGCGGCT
TTCGCTGGCG TGCCTATGCT GCCAAGCCTT CCCGTTTTCG GTTACGGGCT GGAACTTTCT
GATATCACCG CAATAGGCCG CCCTTTTAAC GAGCGGGGAT ATTACACAAT TTTCGCACAG
GCCTCGCACA GAGATTCTTA CAGGATGTGC GCTTTAGCCT CAGGCCTTTT GGATATGCAA
GACAGTTTCG GCAGGGAAGA TATTCCCGTT TTGCTTCCTT ATAGGGAAAA TGCTTCCTTC
GGCTATGATT ACGATATGCT TATGTTTACG GCTGATAAAG TGAAAAAACA TGATAAATTT
TTAGCCCTTA CTTTTACCGC AATAACGCAT GATCCTTTTA CCGTCACCTT GGAAGAGTTT
GAAAAATACC CCAGGGGTAG TTGGGAAAAT GAATATCTTA ACTCCTTATA CTACGCTGAT
TTCGCCATAG GCGAGCTTAT TAAAAAAGCT AAAGAAGACG GCTGGTTTGA CAATACTGTT
TTTATTTTTT TATCCGACCA CGGGCAAGGA CAAAAAGGCC GTGACACAAT TAAAACAAGA
ATGCAAATAC CTTTTGTTAT TTACGCTCCT AAAATATTAA AACCGCAAAC AATTAATTAC
ACCGTTTCCC AGCTTGATTT GCTGCCCACT ATATATAATC TCGCGGGTAT TGAAAGCCCT
TATACGGCTT TAGGTAAAGA TATTTTCGGT TCGGACAAAG GGCGCGTTGC CTTTTTTGCC
GAAGGTATTG ATATCGGGCT TATGACTGAT AAAGGCGCAC TTAAACATAG CGGCTTAGGT
ATTTTAGGCG CGCAGTTTAC CGAGCCTGAT TTTGACGTAA AAAAAGCGGA AAGAGACCTT
CTTTCCCTTG AAAAAGCGGG AACGTCTTTA TTAAAAACCA ACAAGTGGTA TTTAAGCAAG
CCTTAA
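
The gene sequence above is laid out in blocks of ten bases, so it has to be joined into one string before any computation. The following is a minimal sketch of that step plus a GC-content calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used here as sample input, so the printed value is not expected to match the 40% reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence into one upper-case string (drops spaces and line breaks)."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence from the record, used only as sample input.
first_line = "ATGCAAAAAA CACCGTCAAA AATTAAAATC TCCCTAATTA TAACGGCGGG ATTTGCGATT"
seq = clean_sequence(first_line)
print(len(seq))                      # 60
print(round(gc_percent(seq), 1))     # GC percentage of this 60-base sample only
```

Passing the full blocked sequence to the same two functions would give the gene-wide figure for comparison with the record's GC content field.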
 
Protein sequence
MQKTPSKIKI SLIITAGFAI VFTLARLALL LIYPDYFKEL TKTEILFSFL NGLRFDFGII 
ALFAGPLLLL FNLPVKSKIF LKTVLFLLAF VFIVVTGLLA ADIIYFGYSF KHITEEILIL
GNDAGFIFQY VFSKKLLLLA IIVFAAALIA AVNKMADIYS RAETQNIFKS ALTSAVIIIL
IIFGIRGRIL VTNESGTKFA RKAIGIADVY LYAANSAAAN LTLNGVFTSF HTTRKGKVEL
VNNFPLNEAL ENAQNILFEP KDILVNKDFP LMRISPKKTN AGEYNFFVVL LEGWSPFYID
SLQGKNYGVT PNFDNIVKNG VNMTNAYSAG ARSIFGFGAA FAGVPMLPSL PVFGYGLELS
DITAIGRPFN ERGYYTIFAQ ASHRDSYRMC ALASGLLDMQ DSFGREDIPV LLPYRENASF
GYDYDMLMFT ADKVKKHDKF LALTFTAITH DPFTVTLEEF EKYPRGSWEN EYLNSLYYAD
FAIGELIKKA KEDGWFDNTV FIFLSDHGQG QKGRDTIKTR MQIPFVIYAP KILKPQTINY
TVSQLDLLPT IYNLAGIESP YTALGKDIFG SDKGRVAFFA EGIDIGLMTD KGALKHSGLG
ILGAQFTEPD FDVKKAERDL LSLEKAGTSL LKTNKWYLSK P
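
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is an illustration, not the database's own method. It assumes the amino-acid assignments of translation table 11 are the same as those of the standard genetic code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG), and it uses only the first line of the gene sequence as input. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in TCAG order (first base varies slowest), paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = "ATGCAAAAAA CACCGTCAAA AATTAAAATC TCCCTAATTA TAACGGCGGG ATTTGCGATT"
print(translate(first_line))  # MQKTPSKIKISLIITAGFAI
```

The output matches the first twenty residues of the protein sequence listed above (MQKTPSKIKI SLIITAGFAI).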