Gene Information |
Locus tag | Emin_1069 |
Symbol | |
ID | 6263259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | + |
Start bp | 1162579 |
End bp | 1164504 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 642611549 |
Product | sulfatase |
Protein accession | YP_001875958 |
Protein GI | 187251476 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
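
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count of an unspliced CDS; the stop codon encodes no residue."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Emin_1069.
length = gene_length_bp(1162579, 1164504)
print(length)                     # 1926
print(protein_length_aa(length))  # 641
```

Both results match the "Gene Length" and "Protein Length" fields of the record.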
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 90 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAAAA CACCGTCAAA AATTAAAATC TCCCTAATTA TAACGGCGGG ATTTGCGATT GTTTTTACTT TGGCGAGGCT GGCTCTTTTG CTTATTTATC CAGACTATTT TAAAGAACTT ACCAAAACTG AAATTTTGTT TTCTTTTTTA AACGGGCTGC GCTTTGATTT CGGCATAATA GCTTTATTCG CGGGGCCGCT TTTACTGCTT TTTAACCTGC CCGTAAAATC AAAGATTTTT TTAAAAACAG TTTTATTCTT ACTCGCTTTT GTTTTTATCG TTGTTACAGG CCTTTTAGCG GCTGATATTA TTTATTTCGG CTACTCTTTT AAGCATATTA CCGAAGAAAT TCTTATTTTA GGCAATGACG CCGGGTTTAT ATTCCAATAT GTTTTTAGCA AAAAATTACT TTTGCTTGCA ATAATTGTTT TCGCCGCCGC TTTAATTGCG GCGGTAAACA AAATGGCAGA TATATATTCC CGTGCGGAAA CGCAAAATAT TTTTAAATCC GCCTTAACCT CCGCAGTAAT AATAATTTTA ATTATCTTCG GAATACGCGG CAGGATTTTG GTAACTAATG AAAGCGGCAC AAAATTCGCC CGTAAAGCCA TAGGCATTGC GGACGTTTAT TTATACGCCG CAAATTCCGC CGCCGCTAAT CTTACTTTAA ACGGGGTATT TACCTCTTTC CACACCACAA GAAAGGGTAA GGTTGAACTT GTAAATAACT TTCCTTTAAA CGAAGCTTTG GAAAACGCGC AAAACATTCT GTTTGAGCCG AAAGACATTC TTGTAAATAA AGATTTCCCT TTAATGCGCA TAAGTCCCAA AAAAACAAAC GCAGGCGAAT ATAATTTTTT TGTAGTCCTT TTGGAAGGCT GGAGCCCTTT TTATATTGAT TCTTTGCAAG GCAAAAATTA CGGCGTTACG CCTAATTTTG ACAATATCGT AAAAAACGGC GTTAATATGA CAAACGCTTA CTCCGCCGGG GCAAGAAGCA TATTCGGCTT TGGGGCGGCT TTCGCTGGCG TGCCTATGCT GCCAAGCCTT CCCGTTTTCG GTTACGGGCT GGAACTTTCT GATATCACCG CAATAGGCCG CCCTTTTAAC GAGCGGGGAT ATTACACAAT TTTCGCACAG GCCTCGCACA GAGATTCTTA CAGGATGTGC GCTTTAGCCT CAGGCCTTTT GGATATGCAA GACAGTTTCG GCAGGGAAGA TATTCCCGTT TTGCTTCCTT ATAGGGAAAA TGCTTCCTTC GGCTATGATT ACGATATGCT TATGTTTACG GCTGATAAAG TGAAAAAACA TGATAAATTT TTAGCCCTTA CTTTTACCGC AATAACGCAT GATCCTTTTA CCGTCACCTT GGAAGAGTTT GAAAAATACC CCAGGGGTAG TTGGGAAAAT GAATATCTTA ACTCCTTATA CTACGCTGAT TTCGCCATAG GCGAGCTTAT TAAAAAAGCT AAAGAAGACG GCTGGTTTGA CAATACTGTT TTTATTTTTT TATCCGACCA CGGGCAAGGA CAAAAAGGCC GTGACACAAT TAAAACAAGA ATGCAAATAC CTTTTGTTAT TTACGCTCCT AAAATATTAA AACCGCAAAC AATTAATTAC ACCGTTTCCC AGCTTGATTT GCTGCCCACT ATATATAATC TCGCGGGTAT TGAAAGCCCT TATACGGCTT TAGGTAAAGA TATTTTCGGT TCGGACAAAG GGCGCGTTGC CTTTTTTGCC GAAGGTATTG ATATCGGGCT TATGACTGAT AAAGGCGCAC TTAAACATAG CGGCTTAGGT 
ATTTTAGGCG CGCAGTTTAC CGAGCCTGAT TTTGACGTAA AAAAAGCGGA AAGAGACCTT CTTTCCCTTG AAAAAGCGGG AACGTCTTTA TTAAAAACCA ACAAGTGGTA TTTAAGCAAG CCTTAA
|
Protein sequence | MQKTPSKIKI SLIITAGFAI VFTLARLALL LIYPDYFKEL TKTEILFSFL NGLRFDFGII ALFAGPLLLL FNLPVKSKIF LKTVLFLLAF VFIVVTGLLA ADIIYFGYSF KHITEEILIL GNDAGFIFQY VFSKKLLLLA IIVFAAALIA AVNKMADIYS RAETQNIFKS ALTSAVIIIL IIFGIRGRIL VTNESGTKFA RKAIGIADVY LYAANSAAAN LTLNGVFTSF HTTRKGKVEL VNNFPLNEAL ENAQNILFEP KDILVNKDFP LMRISPKKTN AGEYNFFVVL LEGWSPFYID SLQGKNYGVT PNFDNIVKNG VNMTNAYSAG ARSIFGFGAA FAGVPMLPSL PVFGYGLELS DITAIGRPFN ERGYYTIFAQ ASHRDSYRMC ALASGLLDMQ DSFGREDIPV LLPYRENASF GYDYDMLMFT ADKVKKHDKF LALTFTAITH DPFTVTLEEF EKYPRGSWEN EYLNSLYYAD FAIGELIKKA KEDGWFDNTV FIFLSDHGQG QKGRDTIKTR MQIPFVIYAP KILKPQTINY TVSQLDLLPT IYNLAGIESP YTALGKDIFG SDKGRVAFFA EGIDIGLMTD KGALKHSGLG ILGAQFTEPD FDVKKAERDL LSLEKAGTSL LKTNKWYLSK P
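
The gene and protein sequences above can be cross-checked against each other, and the GC fraction can be recomputed from the nucleotide sequence. The following is a minimal sketch, not the database's own method. It assumes the sequence is pasted as text in space-separated blocks of ten bases, as shown above, and it builds the codon map from the standard NCBI amino-acid string, which translation table 11 shares with table 1 for codons after the start codon. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# NCBI codon order: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (the record groups bases in blocks of 10) and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in the record.
prefix = "ATGCAAAAAA CACCGTCAAA AATTAAAATC"
print(translate(prefix))  # MQKTPSKIKI
```

The translated prefix matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `gc_content` gives a value to compare with the reported GC content of 40%; how the database rounds that figure is not stated in the record.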