Gene Mesil_1248 details

Gene Information

Locus tag: Mesil_1248
Symbol:
ID: 9250744
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Meiothermus silvanus DSM 9946
Kingdom: Bacteria
Replicon accession: NC_014212
Strand:
Start bp: 1231818
End bp: 1233323
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 62%
IMG OID:
Product: sulfatase
Protein accession: YP_003684653
Protein GI: 297565681
COG category:
COG ID:
TIGRFAM ID:
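
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 1231818
END_BP = 1233323
GENE_LENGTH_BP = 1506
PROTEIN_LENGTH_AA = 501


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1233323 - 1231818 + 1 gives 1506 bp, and 1506 / 3 - 1 gives 501 aa, which agrees with the listed values.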


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.305152
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTTGGA CTGGGATGGT CGTCGCTGGA TTGTTGCTTA CGGGGTGGGC TCTTTGGGTT 
TTGGGACAAA GGGTTCAGGC TCAGGCAACC CCCAAACCCA ACATCATCTT TATCCTCACC
GACGACGAAG ACGTGGGCAT CCACGCCTTC ATGCCTAAGA CCAAAGCCCT GTTGCACGAC
CAGGGCACCA CGTTCTCCAA CTTTTTCGTG ACCTACTCCC TGTGCTGCCC TAGCCGGGCC
TCCATTCTTA CCGGGCAGTA CCCGCACAAC ACCCACATCG AGGGCAACCG CCCACCCCAG
GGGGGCTTTC TCAAGGCCTA CCAGACCGGG CTCGAGGCCA ACACCGTTGC GGTCTGGCTG
CAACAGGCCG GTTACCACAC CCTGCTGGCG GGAAAGTACC TCAACGGCTA CGGGCAAGAC
AATCCTCGCC GCGCCCAGCG CCAGGGGCTG AACCTGCCCG ATCCCACCTA TGTCCCGCCC
GGCTGGACCG AGTGGTACGC CGGGGTAGGT AATGCACCTT ACCAGGACTA CAACTATTCG
CTCAACGAGA ATGGCCAGCT GGTGCGCTAT GGCAGCAGCC CCGAGGATTA CCTCACCGAC
GTGATCGCGC GCAAGGCAGT GCAGGCGATC ACTAACGCCA CCCGGGAGGG TAAGCCCTTT
TTCCTCTACC TAGCCCCCTT CACCCCCCAC GCTCCGGCCA ACTTCGCCCC TCGCCATGCG
AGCCTCTTCA AAGACGCCGA GCTGCCCCGC CCGCCCAACT TTGACGAAGC CGACGTGAGC
GACAAGCCTC CCCTCCTGCG CCGACTCCCG CGCTTGAGTG AGCGCGAATT GGCCAGAATG
CGGGAGCTAT ATGTCAAGCG GCTGCGCTCC TTACAAGCCA TTGATGACCT GGTGGAGAGT
ATAGTGCAGG CACTAAGGCA AAACGGGCAA CTCGCCAACA CCTATATCGT CTACACCTCC
GACAACGGCT TCCACATGGG CAACCACCGT ATGCCCCAGG GTAAAAATAT GCCTTATGAG
GAAGATATCC GGGTACCGCT GGTGGTGCGC GGGCCAGGGG TTCCGGCAGG CAAGACGGTT
AATGAGTTGG CTCTCAACAT CGACTTGGCG CCCACTTTCG CGAAAATTGC CGGGCTCGAA
GTGCCCCCGT CCTGCGACGG GCGCTCTCTC TTGCCCTTAC TGCGGGGCCA GATCCCCACG
GTTTGGCGCC AGAGCTTCAT GGTGCAGCGA GGTGAGGGGG CCGAGGCGCA GTCTGAAGAC
GGCGATGGTC GGGACCGGGC GGGGGCCTTC AGCGCCCTCC GTACGGCGGC TTACACCTTT
GTTCAGTGGG GTAGCGGCGA CCGCGAGCTT TACGACCTGA AGGCCGATCC CTACCAGCTC
CAGAATCTCG CCAGCAAAGC TGACCCGGTC TTGATCCAGC GTCTTTCCAC CCGGCTGAGC
GAGTTGTCTA AGTGCAGGGG GGACGAGTGC CGCCGTGTGG AGGAGCTTCC CATAGGGACG
CCCTAA
 
Protein sequence
MRWTGMVVAG LLLTGWALWV LGQRVQAQAT PKPNIIFILT DDEDVGIHAF MPKTKALLHD 
QGTTFSNFFV TYSLCCPSRA SILTGQYPHN THIEGNRPPQ GGFLKAYQTG LEANTVAVWL
QQAGYHTLLA GKYLNGYGQD NPRRAQRQGL NLPDPTYVPP GWTEWYAGVG NAPYQDYNYS
LNENGQLVRY GSSPEDYLTD VIARKAVQAI TNATREGKPF FLYLAPFTPH APANFAPRHA
SLFKDAELPR PPNFDEADVS DKPPLLRRLP RLSERELARM RELYVKRLRS LQAIDDLVES
IVQALRQNGQ LANTYIVYTS DNGFHMGNHR MPQGKNMPYE EDIRVPLVVR GPGVPAGKTV
NELALNIDLA PTFAKIAGLE VPPSCDGRSL LPLLRGQIPT VWRQSFMVQR GEGAEAQSED
GDGRDRAGAF SALRTAAYTF VQWGSGDREL YDLKADPYQL QNLASKADPV LIQRLSTRLS
ELSKCRGDEC RRVEELPIGT P
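
The relationship between the gene sequence and the protein sequence above can be reproduced with a short translation routine. This is a minimal sketch, not the method used to generate the record: it assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and the names `CODON_TABLE`, `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first and last blocks of the gene sequence are used here; the full sequence can be pasted in the same way, including the spaces and line breaks.

```python
# Codon table in the usual T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases, and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, or T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


first_blocks = "ATGCGTTGGA CTGGGATGGT CGTCGCTGGA"
last_blocks = "CGCCGTGTGG AGGAGCTTCC CATAGGGACG CCCTAA"

print(translate(first_blocks))  # MRWTGMVVAG
print(translate(last_blocks))   # RRVEELPIGTP
```

The two printed fragments match the start and end of the protein sequence listed above. Applying `gc_fraction` to the full gene sequence would give a value to compare with the GC content field.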