Gene Cmaq_0237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0237 
Symbol 
ID5709156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp273001 
End bp274344 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content40% 
IMG OID641274739 
Productsulfatase 
Protein accessionYP_001540075 
Protein GI159040823 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.813396 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTAATG ATCATTTCAA CGTTATCCTA ATAGCTATAG ATACTTTGAG GGCTGATAGG 
GTTGGTTGCC TCGGCTCAGG TTACCCAACT ATGCCTAATG TTGACTCCCT CTGCCGTGAC
TCCGTGGCGT TTACGAATCA TTACGCTGAA GCCATACCCA CTCACCCATC CTTCACAACA
ATATTTACTG GAACAACACC ATTAATACAC AGTATAGTTA GTCATGGTGG TAAGGTTCAA
TTAAGTGGAA GCATATTAAC TCTACCACAG ATACTTAATG AAGCCGGCTA CTTAACTATA
GCTGTTGATA ACTTAGCCAC ACACATGTAT GCTGGATGGT TCGCTAGGGG CTACAGGTAC
TACATTGATA TTGGTGGTTT AGTGGTGATT TCTAACGGTA TTAAGGTTAA TGGTGAGTAT
GTTAATGCTA AGGTTAAGGA TGCCTTCGAA TTAATTAATA GGTATAAGAA TGAACCCTTC
TTACTCTTTA TACATTACTG GGATCCCCAC GCCCCATATA TACCACCTAA GCCTTATGCT
GAGAAGTTCT ATCATGGGGA TTACAGTAAG GGTGATTTGG TTAGTAGGCT TAACTCAACA
GCTTGGGGTA GATTACTACT TAAGGATAGT TGGATAAGGG ATTTAATTAA TTCCGGCGTT
AATGACCCTG ATTACATTAG GGCTCTTTAC GATAGTGAGG CTGCTTACGT TGATGAGAGG
ATTGGTGAAT TAATGAGCAT TATTAATAAT ACTGGCTTAC TTGAGGATAC ATTAATAGTA
TTAACATCAG ATCATGGTGA GGGTCTTGGT GAACACAACG TATACTACGA GCACCACGGC
TTATATGAGT GGGATGTTAA GACACCATTA ATCATTAGGC TACCTGATAA GTTAATTGAT
GAGGTTGGGC GTGGTAAAGC CAAGGGTGTT AAGTATGATG CATTTGTTCA AAACACCGAC
ATAACACCAA CAATATTAGA TTCACTAGGT TTAAAGATTC CCGAATACAT GACTGGCTTA
AGCCTACTTA AGGTTATTAG GGGTGAGTCT AAGGGTCATG ATGCAGTGTT CAGTCTTGAG
AATACTAGGC AGACTGCTAG GATGATTAGG GTTGGTGAAT GGAAGCTAAT ACAGTGGATT
AGGAATGATA CATACGGTAG GAGGAGTGGC CACGTTGAAT TATATAATTT AGCTAAAGAC
CCAACTGAAT CAAGGAACAT GGCAACCGAG GAAGGTGAAA TTACATTAAG GCTACTTGGA
TTAATTGAGA GGAGGTATAG GGAGGTGGCT GGAGCCAATG ACCCATTAAT ACTGCAGGAA
ATAAGCGTAC CAATAAAACC ATGA
 
Protein sequence
MVNDHFNVIL IAIDTLRADR VGCLGSGYPT MPNVDSLCRD SVAFTNHYAE AIPTHPSFTT 
IFTGTTPLIH SIVSHGGKVQ LSGSILTLPQ ILNEAGYLTI AVDNLATHMY AGWFARGYRY
YIDIGGLVVI SNGIKVNGEY VNAKVKDAFE LINRYKNEPF LLFIHYWDPH APYIPPKPYA
EKFYHGDYSK GDLVSRLNST AWGRLLLKDS WIRDLINSGV NDPDYIRALY DSEAAYVDER
IGELMSIINN TGLLEDTLIV LTSDHGEGLG EHNVYYEHHG LYEWDVKTPL IIRLPDKLID
EVGRGKAKGV KYDAFVQNTD ITPTILDSLG LKIPEYMTGL SLLKVIRGES KGHDAVFSLE
NTRQTARMIR VGEWKLIQWI RNDTYGRRSG HVELYNLAKD PTESRNMATE EGEITLRLLG
LIERRYREVA GANDPLILQE ISVPIKP