Gene Jann_1494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_1494 
Symbol 
ID3933941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp1465402 
End bp1466910 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content61% 
IMG OID637903844 
Productsulfatase 
Protein accessionYP_509436 
Protein GI89053985 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR03417] choline-sulfatase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000530638 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAAC CTAACATCCT GATAGTAATG GTGGATCAGT TGAACGGCAC GTTGTTCCCG 
GACGGCCCGG CGGACTGGCT GCACACGCCC AATCTGGACC GCTTGGCAGC CCGCTCCACA
CGATTTGCCA ACACGTATAC CGCCTCTCCA CTCTGCGCGC CGGCCCGCGC GTCGTTCATG
TCGGGCCAAT TACCGTCGCG CACCGGCGTC TATGACAACG CCGCGGAATT CACGTCGTCA
ATCCCGACCT ACGCCCATCA CCTGCGCCGT GCGGGCTACT ATACTGCGTT GTCGGGCAAG
ATGCATTTTG TGGGGCCGGA CCAGCTGCAC GGGTTTGAAC AGCGGCTGAC CACGGATATC
TACCCGGCTG ATTTCGGTTG GACGCCGGAC TACCGCAAGC CCGGTGAGCG GATCGATTGG
TGGTATCACA ACATGGGCTC CGTCACCGGA TCCGGCGTTG CTGAGATCAC CAACCAGATG
GAATATGACG ACGCCGTGGC GTTTGAGGCG GAGCAAAAAC TCTACGACCT GTCGCGTGGG
GCCGATCGGC GCCCGTGGTG TCTGACGGTC AGTTTCACCC ACCCCCATGA CCCCTATGTC
GCGCGCCGCG AGTTCTGGGA TTTGTATAAC GATTGCAATC GCTTGATACC TGAGATCGGG
GCGATCCCCT ATGCAGAGCA GGACAACCAT TCCAAGCGCA TTCTGGACGC GAATGATCTG
GGTAATTTCG ATATTACGGA TCAGGACATC GCCCGGTCGC GCCAGGCCTA TTTCGCCAAC
ATCTCCTACC TCGATCAGAA GATCGGCGGC GTGCTGGACG TGCTGGAGCG CACGCGGCAG
GAGGCGATCG TGATCTTCAT CTCGGACCAC GGTGACATGT TAGGCGAACG TGGCTTGTGG
TTCAAAATGT CCTTCTATGA GGGCTCGGCT CGCGTGCCGC TGATGATCTC CGCGCCGGGG
ATGGAGCCTG GGATGGTCTC GGACCCGGTC TCCACCATCG ACCTCTGCCC CACCCTTTGC
GACCTCGCGG GCGTGTCGAT GGACGAGGTG ATGCCGTGGA CCGATGGCGA AACGTTAACG
CCGTTAGGAA AGGGCGCCAA ACGGGTCAAT CCGGTGGCGA TGGAATACGC GGCGGAAGGG
TCGTACTCTC CCGTCGTGGG CCTGCGCCAA GGCCAATGGA AATACACCAA TTGCGCGATT
GATCCGGAGC AATTGTTCGA TCTGGCCGCC GATCCCCATG AGCTGAACGA CCTGTCCGAA
AACCCCGCCC ATGCGGCGAC CTTGGAGGCG TTTCGGGTCA AGGCAGCCGC CCGCTGGGAT
CTGGAGGCGT TCGACGCAAA CGTTCGGCAA TCCCAGGCCC GCCGCTGGGT CGTCTACGAG
GCGCTGCGCA ACGGCGCCTA CTACCCGTGG GATTTTCAAC CCCTGCGCGA CGCGTCCGAG
CGCTACATGC GCAACCACAT GGACCTCAAC GTTCTTGAGG AAAATCAACG CTTTCCCCGG
GGCGAATGA
 
Protein sequence
MTQPNILIVM VDQLNGTLFP DGPADWLHTP NLDRLAARST RFANTYTASP LCAPARASFM 
SGQLPSRTGV YDNAAEFTSS IPTYAHHLRR AGYYTALSGK MHFVGPDQLH GFEQRLTTDI
YPADFGWTPD YRKPGERIDW WYHNMGSVTG SGVAEITNQM EYDDAVAFEA EQKLYDLSRG
ADRRPWCLTV SFTHPHDPYV ARREFWDLYN DCNRLIPEIG AIPYAEQDNH SKRILDANDL
GNFDITDQDI ARSRQAYFAN ISYLDQKIGG VLDVLERTRQ EAIVIFISDH GDMLGERGLW
FKMSFYEGSA RVPLMISAPG MEPGMVSDPV STIDLCPTLC DLAGVSMDEV MPWTDGETLT
PLGKGAKRVN PVAMEYAAEG SYSPVVGLRQ GQWKYTNCAI DPEQLFDLAA DPHELNDLSE
NPAHAATLEA FRVKAAARWD LEAFDANVRQ SQARRWVVYE ALRNGAYYPW DFQPLRDASE
RYMRNHMDLN VLEENQRFPR GE