Gene Jann_1568 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_1568 
Symbol 
ID3934016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp1544858 
End bp1546507 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content63% 
IMG OID637903919 
Productsulfatase 
Protein accessionYP_509510 
Protein GI89054059 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.423465 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCACGA CCGCGCGCCC GAATGTTCTG TGGATCATGG CTGACCAGCT GCGGTTTGAT 
TACCTCAGCT GTTACGGCCA CCCGCATCTG CACACCCCCC ATATTGACGC GCTGGCCGCG
CGCGGCGTGC GCTTCACCAA TGCCTATGTG CAATCGCCCG TCTGCGGCCC GTCGCGGATG
TCGGCCTATA CCGGGCGCTA TGTGCGCAGC CATGGCTCCA CCTGGAACGG CATGCCTTTG
CGTGTGGGGG AGCCGACCCT GGGCGATCAC CTGCGAGAGG CGGGCGCGCG GGCGGTTCTG
GTGGGCAAGA CCCATATGGT TGCCGATGCG GAAGGGATGG CGTGGCTTGG CATCGACCCC
AAAAGCGAGA TCGGCGTCAG CCGGTCCGAA TGTGGGTTTG AACCGTTTGA GCGTGACGAC
GGATTGCACC CCGACAGCCC ACGCCAACGC TGGTCCGCCT ATGACGATTA CCTCGTGTCC
CACGGCTACG ACAGCCAGAA CCCCTGGGAA GATTTCGCCA ATTCCGGCGT CGATGCGGAT
GGGGAACTGC TGTCTGCCTG GCTTCTGAAA AACTCTCGCC TCGCCGCGAA CGTGCCGGAA
GAGCATTCCG AGACCGCCTA TATGACCGAC CGCGCAATGG CGTTCATGGA GGAGGCGGAG
GCCGATGGCC GCCCGTGGAT GTGCCACCTC AGCTACATCA AGCCGCACTG GCCCTACATC
GTGCCCGCGC CCTACCACGA CATGTATGAC GAGAGCCACA TTATCGACCC GGTCCGATCA
GAGGCCGAGC GTGACAATGC CCACCCGCTG GTCGGCGCCT ACCAGAACTC CCGCGTGTGC
CGGACGTTTT CGCGCGATCA GGTCCGGGAC CATGTGATCC CCGCCTATAT GGGCCTGATC
AAGCAGCTCG ACGACAATCT GGGGCGGTTG TTTGCGTGGA TGGACGCGCG CGGACTGAGC
GAGAACACCA TCATCGCTTT CACGTCAGAT CACGGCGATT ACATGGGCGA TCACTGGATG
GGAGAGAAGG ATTTCTACCA TGAGATGGCG GTGAAAGTGC CCATGATCAT TGCCGATCCG
CGCCCACAGG CCGACGGGAC GCGGGGCCAT GTGGCAGATG ATCTGGTGGA GATGATCGAC
CTGGCCCCCA CCTTCATGAC AGCGCTGGGG GCCGCGCCCA AACCCCACAT CATCGAAGGC
CGCGACCTGA CGCCCCTTCT GCACGGCACG GACGGGTTCG CGCGGCAATA TGTCATTAGC
GAATACGACT ACCATTGGTC CGAGATGGCC GCCGCATTGC AGGTCCCGCA GGAGGACGCC
CACACCACGA TGATCTTCGA CGGGCGTTGG AAATACATCC GCTGTGAGCG TTTCGATCCG
GTGCTGTTTG ATCTGGAAAC GGACCCGCAG GAGTTGGTAG ACCTTGGCAC CTCCCCCGCC
CATGCTGAGA TCCGCGCGCT CATGGATGCG GCCCTGCTGA AATGGGCCAC GCAGCACCAC
ACCCGCATCA CTGCCACCGC CGCCGTTCTT GCCGGACAGA AGATCGCGGC CGAGACCGGC
ATCCTGATCG GGTTCTGGGA TGAGGCGGAG TTTGAAGCCG CCACCGGCTT TCCGTTTACC
GACCTGACAC CGCAGGGGCC GCCAAAATAG
 
Protein sequence
MITTARPNVL WIMADQLRFD YLSCYGHPHL HTPHIDALAA RGVRFTNAYV QSPVCGPSRM 
SAYTGRYVRS HGSTWNGMPL RVGEPTLGDH LREAGARAVL VGKTHMVADA EGMAWLGIDP
KSEIGVSRSE CGFEPFERDD GLHPDSPRQR WSAYDDYLVS HGYDSQNPWE DFANSGVDAD
GELLSAWLLK NSRLAANVPE EHSETAYMTD RAMAFMEEAE ADGRPWMCHL SYIKPHWPYI
VPAPYHDMYD ESHIIDPVRS EAERDNAHPL VGAYQNSRVC RTFSRDQVRD HVIPAYMGLI
KQLDDNLGRL FAWMDARGLS ENTIIAFTSD HGDYMGDHWM GEKDFYHEMA VKVPMIIADP
RPQADGTRGH VADDLVEMID LAPTFMTALG AAPKPHIIEG RDLTPLLHGT DGFARQYVIS
EYDYHWSEMA AALQVPQEDA HTTMIFDGRW KYIRCERFDP VLFDLETDPQ ELVDLGTSPA
HAEIRALMDA ALLKWATQHH TRITATAAVL AGQKIAAETG ILIGFWDEAE FEAATGFPFT
DLTPQGPPK