Gene Information |
Locus tag | Jann_1568 |
Symbol | |
ID | 3934016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Jannaschia sp. CCS1 |
Kingdom | Bacteria |
Replicon accession | NC_007802 |
Strand | - |
Start bp | 1544858 |
End bp | 1546507 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637903919 |
Product | sulfatase |
Protein accession | YP_509510 |
Protein GI | 89054059 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
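The coordinate, gene length, and protein length fields above are related by simple arithmetic, which the sketch below checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that contributes no residue. Both are assumptions about the record's conventions, although the reported numbers are consistent with them. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1544858
end_bp = 1546507
reported_gene_length = 1650      # bp
reported_protein_length = 549    # aa

# Assumption: coordinates are 1-based and inclusive,
# so the span length is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with a stop codon,
# so every codon except the last one encodes a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print("gene length (bp):", gene_length)
print("leftover bases after whole codons:", remainder)
print("protein length (aa):", protein_length)
print("matches record:",
      gene_length == reported_gene_length
      and protein_length == reported_protein_length)
```

The strand field does not enter this calculation: a gene on the minus strand covers the same number of bases between its start and end coordinates.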
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.423465 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCACGA CCGCGCGCCC GAATGTTCTG TGGATCATGG CTGACCAGCT GCGGTTTGAT TACCTCAGCT GTTACGGCCA CCCGCATCTG CACACCCCCC ATATTGACGC GCTGGCCGCG CGCGGCGTGC GCTTCACCAA TGCCTATGTG CAATCGCCCG TCTGCGGCCC GTCGCGGATG TCGGCCTATA CCGGGCGCTA TGTGCGCAGC CATGGCTCCA CCTGGAACGG CATGCCTTTG CGTGTGGGGG AGCCGACCCT GGGCGATCAC CTGCGAGAGG CGGGCGCGCG GGCGGTTCTG GTGGGCAAGA CCCATATGGT TGCCGATGCG GAAGGGATGG CGTGGCTTGG CATCGACCCC AAAAGCGAGA TCGGCGTCAG CCGGTCCGAA TGTGGGTTTG AACCGTTTGA GCGTGACGAC GGATTGCACC CCGACAGCCC ACGCCAACGC TGGTCCGCCT ATGACGATTA CCTCGTGTCC CACGGCTACG ACAGCCAGAA CCCCTGGGAA GATTTCGCCA ATTCCGGCGT CGATGCGGAT GGGGAACTGC TGTCTGCCTG GCTTCTGAAA AACTCTCGCC TCGCCGCGAA CGTGCCGGAA GAGCATTCCG AGACCGCCTA TATGACCGAC CGCGCAATGG CGTTCATGGA GGAGGCGGAG GCCGATGGCC GCCCGTGGAT GTGCCACCTC AGCTACATCA AGCCGCACTG GCCCTACATC GTGCCCGCGC CCTACCACGA CATGTATGAC GAGAGCCACA TTATCGACCC GGTCCGATCA GAGGCCGAGC GTGACAATGC CCACCCGCTG GTCGGCGCCT ACCAGAACTC CCGCGTGTGC CGGACGTTTT CGCGCGATCA GGTCCGGGAC CATGTGATCC CCGCCTATAT GGGCCTGATC AAGCAGCTCG ACGACAATCT GGGGCGGTTG TTTGCGTGGA TGGACGCGCG CGGACTGAGC GAGAACACCA TCATCGCTTT CACGTCAGAT CACGGCGATT ACATGGGCGA TCACTGGATG GGAGAGAAGG ATTTCTACCA TGAGATGGCG GTGAAAGTGC CCATGATCAT TGCCGATCCG CGCCCACAGG CCGACGGGAC GCGGGGCCAT GTGGCAGATG ATCTGGTGGA GATGATCGAC CTGGCCCCCA CCTTCATGAC AGCGCTGGGG GCCGCGCCCA AACCCCACAT CATCGAAGGC CGCGACCTGA CGCCCCTTCT GCACGGCACG GACGGGTTCG CGCGGCAATA TGTCATTAGC GAATACGACT ACCATTGGTC CGAGATGGCC GCCGCATTGC AGGTCCCGCA GGAGGACGCC CACACCACGA TGATCTTCGA CGGGCGTTGG AAATACATCC GCTGTGAGCG TTTCGATCCG GTGCTGTTTG ATCTGGAAAC GGACCCGCAG GAGTTGGTAG ACCTTGGCAC CTCCCCCGCC CATGCTGAGA TCCGCGCGCT CATGGATGCG GCCCTGCTGA AATGGGCCAC GCAGCACCAC ACCCGCATCA CTGCCACCGC CGCCGTTCTT GCCGGACAGA AGATCGCGGC CGAGACCGGC ATCCTGATCG GGTTCTGGGA TGAGGCGGAG TTTGAAGCCG CCACCGGCTT TCCGTTTACC GACCTGACAC CGCAGGGGCC GCCAAAATAG
|
Protein sequence | MITTARPNVL WIMADQLRFD YLSCYGHPHL HTPHIDALAA RGVRFTNAYV QSPVCGPSRM SAYTGRYVRS HGSTWNGMPL RVGEPTLGDH LREAGARAVL VGKTHMVADA EGMAWLGIDP KSEIGVSRSE CGFEPFERDD GLHPDSPRQR WSAYDDYLVS HGYDSQNPWE DFANSGVDAD GELLSAWLLK NSRLAANVPE EHSETAYMTD RAMAFMEEAE ADGRPWMCHL SYIKPHWPYI VPAPYHDMYD ESHIIDPVRS EAERDNAHPL VGAYQNSRVC RTFSRDQVRD HVIPAYMGLI KQLDDNLGRL FAWMDARGLS ENTIIAFTSD HGDYMGDHWM GEKDFYHEMA VKVPMIIADP RPQADGTRGH VADDLVEMID LAPTFMTALG AAPKPHIIEG RDLTPLLHGT DGFARQYVIS EYDYHWSEMA AALQVPQEDA HTTMIFDGRW KYIRCERFDP VLFDLETDPQ ELVDLGTSPA HAEIRALMDA ALLKWATQHH TRITATAAVL AGQKIAAETG ILIGFWDEAE FEAATGFPFT DLTPQGPPK