Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpin_3320 |
Symbol | |
ID | 8359486 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chitinophaga pinensis DSM 2588 |
Kingdom | Bacteria |
Replicon accession | NC_013132 |
Strand | + |
Start bp | 4083793 |
End bp | 4085355 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644965493 |
Product | sulfatase |
Protein accession | YP_003122988 |
Protein GI | 256422335 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0000925232 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGAGTATCT TAGCTTCTAT GTTGAAAATT ACGAATATCA GATCTGTAGT TGTCGCACTT ATTGTACTCC TTGCACCAAT TCTGTTGAGT GCACAGTCGA AACGACCAAA TATCATCTTC ATTCTTTCAG ATGATCATAC CTACCAGGCC ATCAGTGCTT ATGGTAACAG GTATGTGCAA ACGCCAAATA TCGACCGGAT AGCACGCGAA GGGGTCTTAT TCCATCATGC CATGGTCACC AACTCCATAT GTGGACCAAG CAGAGCTACG TTATTGACAG GAAAATACAG CCATAAAAAT GGGTATCCAC TGAATGAAAA ACGATTCGAC AATACCCAGC AGACTTTCCC CGGTATTTTG CAGCAAAATG GTTATCAGAC TGCATGGATA GGTAAAATGC ACCTTGGTAC ATTGCCAGAG GGATTCAATT ACTTCAGCAT TTTGCCTGGT CAGGGTAAAT ACTACAATCC GGACTTCATT TCCACACCGA ATGATACGGT GAGGATGGGC GGTTATGTCA CTAATATTAT TACGGACTTG TCTGTATCCT GGCTTAATGG CCGGGATACA AGCCGTCCTT TTATGCTCGT TGTAGGAGAA AAAGCAACAC ACCGCGAATG GCTGCCTGAC CTGCCTGATC TGGGCGCGTA CGACAGTATT AATTTCCCGC TGCCTGCTAG CTTCTCCGAC GACTATAAAG GCCGTCGTGC AGCCCTCGAC CAGGATATGA CCATCTCCAA CACGATGCGA CTGAAAGAAG ACCTGAAAGT GCATGTAAAC TATCAGCAAA ATAAAAACAG TGAATACGGC AGGTTCACAC CAGAGCAGCT GGCGCCGTTT GCTCAGTATT ATGAGCAGAA AATAAGCCAC GAATTCGATT CCCTGCACCT GAGTGGCCAA GCACTTACAG CGTGGAAATA TCAGCGTTAT ATGCGCGATT ATCTCGCTAC TGCCCGTTCC CTGGACAGGA ATATCGGACG CATCCTCCAA TACCTCGATA GCACCGGACT GGCCGACAAT ACAGTGGTGA TCTATTGTTC CGATCAGGGC TTCTACCTGG GAGAACATGG CTGGTTTGAC AAACGTTTTA TTTACGAACA GTCTTTACAT ACGCCTTTTG TAATGCGTTA TCCTGGTGTG ATCAAACCCG GTACAAACAG TAATAGCTTT ATGCTGAATA TTGACTGGGC GCCTACCTTA CTGGATATCG CACACGTAAA AGCACCTGCA GATATGCAGG GTACTTCGTT TATGCCTTTG GTGAGCGGTA AAGGGGATAC TACCCGCTGG AGGAAGGATA TGTATTATCA TTATTACGAG TTCCCGGAGC CGCATCATGT ATCGCCTCAC TTTGGCATAC GTACGGAGAG ATATAAACTG GTACGTTTTT ATGGACCTGC TGATTTCTGG GAATTGTATG ATCTGAAAAA CGATCCGCAG GAACTGCATA ATATTTACAA TGATCCGGCA CAGGCGAAGA ATATAGTAAA CCTGAAGAAA CGTTTGAAGG TACTGATAAC GCAGTATGAT GATAAAGATG CGGAGAAACT ACTGAAGAAT TAA
|
Protein sequence | MSILASMLKI TNIRSVVVAL IVLLAPILLS AQSKRPNIIF ILSDDHTYQA ISAYGNRYVQ TPNIDRIARE GVLFHHAMVT NSICGPSRAT LLTGKYSHKN GYPLNEKRFD NTQQTFPGIL QQNGYQTAWI GKMHLGTLPE GFNYFSILPG QGKYYNPDFI STPNDTVRMG GYVTNIITDL SVSWLNGRDT SRPFMLVVGE KATHREWLPD LPDLGAYDSI NFPLPASFSD DYKGRRAALD QDMTISNTMR LKEDLKVHVN YQQNKNSEYG RFTPEQLAPF AQYYEQKISH EFDSLHLSGQ ALTAWKYQRY MRDYLATARS LDRNIGRILQ YLDSTGLADN TVVIYCSDQG FYLGEHGWFD KRFIYEQSLH TPFVMRYPGV IKPGTNSNSF MLNIDWAPTL LDIAHVKAPA DMQGTSFMPL VSGKGDTTRW RKDMYYHYYE FPEPHHVSPH FGIRTERYKL VRFYGPADFW ELYDLKNDPQ ELHNIYNDPA QAKNIVNLKK RLKVLITQYD DKDAEKLLKN
|
| |