Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpin_3801 |
Symbol | |
ID | 8359972 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chitinophaga pinensis DSM 2588 |
Kingdom | Bacteria |
Replicon accession | NC_013132 |
Strand | - |
Start bp | 4778414 |
End bp | 4779775 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644965973 |
Product | sulfatase |
Protein accession | YP_003123464 |
Protein GI | 256422811 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00499822 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.102523 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGCAA GGTTTTTCTA TGTAATGACC CTATTGTTGG CCACTGTACA AATGACTTAT TGTCAAAAGC CCGTTCAGCC CAACGTGGTC ATCATACTGA CCGATGACAT GGGATATGGC GATATTAGCT GCTACGGCGG CAACGTAATG CCTACTCCCC ATGTCGACAA TATGGCGAAA AACGGTATGC GCTGTACCCA ATATTACAGT GCAGCCCCCA TCTGCTCTCC TTCCCGCGCA GGCATCCTCA CAGGGATGTA TCCTGCCCGC TGGAACTTCA GCACCTACCT GGACAACAAA AAACATAATA AAGCCGCGCA ACAAACCGAT TACCTGGATC CGAAAGCCCC CTCTATCGCA TGCATCTTTA AAAATGCAGG ATATGCCACA GGCCATTTCG GTAAATGGCA CCTCGGCGGA GGCCGTGACG TAACAGACGC TCCAGGCTTT GAACAATACG GTTTTGATGA ACATGCCAGT ACCTATGAGA GTCCGGACCC GGACCCGCTA CTTACAGCAA CCAACTGGAT CTGGTCTGAT AAAGACAGTA TCAAACGCTG GGACCGCACG GCCTATTTTG TAGATAAAGC ACTGAGCTTC CTCCGTAGTC ATACCGGCCA ACCCTGCTTT ATCAACCTCT GGCCCGACGA TGTACATACC CCCTGGGTTC CCCGCAGGTC AGATGGTGAT ACCGCCCGCC TGAAACCAGA AGAAGAAGCC GCACTGAGAA AAGTTTTAAA AGAATACGAT ATACAGATCG GGCGCTTCCT CGCCGAACTG AAAAGATCCG GTCTAGATAA AAATACCATC GTCATTTTTA CCAGTGATAA TGGTCCGCTA CCCACCTTCC GGAACAGCCG TACATTGGGT TTACGTGGTT CCAAATTATC ACTGTACGAC GGCGGTACAC GGATGCCATT TGTCATTAAC TGGACAGGAC ATATCAAACC GGGTAGTATC GACAGCACCT CCATGATCAC CGGACTAGAC CTGTTGCCGA CTCTGGCAGG TATGGCAGGT ATTAAACTAC CAAAAGACTA TCACGGCGAT GGTGTTGATC GCTCTGCCGT CTTTACCGGA CGTCCCTCTG CCCGTAACAA AGACATGTTC TGGGAATACG GTCGTAATAA TATCGCCTAC GCCTATCCCA AAGAATTGGC TTGGAATAGA AGTCCGCAGC TGGCTGTTCG CTCCGGAGAA TGGAAATTCC TGATGAACGC AGACCGTAGC GAACCAGCAC TATACAACGT AAAACTGGAT CCGGGAGAAT CACTCGACAT GAGCGGCATC CGTCCTGATC TGGTAAAACG GCTTTCCACT GATCTCATGA ACTGGTGGAC CGCCATGCCA AAGCTGCAAT AA
|
Protein sequence | MKARFFYVMT LLLATVQMTY CQKPVQPNVV IILTDDMGYG DISCYGGNVM PTPHVDNMAK NGMRCTQYYS AAPICSPSRA GILTGMYPAR WNFSTYLDNK KHNKAAQQTD YLDPKAPSIA CIFKNAGYAT GHFGKWHLGG GRDVTDAPGF EQYGFDEHAS TYESPDPDPL LTATNWIWSD KDSIKRWDRT AYFVDKALSF LRSHTGQPCF INLWPDDVHT PWVPRRSDGD TARLKPEEEA ALRKVLKEYD IQIGRFLAEL KRSGLDKNTI VIFTSDNGPL PTFRNSRTLG LRGSKLSLYD GGTRMPFVIN WTGHIKPGSI DSTSMITGLD LLPTLAGMAG IKLPKDYHGD GVDRSAVFTG RPSARNKDMF WEYGRNNIAY AYPKELAWNR SPQLAVRSGE WKFLMNADRS EPALYNVKLD PGESLDMSGI RPDLVKRLST DLMNWWTAMP KLQ
|
| |