Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpin_2850 |
Symbol | |
ID | 8359011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chitinophaga pinensis DSM 2588 |
Kingdom | Bacteria |
Replicon accession | NC_013132 |
Strand | + |
Start bp | 3516028 |
End bp | 3517719 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644965030 |
Product | sulfatase |
Protein accession | YP_003122530 |
Protein GI | 256421877 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00796001 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCGC GGAATGTATT ACTATCATCA TTGATCATTC TATCTTCCTG TACAGGGAGT CGTAAGAACA ACAAGACAAC AGTAAGTAAG GATGAAAGAC CCAATATCGT GCTGATACTG GCGGATGATC TTGGCTACTC AGATCTTGGT TGTTACGGCG GGGAGATACA AACACCCAAT CTCGATTATC TGGCAGCAAA TGGTCTGCGC TTCAGGCATT TCTATAATAC TTCGCGTTGT TGCCCTTCCA GGGCTTCCCT GCTGACCGGT TTATATAATC AACAGGCAGG TATCGGCGAG ATGACGACCG CGAGAGCAGA GGCCGGATAT AGGGGTTATA TCACCGAGAA TACGGTTACA CTGGCAGAGG TATTGAAAGA TGCCGGTTAC CATACGGCTA TGTCAGGCAA GTGGCATGTA TCCAATACAG TGGAACAGTC CACACCGGCA GCACAACTGA AATGGCTGAA CCACCAGGCA TCACATCCTT ACTTCTCACC AGTGGAACAA TATCCTGTCA ACAGAGGATT TGAAAAGTAT TACGGTAATA TCTTCGGTGT GGTCGATTAT TTTGATCCTT TCAGTCTGGT GAATGGTACG ACGCCGGTGG AAAGTGTTCC GAAAGACTAT TATCATACAG ACGCGATCAA CGATACAGCT GTCAGCTATG TGAGAGCGCT CAGTAAAGAA GATAAACCCT TCTTCCTGTA TGTTGCACAT ACTGCCCCGC ACTGGCCTTT ACAGGCATTA CCGGAAGACA TAAAGAAATA CGAACAGACT TATAAGGGTG GCTGGGATGT TATCCGGGAA GCTCGTTATA AAAGGATGGT AGCACAGGGG CTGATCGATC CCAAAACCAC ACCATTATCG CCCCGTATCA ATAACCAGCT GAGCTGGGAT AAAAATCCGG ATAAGGACTG GGATGCACGG GCCATGGCAG TACATGCTGC GATGGTTGAT CGTATGGACC AGGGCATTGG TCGCCTGATT CAGACATTAC GGGAAACGGG TAAACTGGAC AATACGATTA TCATATTCTT AAGCGATAAC GGCGCCAGTC CGGAGAACTG TATGCGATAT GGTCCGGGCT TCGATCGTCC GGGACAAACC AGGGATGGAA AAGAGATCAG TTACCCGGTA AAGAAAGATG TATTGCCTGG TCCTCAGACC ACCTTTGCTT CCATTGGAGA GCGTTGGGCG AATGTAGTGA ATACGCCTTA TCAATACGCG AAAGCACAGT CCTATGAAGG CGGTGTGCGT ACGCCGATGA TTGCCTACTG GCCGAAGGGT ATTAAAGCGA AAGGGGCGTA TGCAGACCAA CTCGCACACG TCATGGATTT CATGCCTACC TTCCTGAATG TGGCAAAAGC CAGTTATCCG CAGACTTATA AGGGGCATAG TATTACTCCT TCCACTGGGG TCAGTCTGCT ACCAGCTTTC GAAGGTAAAC AGGAGCCAGG ACATGATGTG TTATATAACG AGCATTTCAA CGCCCGTTAT GTACGAGCAG GCGACTGGAA ACTGGTCTCC TTGTCCGGCG ATTCCACCTG GCATCTCTAC AAGATCAACC AGGATGAAAC GGAATTAAAC GATCTGGCGG CACAGCATCC CGATGTAGTC GCCCGTATGA CGGCACAATG GAGACAATGG GCAAATACCC ACAATGTGTT TCCTAAACCC GGTAAAAAAT AG
|
Protein sequence | MKARNVLLSS LIILSSCTGS RKNNKTTVSK DERPNIVLIL ADDLGYSDLG CYGGEIQTPN LDYLAANGLR FRHFYNTSRC CPSRASLLTG LYNQQAGIGE MTTARAEAGY RGYITENTVT LAEVLKDAGY HTAMSGKWHV SNTVEQSTPA AQLKWLNHQA SHPYFSPVEQ YPVNRGFEKY YGNIFGVVDY FDPFSLVNGT TPVESVPKDY YHTDAINDTA VSYVRALSKE DKPFFLYVAH TAPHWPLQAL PEDIKKYEQT YKGGWDVIRE ARYKRMVAQG LIDPKTTPLS PRINNQLSWD KNPDKDWDAR AMAVHAAMVD RMDQGIGRLI QTLRETGKLD NTIIIFLSDN GASPENCMRY GPGFDRPGQT RDGKEISYPV KKDVLPGPQT TFASIGERWA NVVNTPYQYA KAQSYEGGVR TPMIAYWPKG IKAKGAYADQ LAHVMDFMPT FLNVAKASYP QTYKGHSITP STGVSLLPAF EGKQEPGHDV LYNEHFNARY VRAGDWKLVS LSGDSTWHLY KINQDETELN DLAAQHPDVV ARMTAQWRQW ANTHNVFPKP GKK
|
| |