Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpin_3297 |
Symbol | |
ID | 8359463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chitinophaga pinensis DSM 2588 |
Kingdom | Bacteria |
Replicon accession | NC_013132 |
Strand | + |
Start bp | 4061241 |
End bp | 4064066 |
Gene Length | 2826 bp |
Protein Length | 941 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644965470 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_003122965 |
Protein GI | 256422312 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.000222307 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAACTG TAATCAGAAG ATCCTGTAAG CTACTGCCAG CTATCCTGTC AATGCTGCTT TTACAGCCGG CTGTCGCCCA ACAACCTGCC ATACCGCATA CTGACACCCT GCGCCCTGAC GTATGGCCTG CACCGGGTAA CAGGCTGCAT CAGCCCTGGA CCCGCTGGTG GTGGGTAGGA AGTGCGGTAG ATGAGAAAAG TCTGGATGCG TCGTTGAAGA CATTGCATGA TGCGGGATTT GGCGGTGTTG AAATAGCCCC GATCTATGGG GCGAAAGGAT ATGAATCCCG ATACGTTAGT TTTCTCTCGC CACAGTGGAT CGACCTGCTG CGCTATACCG TACAAACAGC TGCCGGATAT AATATGGGCG TAGATCTGAC CACCGGTACG GGATGGCCGT TTGGCGGTCC GCAGCTCACA AAAGCACAAG CCGCTTCCCG TCTCATTATA CAAACCTATC CTGTAAAAGG CGGTACTCCC TTTACAGCAC AGATACGGAT CAACGATAAT AAACAACAAG GGGCTGCTTT ACAGGCCCTG ACCGCTTACA ATGGCAAACA ACGTATCTCC CTGATCAGTA AAGTAGATAA GGAGGGTAAA CTCCAATGGA TACCGCCTGC GGGTGACTGG ACCCTGTATG CTTTGTTTAG CGGCTGGACG CTGCAGCAGG TAAAAAGGGC TGCTCCGGGT GGAGAAGGAT ATACCCTGGA TCATCTTTCT GCCACTGCCT TACCGGTATA CCTGCAACGT TTTGATACTG CCTTTAAAGG CAAGTCGCCG GGTGTACGCT CCTTCTATAA CGATAGCTAT GAAGTATATA ACGCAGACTG GACGCCTGCG TTTTTCAGCG CCTTTAAAAC ACACAGGGGC TATGACCTCC GGGAATACCT GCCGGAACTC ATCAGCGAAG AGAAGAGCGA AACGGCAGGC AGGGTAAAGT CTGACTACCG CGAAACCATG TCCGAATTAC TCCTTGACAA TTTTACTCGT CCCTGGACCC AATGGGCACA TCGCTACAAT AGTGTGTCAA AGAACCAGGC GCATGGCTCT CCCGGTAATC TCCTTGATCT GTACGCAGCA GTCGATATCC CTGAATGCGA AACCTTCGGC TCTACTTATT TCCCGATCCC CGGACTCAGA AGAGACAGTG CCGATATCCG GATTGTAGCT CCTGATCCGG TTATGTTGAA ATTCGCCTCC TCCGCTGCGC ATGTAACAGG ACATCCCCTG GTGTCCTGTG AGACATTCAC CTGGCTGACG GAGCATTTCA AGACCTCCCT TTCCCAGTGT AAACCTGAAG TGGAACAGGC ATTTCTGGCC GGTGTCAATC ATGTATTTTA TCATGGGAAT ACCTTTTCTC CGCCGGATGT ACCCTGGCCA GGCTGGTTGT TCTATGCCTC GGTCAATTTC GTACCTACTA ATAGTTGCTG GCCGCACCTG ACAGGACTGA ACAACTACAT TTCCCGGGTA CAATCCGTAC TACAGACCGG CATCCCTGAC AATGAACTGC TGATCTACTG GCCGGTATAT GATTGCTGGG AAAATCCCAA AGGGAAAGAT TTGCCATTGA AAGTCCATGA TATCGATGAA TGGCTGCACC CTACACCTTT CTATAAACAG GTAAAAAGTT TGCAACAGGC AGGGTATTCA CTGGATTTTA TTTCAGATAA ACAACTGGGC AAGACGCTGG TGCAGGGAAG AAAACTCATT ACGAGTACCA ATGCAGCCCC CTATAAGGTG CTGATCATTC CTGCAATGAA ACTTATGCCC CTGCAGACAT TTCAGCAGAT CATCAAACTT GTAGAAGCGG GAGCTACTGT CATCTTTGAA TCGATGCCGG AAGATGTTCC TGGTCTGCAT GATCTTGATA AACGCAGACA GCAGCTGAAA GCACTGCAGC AAAAAGTGAA AGGGGCTTTA TCAGGCAAAA TAGCCGGCAC CAGGCTGGGA AAAGGCACTA TATACAACTA TGACGGGATA CAGGAAGCCT TACACATACA GCATATCGAT GGAGAAAAAC TAACAGCGAA AGGCTTACAG TTTATCCGCA GAAAAGACAA AGACAGAACC TGGTATTATA TCGTCAATCA CACGGCGGAT GCTGTAGATG ATGCGGTCCC TTTTAACCAG ATGGGTGCTG AAGATACCGT CTTACTCCTC GATCCGATGA CAGGAGTATA TGGTCCCGCA ACAGTGGGCC ATCACCATGT GAATGATGTG GTCAGGATCC AGTTGCAACC CGGCCAGTCG CTGATAGTAA GAACCGGTCC GGTCACTGCC GCAGAGCAGG CGGTTGGCAA CTGGAGATAC CTGAATAAAC CAGGAACGCC GCTTCCTTTA TCGGGCAAAT GGGACCTGAC CTTTACCCAG GGAGGCCCTT TTAAGCCCGC CGACCGCCAA TTGGACCAGC TGGTATCCTG GACAAGTCTT TCCGATACCG CAGCAGCTTC CTATAGTGGT TCAGCGGTTT ATACCCAGAC ATTTACCCTG CCGGATAAGC TGGAAAAGGA ATATCTGCTG GACCTGGGGA AAGTGCACGA AAGTGCCAGG GTCACCATCA ACGGACAAGA CGCCGGTATC TACTGGGCCA TTCCTTTCCA GGGAAGGGTG GGGCAATATC TGCGTCCAGG TAAAAATGAG ATCAGGATAG AAGTAGCCAA TCTGATGGCC AACCGGATCC GTTATATGGA TCAACATGGC ATTCCCTGGC GAAACTATCA TGAGATCAAT TTCGTCAATA TCAATTATAA GTCTTTTGAT GCTGCAGGCT GGCCCCTGCA GGCTTCTGGT CTGATCGGTC CGGTCACATT GATCCCCCAT CAATAG
|
Protein sequence | MKTVIRRSCK LLPAILSMLL LQPAVAQQPA IPHTDTLRPD VWPAPGNRLH QPWTRWWWVG SAVDEKSLDA SLKTLHDAGF GGVEIAPIYG AKGYESRYVS FLSPQWIDLL RYTVQTAAGY NMGVDLTTGT GWPFGGPQLT KAQAASRLII QTYPVKGGTP FTAQIRINDN KQQGAALQAL TAYNGKQRIS LISKVDKEGK LQWIPPAGDW TLYALFSGWT LQQVKRAAPG GEGYTLDHLS ATALPVYLQR FDTAFKGKSP GVRSFYNDSY EVYNADWTPA FFSAFKTHRG YDLREYLPEL ISEEKSETAG RVKSDYRETM SELLLDNFTR PWTQWAHRYN SVSKNQAHGS PGNLLDLYAA VDIPECETFG STYFPIPGLR RDSADIRIVA PDPVMLKFAS SAAHVTGHPL VSCETFTWLT EHFKTSLSQC KPEVEQAFLA GVNHVFYHGN TFSPPDVPWP GWLFYASVNF VPTNSCWPHL TGLNNYISRV QSVLQTGIPD NELLIYWPVY DCWENPKGKD LPLKVHDIDE WLHPTPFYKQ VKSLQQAGYS LDFISDKQLG KTLVQGRKLI TSTNAAPYKV LIIPAMKLMP LQTFQQIIKL VEAGATVIFE SMPEDVPGLH DLDKRRQQLK ALQQKVKGAL SGKIAGTRLG KGTIYNYDGI QEALHIQHID GEKLTAKGLQ FIRRKDKDRT WYYIVNHTAD AVDDAVPFNQ MGAEDTVLLL DPMTGVYGPA TVGHHHVNDV VRIQLQPGQS LIVRTGPVTA AEQAVGNWRY LNKPGTPLPL SGKWDLTFTQ GGPFKPADRQ LDQLVSWTSL SDTAAASYSG SAVYTQTFTL PDKLEKEYLL DLGKVHESAR VTINGQDAGI YWAIPFQGRV GQYLRPGKNE IRIEVANLMA NRIRYMDQHG IPWRNYHEIN FVNINYKSFD AAGWPLQASG LIGPVTLIPH Q
|
| |