Gene Information |
Locus tag | Cpin_1608 |
Symbol | |
ID | 8357750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chitinophaga pinensis DSM 2588 |
Kingdom | Bacteria |
Replicon accession | NC_013132 |
Strand | - |
Start bp | 1952084 |
End bp | 1955389 |
Gene Length | 3306 bp |
Protein Length | 1101 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644963788 |
Product | collagen triple helix repeat protein |
Protein accession | YP_003121305 |
Protein GI | 256420652 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
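The coordinate and length fields above agree with each other, and the arithmetic can be sketched as follows. This is a minimal check, assuming the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1952084, 1955389)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 3306 1101
```

Both values match the Gene Length and Protein Length fields of the record.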
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.712053 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAT TATATCTACT GATCATATTA ATGGCTGCTG TCTGTGCGGG CGCCTATGCC CAGAACGGAC TGGCGGGTAC CAATTACCAG GCGGTAGTCC GCAATACCAA TGGTACGGTT GTCGCCAATA AAGACATCTC TGTTCGTATC TCCGTTCTTG GCGGTTCTGC TGCCGGTCCG GTGCAATATG AAGAAACACA TGAAGTGAAA ACCAGCACAC TGGGACTCTT CAATCTGCAG ATCGGTAAGG GTAACCCTAC AACCGGCACC TTTGCAGGTG TGCCCTGGGC TAATGCTAAC CAATATCTGA AAGTGGAGGT AAATACCGGC AGTGGCTTTG TTACCCTCGG TACCACTGAA CTGCTGAGTG TTCCCTATGC ATTATATGCT GCGAATGGTA CTCCCGGCCC GGCTGGCCCC ACAGGACCTC AGGGTCCGGC TGGCCCAACT GGTGCAAAAG GTGATGCCGG CGCAACCGGA CCAGCTGGTG CCAAAGGAGA TACTGGTCCG GCTGGAGCAG CTGGCCCTCA GGGAGCACCA GGTGTAGCAG GTCCGGTAGG ACCTGTAGGA CCGGCAGGCG CTAAAGGAGA TGCAGGAGCA GCTGGCCCCC AGGGAGCGCC AGGTGTAGCA GGTCCAGTTG GTCCTGTAGG ACCGGCGGGT GCTAAAGGAG ATGCAGGAGT CGCTGGTCCT CAAGGTCCAG TTGGCGCAAT TGGTCCGGTA GGACCCGCAG GTGCAGCAGG TCCACAAGGT AATCCAGGCC CGGTAGGCCC CATCGGACCA ATAGGTCCGG TAGGTCCGGC AGGTGCAACA GGACCACAGG GCGATCCGGG TGTAGCAGGA CCAGCGGGAC CAGTAGGACC GATAGGACCA GCAGGCGCCA CAGGCCCACA AGGTGATGCA GGTGTAGCCG GACCAATAGG ACCGGTAGGA CCGGCGGGTG CAGCTGGCGT AGCAGGCCCT GCAGGTCCAC AAGGTCCTGT AGGACCAGTG GGACCGGCAG GGCCGGCGGG TGAGATCAAT GGTGCAGCAG CAGGCGGTGA TCTGAGTGGT ACCTATCCGA ATCCGAATGT AGTACGGTTA CAGAATATCC CCGTATCAGC AACAGCTCCT CTGGCCGGAC AGATCCTCCG GTATGATGGT ACCAACTGGC TGCCAAGTGT TGCCGGTGGC GGATTCACGC TCCCATATGT CACTGTGGAA AACAATGCTT CTACCCTGTT CTCACTGACA AACGATGGAG ATGGTACTTC TATAGAAGGG GTAAATAACA CCACCACTTC CAGTATTGCA GCTATACGTG GTATTGTAAA CAGTACTGCT CCAGGTGGAT TTTCCAGTGC AGTAAGAGGT ATTAACAATG GTACCGGCGG ACTGGGTATC GGCGTATGGG GTTCTCAGGC CGGATCTGGC TGGGGCGTAT ATGGTGTTAC TCCCAATGGT CTTGGTGTAT ATGGTAACTC TTCCGCCAAT GGTTACGGCG TATTTGCCAA CAGTAACAGC GGTATTGGTC TGAATGCCAC TTCTGTAAAT GGCATCGCAG CCAGTATTAA CATTAATAAT AACGCCAATA ATAACAATGC ACTGAACGTC ACTTCTGTTG GTAATGGTAC TGTTGTCAAT GTCAGCACAA CCGGTAATGG AACAGGTGTA TTAAGTAGTG CAGGTGCAGG TTTTGGCGTA CATGGCATTA CCTCCGAACA AACATCTGCC GGTATTGTCG GTGATAATAA CGGAGCCGGG GAAGCGATCG TAGGTCGTAC AACATCCGAT ATTGCAGGTG CCGTAGTCGG CCGTAATGAT 
GGCGGAGGAT ATGGAGTAAG AGGTTTTGTT GCTACCAGCA CTGCCAACAC TGGTATCGGT GTATATGGCC AGGTAGGTAT TAATAACAGT ACCGGTATGG CGGGTAAATT TGAAAACTTC AACCAGGACA ATACAGAAGC AAATATCCTG GACGTAGTGA GTAACAGTAA CGGTAATATT CCTGACAATA CCCTGGGTAA TGCCTCTTCC TTCCTGCTTG ACAACAACAA CAGCGTAGGT GCTGCGGTGA GAGCAGAAGT AAATACCATT TTTGGCAACT TTGGTGCAGC AGGTGTATTC GGTATTTCTT CCGGTACTGG TGGTCGTGCG GGTCTGTTCT ATGCCTCCAA TCCGGCTGGT AATGGCGCTT CGCTGATCGC GCTGACGGAT GGTAACGGTA ATGCGATCAC TGCGAATGCA GGCAAAGATG GCAATGGTGT TGAAACCAAT ATCGATGGAG CAGGTACTGC CCTATATGCA TGGGTGCCTA CTTTCAGTGA AGGCCGTGCG GGAAGGTTCG AAATCTTTAA CGAGGATAAT GAAAATCCGG TGATCACAGT GAAAACTGTC GGCAATGGTA CTGCGGGCAA TTTCCTGGTT GACAGAGTAA CCGGTACTTC TCCAGCGGTA AAAGGTGAAG TGAACTCTCA ATTCGCCAAC TTTGGAACAG CCGGTATATA CGGCGTGTCG TCTGGTACAG GTGGCTATGC AGGGCTTTTT TATGCATCCA ACGCGGCAGG TAATGGTCCG TCGGTACTGG CACTTACAGA CGGTAATGGC AATGCTATCA CTGCCAATGC CGGTGGCAAC GGCGATGGTA TAGAAGCCAG TTGCGACGGT GGTGGTAATG CTGTTTCCGG ATTTGTCCCC AATTTTGGTA GTGGCAGAGC AGGAAGATTC GCAAATTTCA ACAATTCGAA CGGTCTGCCT GTTGTACATA TCACGACCAC TGGTACCGGC AGTACATTAT TGGTCAACCA TCAGGGGGCA TCCGGTAACA TCGCACAGTT CCAGAGTGCC AGCGGTAACG TTGCGCGTAT CAACAAAGCA GGTCGCGGAT TCTTTAACGG TGGCACACAA AACAGTGGTG CGGACGTTGC AGAAGCATTT GATGTAAAAG GTCATGTGAA TCAGTATGCA CCAGGTGATG TACTGGTGAT CGCTGAGGAC GCAGACAGAA CAGTTGCATT GTCTTCCAAG CCATATTCTA CCCTGGTAGC AGGTGTATAC GCCACTAAAC CGGGTGTATT ATTAACAGAA GAAAATATTG ACGATGAACT GGCAGATAAA GTGCCAATGG GTGTGATCGG TGTTATTCCG ACTAAAGTAT GTGGAGAAGG GGGCGCTATC CGCAGGGGCG ATCTGCTGGT GACTTCCAGC AAAGCGGGTC ACGCCATGAA AGCTGATCTG GATAAAGTAA AACCAGGTCA GGTGATTGGA AAAGCACTGG AAACTTTTGA TGGCCAGGAT ACAGGGCTGA TCAAAGTACT TGTAAACGTA AGATAA
|
Protein sequence | MKKLYLLIIL MAAVCAGAYA QNGLAGTNYQ AVVRNTNGTV VANKDISVRI SVLGGSAAGP VQYEETHEVK TSTLGLFNLQ IGKGNPTTGT FAGVPWANAN QYLKVEVNTG SGFVTLGTTE LLSVPYALYA ANGTPGPAGP TGPQGPAGPT GAKGDAGATG PAGAKGDTGP AGAAGPQGAP GVAGPVGPVG PAGAKGDAGA AGPQGAPGVA GPVGPVGPAG AKGDAGVAGP QGPVGAIGPV GPAGAAGPQG NPGPVGPIGP IGPVGPAGAT GPQGDPGVAG PAGPVGPIGP AGATGPQGDA GVAGPIGPVG PAGAAGVAGP AGPQGPVGPV GPAGPAGEIN GAAAGGDLSG TYPNPNVVRL QNIPVSATAP LAGQILRYDG TNWLPSVAGG GFTLPYVTVE NNASTLFSLT NDGDGTSIEG VNNTTTSSIA AIRGIVNSTA PGGFSSAVRG INNGTGGLGI GVWGSQAGSG WGVYGVTPNG LGVYGNSSAN GYGVFANSNS GIGLNATSVN GIAASININN NANNNNALNV TSVGNGTVVN VSTTGNGTGV LSSAGAGFGV HGITSEQTSA GIVGDNNGAG EAIVGRTTSD IAGAVVGRND GGGYGVRGFV ATSTANTGIG VYGQVGINNS TGMAGKFENF NQDNTEANIL DVVSNSNGNI PDNTLGNASS FLLDNNNSVG AAVRAEVNTI FGNFGAAGVF GISSGTGGRA GLFYASNPAG NGASLIALTD GNGNAITANA GKDGNGVETN IDGAGTALYA WVPTFSEGRA GRFEIFNEDN ENPVITVKTV GNGTAGNFLV DRVTGTSPAV KGEVNSQFAN FGTAGIYGVS SGTGGYAGLF YASNAAGNGP SVLALTDGNG NAITANAGGN GDGIEASCDG GGNAVSGFVP NFGSGRAGRF ANFNNSNGLP VVHITTTGTG STLLVNHQGA SGNIAQFQSA SGNVARINKA GRGFFNGGTQ NSGADVAEAF DVKGHVNQYA PGDVLVIAED ADRTVALSSK PYSTLVAGVY ATKPGVLLTE ENIDDELADK VPMGVIGVIP TKVCGEGGAI RRGDLLVTSS KAGHAMKADL DKVKPGQVIG KALETFDGQD TGLIKVLVNV R
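The sequences above are printed in space-separated blocks of ten characters. The gene sequence is shown in coding orientation (it begins with ATG and ends with the stop codon TAA), although the record places the gene on the minus strand. The sketch below shows one way to strip the block spacing, compute GC content, recover the opposite strand, and translate the CDS. All helper names are hypothetical. The codon assignments are the standard ones, which translation table 11 shares (table 11 differs only in which start codons it permits), and the sketch assumes an ATG start, as is the case here.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def reverse_complement(seq: str) -> str:
    """Sequence of the opposite strand, read 5' to 3'."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
fragment = clean_sequence("ATGAAGAAAT TATATCTACT GATCATATTA")
print(translate(fragment))  # prints: MKKLYLLIIL
```

The translated fragment matches the first block of the protein sequence. Applying `gc_percent` to the full cleaned gene sequence gives a value that can be compared with the GC content field.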