Gene Cpin_1608 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_1608 
Symbol 
ID8357750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp1952084 
End bp1955389 
Gene Length3306 bp 
Protein Length1101 aa 
Translation table11 
GC content52% 
IMG OID644963788 
Productcollagen triple helix repeat protein 
Protein accessionYP_003121305 
Protein GI256420652 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.712053 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAT TATATCTACT GATCATATTA ATGGCTGCTG TCTGTGCGGG CGCCTATGCC 
CAGAACGGAC TGGCGGGTAC CAATTACCAG GCGGTAGTCC GCAATACCAA TGGTACGGTT
GTCGCCAATA AAGACATCTC TGTTCGTATC TCCGTTCTTG GCGGTTCTGC TGCCGGTCCG
GTGCAATATG AAGAAACACA TGAAGTGAAA ACCAGCACAC TGGGACTCTT CAATCTGCAG
ATCGGTAAGG GTAACCCTAC AACCGGCACC TTTGCAGGTG TGCCCTGGGC TAATGCTAAC
CAATATCTGA AAGTGGAGGT AAATACCGGC AGTGGCTTTG TTACCCTCGG TACCACTGAA
CTGCTGAGTG TTCCCTATGC ATTATATGCT GCGAATGGTA CTCCCGGCCC GGCTGGCCCC
ACAGGACCTC AGGGTCCGGC TGGCCCAACT GGTGCAAAAG GTGATGCCGG CGCAACCGGA
CCAGCTGGTG CCAAAGGAGA TACTGGTCCG GCTGGAGCAG CTGGCCCTCA GGGAGCACCA
GGTGTAGCAG GTCCGGTAGG ACCTGTAGGA CCGGCAGGCG CTAAAGGAGA TGCAGGAGCA
GCTGGCCCCC AGGGAGCGCC AGGTGTAGCA GGTCCAGTTG GTCCTGTAGG ACCGGCGGGT
GCTAAAGGAG ATGCAGGAGT CGCTGGTCCT CAAGGTCCAG TTGGCGCAAT TGGTCCGGTA
GGACCCGCAG GTGCAGCAGG TCCACAAGGT AATCCAGGCC CGGTAGGCCC CATCGGACCA
ATAGGTCCGG TAGGTCCGGC AGGTGCAACA GGACCACAGG GCGATCCGGG TGTAGCAGGA
CCAGCGGGAC CAGTAGGACC GATAGGACCA GCAGGCGCCA CAGGCCCACA AGGTGATGCA
GGTGTAGCCG GACCAATAGG ACCGGTAGGA CCGGCGGGTG CAGCTGGCGT AGCAGGCCCT
GCAGGTCCAC AAGGTCCTGT AGGACCAGTG GGACCGGCAG GGCCGGCGGG TGAGATCAAT
GGTGCAGCAG CAGGCGGTGA TCTGAGTGGT ACCTATCCGA ATCCGAATGT AGTACGGTTA
CAGAATATCC CCGTATCAGC AACAGCTCCT CTGGCCGGAC AGATCCTCCG GTATGATGGT
ACCAACTGGC TGCCAAGTGT TGCCGGTGGC GGATTCACGC TCCCATATGT CACTGTGGAA
AACAATGCTT CTACCCTGTT CTCACTGACA AACGATGGAG ATGGTACTTC TATAGAAGGG
GTAAATAACA CCACCACTTC CAGTATTGCA GCTATACGTG GTATTGTAAA CAGTACTGCT
CCAGGTGGAT TTTCCAGTGC AGTAAGAGGT ATTAACAATG GTACCGGCGG ACTGGGTATC
GGCGTATGGG GTTCTCAGGC CGGATCTGGC TGGGGCGTAT ATGGTGTTAC TCCCAATGGT
CTTGGTGTAT ATGGTAACTC TTCCGCCAAT GGTTACGGCG TATTTGCCAA CAGTAACAGC
GGTATTGGTC TGAATGCCAC TTCTGTAAAT GGCATCGCAG CCAGTATTAA CATTAATAAT
AACGCCAATA ATAACAATGC ACTGAACGTC ACTTCTGTTG GTAATGGTAC TGTTGTCAAT
GTCAGCACAA CCGGTAATGG AACAGGTGTA TTAAGTAGTG CAGGTGCAGG TTTTGGCGTA
CATGGCATTA CCTCCGAACA AACATCTGCC GGTATTGTCG GTGATAATAA CGGAGCCGGG
GAAGCGATCG TAGGTCGTAC AACATCCGAT ATTGCAGGTG CCGTAGTCGG CCGTAATGAT
GGCGGAGGAT ATGGAGTAAG AGGTTTTGTT GCTACCAGCA CTGCCAACAC TGGTATCGGT
GTATATGGCC AGGTAGGTAT TAATAACAGT ACCGGTATGG CGGGTAAATT TGAAAACTTC
AACCAGGACA ATACAGAAGC AAATATCCTG GACGTAGTGA GTAACAGTAA CGGTAATATT
CCTGACAATA CCCTGGGTAA TGCCTCTTCC TTCCTGCTTG ACAACAACAA CAGCGTAGGT
GCTGCGGTGA GAGCAGAAGT AAATACCATT TTTGGCAACT TTGGTGCAGC AGGTGTATTC
GGTATTTCTT CCGGTACTGG TGGTCGTGCG GGTCTGTTCT ATGCCTCCAA TCCGGCTGGT
AATGGCGCTT CGCTGATCGC GCTGACGGAT GGTAACGGTA ATGCGATCAC TGCGAATGCA
GGCAAAGATG GCAATGGTGT TGAAACCAAT ATCGATGGAG CAGGTACTGC CCTATATGCA
TGGGTGCCTA CTTTCAGTGA AGGCCGTGCG GGAAGGTTCG AAATCTTTAA CGAGGATAAT
GAAAATCCGG TGATCACAGT GAAAACTGTC GGCAATGGTA CTGCGGGCAA TTTCCTGGTT
GACAGAGTAA CCGGTACTTC TCCAGCGGTA AAAGGTGAAG TGAACTCTCA ATTCGCCAAC
TTTGGAACAG CCGGTATATA CGGCGTGTCG TCTGGTACAG GTGGCTATGC AGGGCTTTTT
TATGCATCCA ACGCGGCAGG TAATGGTCCG TCGGTACTGG CACTTACAGA CGGTAATGGC
AATGCTATCA CTGCCAATGC CGGTGGCAAC GGCGATGGTA TAGAAGCCAG TTGCGACGGT
GGTGGTAATG CTGTTTCCGG ATTTGTCCCC AATTTTGGTA GTGGCAGAGC AGGAAGATTC
GCAAATTTCA ACAATTCGAA CGGTCTGCCT GTTGTACATA TCACGACCAC TGGTACCGGC
AGTACATTAT TGGTCAACCA TCAGGGGGCA TCCGGTAACA TCGCACAGTT CCAGAGTGCC
AGCGGTAACG TTGCGCGTAT CAACAAAGCA GGTCGCGGAT TCTTTAACGG TGGCACACAA
AACAGTGGTG CGGACGTTGC AGAAGCATTT GATGTAAAAG GTCATGTGAA TCAGTATGCA
CCAGGTGATG TACTGGTGAT CGCTGAGGAC GCAGACAGAA CAGTTGCATT GTCTTCCAAG
CCATATTCTA CCCTGGTAGC AGGTGTATAC GCCACTAAAC CGGGTGTATT ATTAACAGAA
GAAAATATTG ACGATGAACT GGCAGATAAA GTGCCAATGG GTGTGATCGG TGTTATTCCG
ACTAAAGTAT GTGGAGAAGG GGGCGCTATC CGCAGGGGCG ATCTGCTGGT GACTTCCAGC
AAAGCGGGTC ACGCCATGAA AGCTGATCTG GATAAAGTAA AACCAGGTCA GGTGATTGGA
AAAGCACTGG AAACTTTTGA TGGCCAGGAT ACAGGGCTGA TCAAAGTACT TGTAAACGTA
AGATAA
 
Protein sequence
MKKLYLLIIL MAAVCAGAYA QNGLAGTNYQ AVVRNTNGTV VANKDISVRI SVLGGSAAGP 
VQYEETHEVK TSTLGLFNLQ IGKGNPTTGT FAGVPWANAN QYLKVEVNTG SGFVTLGTTE
LLSVPYALYA ANGTPGPAGP TGPQGPAGPT GAKGDAGATG PAGAKGDTGP AGAAGPQGAP
GVAGPVGPVG PAGAKGDAGA AGPQGAPGVA GPVGPVGPAG AKGDAGVAGP QGPVGAIGPV
GPAGAAGPQG NPGPVGPIGP IGPVGPAGAT GPQGDPGVAG PAGPVGPIGP AGATGPQGDA
GVAGPIGPVG PAGAAGVAGP AGPQGPVGPV GPAGPAGEIN GAAAGGDLSG TYPNPNVVRL
QNIPVSATAP LAGQILRYDG TNWLPSVAGG GFTLPYVTVE NNASTLFSLT NDGDGTSIEG
VNNTTTSSIA AIRGIVNSTA PGGFSSAVRG INNGTGGLGI GVWGSQAGSG WGVYGVTPNG
LGVYGNSSAN GYGVFANSNS GIGLNATSVN GIAASININN NANNNNALNV TSVGNGTVVN
VSTTGNGTGV LSSAGAGFGV HGITSEQTSA GIVGDNNGAG EAIVGRTTSD IAGAVVGRND
GGGYGVRGFV ATSTANTGIG VYGQVGINNS TGMAGKFENF NQDNTEANIL DVVSNSNGNI
PDNTLGNASS FLLDNNNSVG AAVRAEVNTI FGNFGAAGVF GISSGTGGRA GLFYASNPAG
NGASLIALTD GNGNAITANA GKDGNGVETN IDGAGTALYA WVPTFSEGRA GRFEIFNEDN
ENPVITVKTV GNGTAGNFLV DRVTGTSPAV KGEVNSQFAN FGTAGIYGVS SGTGGYAGLF
YASNAAGNGP SVLALTDGNG NAITANAGGN GDGIEASCDG GGNAVSGFVP NFGSGRAGRF
ANFNNSNGLP VVHITTTGTG STLLVNHQGA SGNIAQFQSA SGNVARINKA GRGFFNGGTQ
NSGADVAEAF DVKGHVNQYA PGDVLVIAED ADRTVALSSK PYSTLVAGVY ATKPGVLLTE
ENIDDELADK VPMGVIGVIP TKVCGEGGAI RRGDLLVTSS KAGHAMKADL DKVKPGQVIG
KALETFDGQD TGLIKVLVNV R