Gene Cpin_1803 details

Gene Information

Locus tag: Cpin_1803
Symbol:
ID: 8357954
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 2193958
End bp: 2197050
Gene Length: 3093 bp
Protein Length: 1030 aa
Translation table: 11
GC content: 49%
IMG OID: 644963991
Product: hypothetical protein
Protein accession: YP_003121500
Protein GI: 256420847
COG category:
COG ID:
TIGRFAM ID:
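
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 2193958, 2197050

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 3093
print(protein_length_aa(length))  # 1030
```

Both results agree with the listed Gene Length (3093 bp) and Protein Length (1030 aa).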


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000714199
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTAAGC GATTATTAAC AATATGTGTT GCCTGCTGGC TGCCTTTACT GGCAGCCGCA 
CAGGCAAAGA AAACAAAAGC GCCTTTACCT CCTGTATCAT GGGAAAAAGG ACAGCTGGCG
TATCAGCCGG ATGAAAAAGG GAATCGTGTT CCTGATTTTT CCTGGTGTGG ATATATGGCC
GGCGAACAAG CATTGCCCCT GACCCCTATA AGGGTCAGGG TGCCTGTTAT GAAGGGGGAT
GCGACCGCTA CCATACAGGC GGCGCTGGAC TATGTAGCGT CCTTACCATT GAAAGAGGGT
GTACGTGGCG CTGTATTGCT GGATAAAGGC ACTTTTGAAA TCAGCGGTAG CCTGCGGATC
AATGCGTCAG GTGTCGTGTT GAGAGGAAGC GGCGACAGTA CGGTATTACT GGCGACAGGC
TATAGTCGTG CGATACTCAT ACATGTCAGC GGTCGTAATG ATAAACAGCT GTCTCCTGCT
GTAAAAATCA CTGATGCTTA TGTGCCTGTC AACAGCCATG TTGTACACAT ACCAGCAGGT
GTATCATTGC AGGAAGGAGA CGAAGTGGAG ATAGTACGTC CCTGTACGCC TTCCTGGATA
CAACAACTGG GGACGGCCCA TTTTGGCGGT GGTATAACTG CCCTGGGCTG GAAACCGGGA
GATAGAGAAA TACACTGGTC ACGTAAGGTG ATGAAAGTAA CAGGAAATAC AGTTACCCTG
GATGTTCCGC TTACCAATGC ACTGGATACT GCTTATGGCA GTGCAACCAT TGCGGCATAT
AAATGGCCCG GATTGATTAC ACAGACAGGT GTGGAAAACC TGCATTGCCG TTCAGTCTTT
GATGCTGCCA ATCCGAAGGA TGAAGATCAT TGCTGGACAG CCATCGCAAT AGAAAATGCC
ACAGATGCCT GGGTAAGACA GGTCAGTTTT GAGCACTTTG CCGGTTCTGC AGTAGCAGTG
CTGGAAACAG CCCGCAGGGT GACCGTAGAA GATTGTATAT CAACAGCACC GGTCTCAGAG
ATCGGTGGAC AACGCCGGTA TACCTTCTTT ACTGCTGGTC AGCAGACTTT GTTCCAGCGT
AACTACGCAC AGTACGGTTA TCATGATTTT GCTGCCGGTT TTTGTGCAGC AGGTCCGAAT
GCTTTTGTAC AGTGTATGTC TGATATGCCC TACAGTTTCA GTGGTGCGAT TGATAGCTGG
GCGACTGGTC TATTGCTGGA TAATGTAATT GTTAACGGTC AGACTTTAGG TTTTCCCAAC
CGTGGTCAGG ACGGACAGGG AGCAGGATGG ACCGCTGCGA ATAGCGTACT ATGGCAATGT
GCGGCTGCAC GAATAGACTG TTATCGTCCA CCAAGCGCTA ATAACTGGGC TTTCGGTGCA
TGGGCGCAGT TTGCAGGAGA TGGTGAATGG TATGCTTCCA ATGAATACAT CCAGCCACGC
AGTTTATACT ATGCACAACT GTCCGCCCGC CTGGGTGATA AAGCTGCAGC ACGTGCATAC
TTGTTACCGG AACTGGGAGA CGCTTCCAGT AGTCCGACAG CTGATGTTGC TGCCGCATTG
TCCCGACAAT CTATGCAACC TGCCACTACT TTATACGAAT GGATCGGTAT GGCGGCAACC
CGTACACCGA TTCCTGTCAA TGCCGACGGC GCACGTATAC AGGCGCCTGT TAAACAGGCA
GTATCAGCTG CGGTACCTGG TATGAAAGTC GTGAACGGAT GGCTGGTGAA TAATGGCGAT
GTGATGACAG GAGACCGTAG AGATGTATCC TGGTGGAGAG GGAATATCAG ACCGGATGGA
ATACAGGAGG CTACGCCGCA TATTACCCGT TATGTGCCGG GTAGAACAGG TACCGGACTG
ACAGATGATC TGGACTCCGT ATCTGCCTGG ATGAGCCGTA AACATATCGT AGCGATTGAT
CATAATTACG GACTATGGTA TGAAAGAAGA AGAGATGATC ATGAGCGGGT ATTGCGACTG
GATGGAGATG TATGGCCACC GTTTTATGAA CAGCCTTTTG CGCGTAGCGG ACAAGGAACA
GCCTGGGAAG GATTGAGTAA ATATGATCTG ACGAAATATA ACCGCTGGTA TTGGTATCGG
CTGCAACAGT TTGCCACACT GGCTGACAGG AACGGACAGG TATTAATCCA TCAGCAATAT
TTTCAGCACA ATATTATCGA GGCAGGTGCG CATTATGCGG ACTTCCCCTG GCGGCCAGCC
AACAATATCA ACCAAACCGG ATTTCCGGAG CCGCCACCTT ATGCAGGTGG TAAGCGCATC
TTTATGGCTG AACAGTTTTA TGATACCACC AATGCTGTCA GAAATAAACT GCACCGTGCT
TATATCCGTC AATGCCTGGA CAATTTCAGC AATAACCATA ATGTCATTCA GCTGATAGGG
GCTGAGTTTA CAGGTCCTTT GCATTTTGTG GATTTCTGGG CAGATGTGGT CAGGGGCTGG
GAGCAGGAGA AGCAGCAACA TTCTGTCATA GGATTAAGTA CTACCAAAGA TGTACAGGAT
GCTGTGCTGC AGGATGCACA GCGTGCGCCG GTAATCGATC TGATAGATAT CCGTTATTGG
CATTACCAGG AGAATGGAAC TGTTTATGCA CCACAGGGTG GACAGAATCT GGCTCCCCGT
CAGCACGCAC GTTTGCTGAA ACCAAAACGT AGTAATGAGA AAGAAATCTA TCGGGCAGTA
CGCGAATATC GCGATTTATA TCCGGGTAAA GCCGTGATGT ATTCTGCTGA TAGTTATGAT
AAATACGGCT GGGCTGTATT CATGGCAGGA GGTTCTCTCG CCAGTATTCC ATCCATAGCT
GTTCCCGGTT TTCTGTCTGC CGCTGCGGGT ATGCATCCGG CAGACATGCC GGGACTATGG
GCACTGAGCA ATAATCATGG TGATTATATC ATCTACAATG CCGGTGCGGC CAATATAGAT
ATTCCCGGCG CCACAGCTGC CTACAACGCT GTATGGATTG ATGCCGTGAA TGGCAGAACA
CTACTGAGTA AAAAGAATAT TAAAGCCGGT ACCGGCCATA CTTTGTCAGC ACCACAAGCA
GGTGCACAGG TACTCTGGCT GATACGCAAA TAA
 
Protein sequence
MTKRLLTICV ACWLPLLAAA QAKKTKAPLP PVSWEKGQLA YQPDEKGNRV PDFSWCGYMA 
GEQALPLTPI RVRVPVMKGD ATATIQAALD YVASLPLKEG VRGAVLLDKG TFEISGSLRI
NASGVVLRGS GDSTVLLATG YSRAILIHVS GRNDKQLSPA VKITDAYVPV NSHVVHIPAG
VSLQEGDEVE IVRPCTPSWI QQLGTAHFGG GITALGWKPG DREIHWSRKV MKVTGNTVTL
DVPLTNALDT AYGSATIAAY KWPGLITQTG VENLHCRSVF DAANPKDEDH CWTAIAIENA
TDAWVRQVSF EHFAGSAVAV LETARRVTVE DCISTAPVSE IGGQRRYTFF TAGQQTLFQR
NYAQYGYHDF AAGFCAAGPN AFVQCMSDMP YSFSGAIDSW ATGLLLDNVI VNGQTLGFPN
RGQDGQGAGW TAANSVLWQC AAARIDCYRP PSANNWAFGA WAQFAGDGEW YASNEYIQPR
SLYYAQLSAR LGDKAAARAY LLPELGDASS SPTADVAAAL SRQSMQPATT LYEWIGMAAT
RTPIPVNADG ARIQAPVKQA VSAAVPGMKV VNGWLVNNGD VMTGDRRDVS WWRGNIRPDG
IQEATPHITR YVPGRTGTGL TDDLDSVSAW MSRKHIVAID HNYGLWYERR RDDHERVLRL
DGDVWPPFYE QPFARSGQGT AWEGLSKYDL TKYNRWYWYR LQQFATLADR NGQVLIHQQY
FQHNIIEAGA HYADFPWRPA NNINQTGFPE PPPYAGGKRI FMAEQFYDTT NAVRNKLHRA
YIRQCLDNFS NNHNVIQLIG AEFTGPLHFV DFWADVVRGW EQEKQQHSVI GLSTTKDVQD
AVLQDAQRAP VIDLIDIRYW HYQENGTVYA PQGGQNLAPR QHARLLKPKR SNEKEIYRAV
REYRDLYPGK AVMYSADSYD KYGWAVFMAG GSLASIPSIA VPGFLSAAAG MHPADMPGLW
ALSNNHGDYI IYNAGAANID IPGATAAYNA VWIDAVNGRT LLSKKNIKAG TGHTLSAPQA
GAQVLWLIRK
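
The gene and protein sequences above are printed in space-separated blocks of ten characters, so any check has to strip the whitespace first. The following is a minimal sketch of how the two could be compared and how a GC percentage could be computed. It is not the tool that produced this record. It builds the codon table from the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). All function and variable names are hypothetical.

```python
# Codons enumerated in TCAG order with the standard amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence and first block of the protein.
first_gene_blocks = "ATGACTAAGC GATTATTAAC AATATGTGTT"
first_protein_block = "MTKRLLTICV"
print(translate(first_gene_blocks) == first_protein_block)  # True
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the full protein sequence and the listed GC content.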