Gene Cpin_1886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_1886 
Symbol 
ID8358037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp2294751 
End bp2297942 
Gene Length3192 bp 
Protein Length1063 aa 
Translation table11 
GC content46% 
IMG OID644964074 
ProductTonB-dependent receptor plug 
Protein accessionYP_003121583 
Protein GI256420930 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATCCCC GCATCCAGAC GCCCATACTA TGTCAAAAAA GGGTTCAAAC ATTTCAACCA 
AAAGGAGTAT CTATGAGAAG TAAACTACGC TTTAACAAAG CATGCGTTAC TGCAGGAGTA
ATGCTCGGTC TGATGCTTAC GGAAGGCATA CAGACCTCAT ACGCAGCTCC TATGCCCCTC
TCCAGCATGA GGCAGCAGAA GCAGGTAAGG GGTGTCGTTA GGGACAAAAA CGGCAATCCC
ATCCCAGGTG TCAACGTTCT GGTAAAAAAC GGCGCAGCAG GAACTGCTAC CGGACCAGAC
GGAAAATTTG TTATCTCTGT TAAATCCCCT ACTGACATTC TCGTATTCAG ATTCATCGGC
TATCTCACAC AGGAAGTACC GGTCGGCGAA CAAAGTTCTA TTGATGTAAC CCTGCTCGAA
GATGTTGCTA ACATTAATGA AGTTGTAGTA ACAGCATTGG GTATACGTAA AGAAGCGCGT
AGCAATGGTT ACGCCGTATC AAAAGTACAG GGCGCTGATA TGACCAAAGC AAGAGAGATC
AGCGTAGCCA ATGCACTGGT AGGCCGCGTA GCCGGTGTGA ACAGTGCTCC TCCTGCTACA
GGTCCGGGTG GTTCTTCCCG CGTAACAATC AGAGGTAACT CTTCCCTGAG TGGTAACAAC
CAGCCGCTGT ACGTAGTGAA CGGTGTTCCT ATGAACAACG CGAACCTCGG TTCTGCTGGT
AAATGGGGTG GTACTGACCT TGGTGATGGT ATCTCCAGCA TTAACCCGGA TGATATCGAA
GATATGACCA TCCTGAAAGG AGCAGCAGCA TCTGCATTAT ACGGTCAGCG TGGTGTAAAC
GGCGTTATCC TGATCACTAC CAAATCAGGT AAAGCAGGTC AGATGCGCGT TGAACTGAAC
AGCAACGTAA CCGTTGAAAA AGTAAATGAC TTCTTCGATT TCCAGGACGT ATATGGTCAG
GGTATCAAAG GCGCAAAACC TACCGATCAG CAGTCAGCAC TGAACAGTGG CTTGTCTAGC
TGGGGTAGCA AACTGGATGG TTCTTCCACT ACATTATTTG ATGGTAAAGC ACATCCTTAC
TCCAAACAGG GTAACCATAT TAAAGACTTC TATAAAACAG GCGCTACTTA CAGCAACACG
TTATCCATCT CAGGCGGTAG TGATAAAACC ACTTACCGTG TGGCATTGGG CGACCTGCGC
AGCAAGGGCG TTTACCCAAA TACAGAGTAT ATCCGCGATA ACGTAAACAT TGATGTCAAC
TATAAGTTAT CTGACAAATT CAGCGGACAG ACCAATGTTA TTTACGCGAA AGAAATTACC
AATAACCGTT CTAACCTGAG TGATGCTCCT GGTAACGGTA ACTATGGTAT CGCATTCCTG
CCGGCCAACG TAGCTGCCAG TTACCTGGAA CCAGGTTATG ACGGACAGTT TAAAGAACTG
GTATATAGCA GCGACCTTTT CAGTACGAAT CCTTTCTTTG CAGCAAACCG CTTCAGGAAC
AATACAAAGA AAGACCGTGT TATCGGCGTA ACCAGTTTAC GTTATACGCC TTTACCATGG
TTATTTGTAC AGGCTCGCGT AGCGAATGAC TACTTTGGTT TCAATGCTAC TTCCATCACT
CCTACCGGTA CTGCTTACCG CCCTGCGGGA AGCCTGGACC TGGAGCGTAA CCGCCAGTTC
AATGAAACCA ACGTGGACGC GTTACTGGGT ATCAACAAAG AAATCACCAA CAAGTTAAAC
CTGAGCTTCA CTGCCGGCGC CAACCTGTTA AAGAAGGTGG ACAAGGTAAA TGATGTTACT
GCCTCCAATT TTGCTTTCCC ATTTGTTTAC AACCCGGCTA CTGCTGCTAC GAAGAACAGT
ACCATCATAC AATACAATAA ACAGGTACAG TCCGTATACG GATCATTGGA ATTGTCCTGG
GCCAGCACTT TATATCTGAC GGTTACTGAC CGTAACGACT GGAGTAGTAC ACTGCCTAAG
AAAAATAATT CCTATAACTA TCCATCTGTA AGTGGTTCCT ATGTTTTCTC TGAAAACATT
AAGACCAACT GGCTCAGCTT TGGTAAGATC AGAGCAGGTT ATGCACAGGT AGGTGGTGAT
GCGGATGAAT ATAAAACTGC GCTGTACTAC AGCACACTGG GAAATACTGT CAATGGCGTA
CCACTGGGCG ACATCGACAA CAAAATTCCT AACAAAGAAC TGAAACCACT CAAAGTAAAA
GAAATAGAAG TAGGCGCTGA ACTGAAACTG TTTGACAATC GTTTATTTGG TGATTTCAGC
TGGTACAACA AACAGGCTTC TAATGACATC GTAACGGCTA CAGTATCTTC CGGATCTGGT
TACACCAGCG CGCTGGTGAA CGTAGGTAAA CTGGAAAACA AAGGTATTGA AGCGATGATC
GGTGGTACAC CTGTGAAGAC AGCTAACTTC TCCTGGACGA CTACGTTCAA CTTTGCCCAC
AACAAAAACA AAGTAATACA GCTGGCAGAA GGTCAGGCAT CTATGCTGGT GGAAGGTGGT
GAATCCCGCA CGGAAGATGG TTTCATCAAC CACGTGGTAG GACTGCCATA TTCACAGATC
ATGGTGTATG ACTTCAAACG TAACGCGAAA GGAGAATTGA TCGTAGATGC ATCAGGTGTT
CCTCAGCGTA CAGACGCACT GATCGCAGCT GGTTCAGGCG TAGCGCCTTA CACTGGTGGC
TGGAGCAACG AACTGTCTTA TGGCAAGTTC CACCTGAGCT TCCTGATCGA CTTCAAATCA
GGTGGTAAAA TCTATTCCGG TACCAATGCC AACGCTTATA GCTATGGTCT GCACAAAGAG
ACACTGGCAG GACGTGAAGG CGGTGTTGTA GTAACCGGTG TAACTGAAAG TGGCGACGCA
AAAACAACGA CTGTTGTGGC GGACGACTAC TACTCAAGAC TGTCCAGCAT CTCTGCATTG
CAGGTATATA AATCAGACTT CATCAAATTC CGTTCACTGG CACTGACATA TGACTTTTCC
AGAGCAGCAC TGCACGAAAA ACTGAATGGT ATCAGCATCT CTCTGGTAGG ACGTAACCTT
TTCTACATCA AGAAATCAAC GCCTAACATC GATCCTGAAT CCAACTACAG CAACAGCAAC
GCACAGGGTC TGGAATATGC AGGTCTGCCT ACTACCCGTT CTATCGGTGT GAACCTGAAT
GTTAAATTCT AA
 
Protein sequence
MHPRIQTPIL CQKRVQTFQP KGVSMRSKLR FNKACVTAGV MLGLMLTEGI QTSYAAPMPL 
SSMRQQKQVR GVVRDKNGNP IPGVNVLVKN GAAGTATGPD GKFVISVKSP TDILVFRFIG
YLTQEVPVGE QSSIDVTLLE DVANINEVVV TALGIRKEAR SNGYAVSKVQ GADMTKAREI
SVANALVGRV AGVNSAPPAT GPGGSSRVTI RGNSSLSGNN QPLYVVNGVP MNNANLGSAG
KWGGTDLGDG ISSINPDDIE DMTILKGAAA SALYGQRGVN GVILITTKSG KAGQMRVELN
SNVTVEKVND FFDFQDVYGQ GIKGAKPTDQ QSALNSGLSS WGSKLDGSST TLFDGKAHPY
SKQGNHIKDF YKTGATYSNT LSISGGSDKT TYRVALGDLR SKGVYPNTEY IRDNVNIDVN
YKLSDKFSGQ TNVIYAKEIT NNRSNLSDAP GNGNYGIAFL PANVAASYLE PGYDGQFKEL
VYSSDLFSTN PFFAANRFRN NTKKDRVIGV TSLRYTPLPW LFVQARVAND YFGFNATSIT
PTGTAYRPAG SLDLERNRQF NETNVDALLG INKEITNKLN LSFTAGANLL KKVDKVNDVT
ASNFAFPFVY NPATAATKNS TIIQYNKQVQ SVYGSLELSW ASTLYLTVTD RNDWSSTLPK
KNNSYNYPSV SGSYVFSENI KTNWLSFGKI RAGYAQVGGD ADEYKTALYY STLGNTVNGV
PLGDIDNKIP NKELKPLKVK EIEVGAELKL FDNRLFGDFS WYNKQASNDI VTATVSSGSG
YTSALVNVGK LENKGIEAMI GGTPVKTANF SWTTTFNFAH NKNKVIQLAE GQASMLVEGG
ESRTEDGFIN HVVGLPYSQI MVYDFKRNAK GELIVDASGV PQRTDALIAA GSGVAPYTGG
WSNELSYGKF HLSFLIDFKS GGKIYSGTNA NAYSYGLHKE TLAGREGGVV VTGVTESGDA
KTTTVVADDY YSRLSSISAL QVYKSDFIKF RSLALTYDFS RAALHEKLNG ISISLVGRNL
FYIKKSTPNI DPESNYSNSN AQGLEYAGLP TTRSIGVNLN VKF