Gene Cpin_5521 details


Gene Information

Locus tag: Cpin_5521
Symbol:
ID: 8361698
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 7043539
End bp: 7046931
Gene Length: 3393 bp
Protein Length: 1130 aa
Translation table: 11
GC content: 45%
IMG OID: 644967667
Product: TonB-dependent receptor plug
Protein accession: YP_003125151
Protein GI: 256424498
COG category:
COG ID:
TIGRFAM ID:
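
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(7043539, 7046931)
print(length)                  # 3393
print(protein_length(length))  # 1130
```

Both results agree with the Gene Length and Protein Length fields in the record.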


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000129215
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.0524869
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAAAAC TACGTATGAA AAAAATTGCC ACGACCTTCA GAAGACGGGC ACTGTCCATC 
GTCTTTCTAG TACTCAGTGC GTCTCTTTTC CTGCCGGTAT ATGCAGGTAA AGGGCAAATC
CTGAAAGAAA CCAACGTTAC GATCCGGCTG AAAAGCGGAT CACTGGAGTC AGCGATACAA
GATCTGCACT CCTCCACCAA AGTCGCTTTT GCGTACGACA AACAACTGTT AAGATCATTC
CCTGTGTCCG ACTGTTCTTT CTCTAATGAA AGACTGGACA TAGTATTGGA GCAGCTGCTG
CGTAATAAGC AATTAGGTTT TGAAGAAGTA AATAATGTGG TTGTTATCAG CAAGGCTAAT
AACAACGCGT CTGCTAATGC TCCTAAGAGT ATCCGCGAAG ACATCGTTGT GTCAGGCGTG
GTGAAGGATG AAAGCGGTAA TCCTATGCCG GGTGTAAGCG TAGCTGTACG TGGAACCAGC
ACGATGACGT CTACCGGTGC TGACGGTAAA TTTAAACTGA TCGTACAATC ACGTGATGCG
GTGATCGGCT TCAGCTTTAT CGGTTATGAA ACCGCTGCTG TTACTGTCGG TACACAGACG
AGCGTAGAGG TACATATGAA AATAGCGAGC AAGTCCCTCG GTGAGGTTGC CGTTGTAGGT
TTCGGTACGC AGCGCAGGGT AAGCCTGGTA GGTGCGCAGT CCGAGATCAA ACCTGCGGAA
TTCAAACAAC CTACGGCTAA TATCAGTACG ATGCTGGCAG GTCGTATCGC CGGTGTGGTA
GGCGTACAAC GTTCCGGTGA GCCGGGTAGA GGCGCTGCCG ATCTGTGGAT ACGTGGTATC
TCCACTTTCG GTAATGGTAA TAATTCCGGT CCGCTGGTAC TGGTAGATGG TGTGGAACGT
TCTATCAATA ACATCTCTCC GGAAGATGTA GAATCATTCA CCGTACTGAA AGATGCTGCC
GGTACAGCGG TATATGGTGT ACGTGGTGCA AATGGTGTTA TTCTCATCAA AACAAAAACA
GGTAAAGTAG GTAAACCACA GATATATCTG GATTATAATG AGGGTGTAAA TACATTTACC
CGTCGTCCGG AAATGCTGGA CGGTATCTCT TATATGCGTC TCGCAAATGA AGCGTTGACT
ACCCGTAATC AGAATCCTAA ATATTCTGAA GAATATATTC AGAACACCAT CAGCGGAAAA
GATCCTTTGT TGTATCCGAA TGTAGACTGG ATGGACGCTG TATTCAATAA ATACGGCCAT
ACCCGCAGCA CGAACCTGAA TGCGAGTGGT GGTGTGGAGA ATGCACAATA TTATGTGTCC
CTGGGTTATT ACAATGAATC AGGTTTCCTG AAGACGGATG ACCTGGCAAA GTACAATTCA
TCCCTGAAGT ATAACAGGTA TAACTTCACC AGTAACCTGA ATCTGCGTGT TACTAAAACA
ACAAAACTGG ATGTAGGTTT ACAGGGATAT TTTTCCAATG GTAACTATCC CGGTATATCA
TCCGGAGACA TTTTCCAGAG TGCAATGGAC GCTGCTCCTG TCGCATATCC GATCATGTAT
CCAGGCGGCT TCGTACCCGG CCAATCATCA AACGGAGGTT TCCGTAACCC TTATGCCGAT
CTCACCCGCA GAGGTTATAC CAACGAGTTC CGCAATCAGC TTTATTCCAA CATCAGAGCT
ACCCAGGATA TGGACGCACT GACCAGGGGT TTGAAAGCTA GTGTGATGTT CGCGTTTGAT
TCTTACAACC AGAACAACAT TATCCGTTCA AAGAGAGAGG ATACTTATTA TCCTGACCAG
ACCAATCCTT ACAAACCGGA TGGTTCACTG AACCTGGTGA AAACATACAC CGGTAACCAG
TACCTGGGAT TTGATAATAG TCCTGGCAGT CGTCAGACCA GCCGTAAGTT CTATACAGAG
GCATCGCTGA ACTATGACAG AAGCTTTGGT AAACATCGCG TGGGTGGCCT GGCATTGTTC
TATTCAAGCG ATAGAACGAA CGCACTGGCA GGAGATTTCA TCAGCTCTAT CCCTGAGCGT
TCACTGGGGC TTGCAGGTAG AGTGACCTAT TCTTACTCAG ACAAATATTT CGTTGAACTG
AATGCAGGTT ATAATGGTTC TGAACTGTTT GCACCTGGTA ACCGTTTTGG TTTCTTCCCT
GCAGTAGGTA TAGGCTGGAT CGCATCCGAA GAGAAATTCT TTGAACCATT GAAAAATGCG
ATCAACTTCC TGAAATTCCG TTATTCAAAT GGTAATACTG GTTTGGGTAG CGCAGGCGAC
CGATTCCTGT ATATCACGAA TCTGAATACT TACAACGATG CTTATAAATA TGGTCAGGTG
CCACAGTTTG TAGGTGGTAT CAATATTGAC CGTTATGCAA CAAATGTAAA ATGGTCTGTG
TCTAACAAGC AGGACTTGGG CATTGAGTTC CGCACCTTGA ATGATCAGTT GTCTGTTATT
GTTGATCTGT TCAAGGAACA CCGTACCGGT ATCTTTTTGC AACGCGCTTC AAGCGTAGAC
TTTATGGGGC TGCAGAATCA ACCATATGCT AACCTGGGTA TCGTGGATAA CAAAGGTTTT
GATGCAACAC TCGAATACAG CACACGTTTT GGAGCAGTAG ACTTAAAACT GAGAGGTAAT
ATCACCTATA CAAAAGATAA GCTGATAGAA GATGATCGTC CGCCGCAGAA TTATCCATGG
ATGGAGCACA GAGGAAACAA CGTGCTGGCG CGTTACGGTT ATATCGCTGA AGGATTATTT
GGAAGCGAAG ATGAAGTAAA TAAGAGTGCA GTACCTGGAG ATAGATCTAA AGTTAAACCA
GGTGATATCA AGTATAAAGA CCTGAATGGA GATGGTCTCA TTAATAATTA TGATGTAACG
AAGATCGGTC GTGGCGATGT ACCAAGCACG GTATATGGTT TCGGTTTCGA CCTGGGTTAT
AAAGGCTTCA GCTTTGGTTT ATTGTTCCAG GGTATTTCGG ATGCAGATAG AATGCTGAGA
GGTTCCGGAA TTGTACCATT TAACGGTGGA GGTGGTGTAA CCAATGCTTA TGCGATCGCT
ACTGACAGAT GGACGGTAGA CAATCCGAAC CCGAATGCAT TTTATCCGAG ACTTGCTTAT
GGAGAATCTG AGAATATCAA TAATACGCAA GCAAGCTCCT GGTGGATTAA AGACGTGAGC
TTTGTGCGTC TTAAATCGGC ACAGCTGGCG TATAACTTCC CGGCAGCGAT GATGGGCAGA
ACAGGTATCC GCGCTGCCTC TGTTTATTTG CAGGGCATTA ATCTACTTAC TTTCAGCAAA
TTCAAACTGT GGGACCCTGA ATTGAACACA GACAATGGTT CCGCTTATCC GAACATCAGG
ACTATTTCCC TGGGTATGAA TCTGAAATTC TAA
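
The gene sequence is displayed in space-separated blocks of ten bases, so it has to be joined before any analysis. The following is a minimal sketch of how the blocks might be cleaned and how a GC fraction like the one in the Gene Information section could be recomputed; `clean_sequence` and `gc_content` are hypothetical helper names, and only the first line of the sequence is used here for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGCCAAAAC TACGTATGAA AAAAATTGCC ACGACCTTCA GAAGACGGGC ACTGTCCATC "
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 3))
```

Passing the full sequence instead of `first_line` would give the gene-wide value to compare against the reported GC content.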
 
Protein sequence
MPKLRMKKIA TTFRRRALSI VFLVLSASLF LPVYAGKGQI LKETNVTIRL KSGSLESAIQ 
DLHSSTKVAF AYDKQLLRSF PVSDCSFSNE RLDIVLEQLL RNKQLGFEEV NNVVVISKAN
NNASANAPKS IREDIVVSGV VKDESGNPMP GVSVAVRGTS TMTSTGADGK FKLIVQSRDA
VIGFSFIGYE TAAVTVGTQT SVEVHMKIAS KSLGEVAVVG FGTQRRVSLV GAQSEIKPAE
FKQPTANIST MLAGRIAGVV GVQRSGEPGR GAADLWIRGI STFGNGNNSG PLVLVDGVER
SINNISPEDV ESFTVLKDAA GTAVYGVRGA NGVILIKTKT GKVGKPQIYL DYNEGVNTFT
RRPEMLDGIS YMRLANEALT TRNQNPKYSE EYIQNTISGK DPLLYPNVDW MDAVFNKYGH
TRSTNLNASG GVENAQYYVS LGYYNESGFL KTDDLAKYNS SLKYNRYNFT SNLNLRVTKT
TKLDVGLQGY FSNGNYPGIS SGDIFQSAMD AAPVAYPIMY PGGFVPGQSS NGGFRNPYAD
LTRRGYTNEF RNQLYSNIRA TQDMDALTRG LKASVMFAFD SYNQNNIIRS KREDTYYPDQ
TNPYKPDGSL NLVKTYTGNQ YLGFDNSPGS RQTSRKFYTE ASLNYDRSFG KHRVGGLALF
YSSDRTNALA GDFISSIPER SLGLAGRVTY SYSDKYFVEL NAGYNGSELF APGNRFGFFP
AVGIGWIASE EKFFEPLKNA INFLKFRYSN GNTGLGSAGD RFLYITNLNT YNDAYKYGQV
PQFVGGINID RYATNVKWSV SNKQDLGIEF RTLNDQLSVI VDLFKEHRTG IFLQRASSVD
FMGLQNQPYA NLGIVDNKGF DATLEYSTRF GAVDLKLRGN ITYTKDKLIE DDRPPQNYPW
MEHRGNNVLA RYGYIAEGLF GSEDEVNKSA VPGDRSKVKP GDIKYKDLNG DGLINNYDVT
KIGRGDVPST VYGFGFDLGY KGFSFGLLFQ GISDADRMLR GSGIVPFNGG GGVTNAYAIA
TDRWTVDNPN PNAFYPRLAY GESENINNTQ ASSWWIKDVS FVRLKSAQLA YNFPAAMMGR
TGIRAASVYL QGINLLTFSK FKLWDPELNT DNGSAYPNIR TISLGMNLKF
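
The protein sequence can be cross-checked against the gene sequence by translating the DNA codon by codon. The sketch below is an illustration, not the database's own procedure. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in TCAG order, shared by the standard table and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string from its first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGCCAAAAC TACGTATGAA AAAAATTGCC"))  # MPKLRMKKIA
```

The output matches the first ten residues of the protein sequence above.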