Gene Cpin_0454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_0454 
Symbol 
ID8356560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp539509 
End bp542529 
Gene Length3021 bp 
Protein Length1006 aa 
Translation table11 
GC content45% 
IMG OID644962600 
ProductTonB-dependent receptor plug 
Protein accessionYP_003120153 
Protein GI256419500 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0605614 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACACA CTTTACTCCT ATGGCTATTC GTGGCCATCA GCGGGATGAC CGCGTATGCC 
CAAACACGAA CGATTAAAGG CAAGGTAACT GACTCAAAAG ATGGCGCAGC CGTACCCTAT
GCTACTGTCA GGATCCAGGG GACCAACAAA GGAACTGCAA CTGATCAAAG CGGAAATTTC
AGCATCGATA TTTCCGGTAC ACAGACATTG GTTATTTCAA GTGTTGGGTT CACCTCACAA
TCCGTGAAAC CTGATAACAG TAATACGGTG AATGTATCAT TGCTGGCAGA TAATACACTT
ACCGAATACA TTGCCACCGG CTATGTAAAT ACCAACAGGG TAAGAAAGGT AAGTGCGGTC
GCTGAAGTCA CGGCGGAAAA GCTGGGAAAC ACCCCGCTCG TTGACATCAA CCAGGCTTTA
CAGGGACAAG CTGCCGGCGT ATTCGTTGGC GGTGCTTCTG GTCAGCCGGG CTCCGTACAG
AATGTACGCA TCCGTGGGGT AGGTTCCATC AGTGCCAGCG CCGCGCCGCT TTATGTAATC
GACGGAATCA TCGTGGATGG CCGTGATGTG AACAATACCG GCAACCTTGC ACTGCAATCC
AATGACCTCC TGGCCAACCT GAACCCTAAC GATGTTGAAA GCATCAATAT CCTGAAAGAT
GCATCTGCTA CAGCGATGTA CGGTTCCAGA GGCGCCAACG GCGTGATCGT TATTAATACC
AAAAAAGGAA AAGCAGGTGT TACTACTTTC GGCGCCCGTG CACAATATGG TTCGGCAAAA
CCAAGCTTTG GTAAATCCTC ACTGCTGACA CCTGCCGAAA GCTATGCCTA TAATCGCGAT
GTACTGGCAT TGAATGATTT TACACCGGCT GAAATTGATG AAGAGTTTCC TGCTTCATTA
CTGGCTACAG GCTTTAACTG GCGCGACGCC GGTTTCCGTA CCGCTAAAAT GCAGGATTAT
GGTATCTCTG CTTCCGGTGG TAACGAGAAA ACAAAATTGT ACATTTCAGC AGGTTATAAT
GACCAGGAGG GAACACTGAT CAACTCCGGT CTGAAACGAT ATACCGTCAT TTCAAACGTA
TCCCAGAAAG TAAATGACAG ACTGGATATT GCGATGAACC TGAACCTTTC ACAAGGCGAC
GCCAGCAGTG CAATGGGGGG TAACTTTTAT TCCAGTCCGC TGTTGGGCGC TTATTTCGTT
TCTCCTTTTC AAAGCCCTTA TAAAGCCGAC GGTTCACTGT ATACTGGTCT TGAATCTGAT
TTTAATGCTG CCAGTGGAGA TAACTTCCTG TACTCTGTTT ACAGAAATGA CAAGAAGTTG
TCCAACTTCC GTGGTCTTGG TGGTGCAACG GTTTCATACC GCATATTCGA CTGGCTGAAA
ATACAGGAGA GAGTCAATCT TGATATGGTG AATACAGAAG CGAATCTGTT CTATGATCCT
ACTACCGGTG ATGGCTACAA TGCCGCCGAT CCGCTGAAAA GCGGAAGTGT ATACAACCAG
AATGTAAAAG TCTCTACAGT AACCAATCAG TTTTCTCTGA GCGGTAATTT CAATATCGGT
GAAGAACATC AGCTGGATTA TCTTGCACTG ACGGAATACA ACCGTTTTAA GTCAAGATCA
TTCAGTGCTG AAGGTATTGG TATTATCGGC AGTCAGCTGA AAGTACTTGA TATTACTGCT
ACTCCGCAAA CAGTAGGTGG TAATGCTACT GAGTATACGT TCCTGTCTTA TATGGGACAA
TTGAATTATT CCTTCAGACA AAAATATAAC CTGAGTCTTG GTGTCAGAAC AGATGGTTCT
TCACGTTTCG GGGTGAATAC CCGCTATGGT ACTTTCTTCT CAGTAGGTGC TTCGTGGAGA
TTGATAGAAG AAGAATTTAT GAAATCGCAG GAGCTGTTTT CTGATCTGAA ATTACGTGTT
AGTTATGGTC AGACAGGTAA TGCGGACTTT GGCAGTATCG ACAACTTCGT AGCCCGTGCA
CTGTATGACT ATGGCACCTC ATATAACGGT GCGCCTGGTA ATGCTCCTAA CACGATTGGT
AACGTCGATC TGACATGGGA AAAGAATAAG TCTTTCAATA TAGGTCTGGA CCTCGCTTTA
CTGAAAGGCG CTATCACTGC AACGATCGAT GTATACAAAC GTAAAACAGA TGGGTTGCTG
CTCAATACAC CGGTGTCATC TACCAGTGGT TTTACTACAC AGATGACTAA CATCGGCTCA
TTAGAAAATA AGGGTATTGA AGCATTGATC TCCACTAAAA ATTTTGATAA CAGAGGTGGG
TTTAGCTGGC GTACAGAACT CAATATCGGT ATGAACAGGA ATAAGATCAC ATCGCTGTTC
ATGGGCCGTG ATGTGGCTGG TGGAAACAGT ACACAGCTGC ACAGAGTAGG TCAGCCTGTA
CAAAGCTGGT ACCTGAATGA ATGGGCCGGT GTAGATCCTG AAAATGGTGA TCCTTTGTGG
TATACTGCTG ATGGAAAAAC AACCAACAAC ATCAACCTTG CAGAAAGAAG AATTGTGGGT
AATAGTCAGC CTAAATTCAC TGGTGGACTC ACCAACACTT TTGAATACAA AGGTATCGGT
GTTTCTGTAT TCTTCTATGC TGTAACAGGT AACAAAATTC TTAACAGAAC AAGAATACTG
GGTGATGCTG ATGGTGCTTA CTTCGGATAC GGATATGATA AGCTGACTGC TGAAAATTAC
TGGCGTAAAC CTGGTGATAT TGCAGAGAGA CCTAAACCTA TTCCAGGAGG AAATAAAAAT
GCCAACTCCG CATTGTCGAC ACGGTACCTG GAAGATGGAT CTTTCCTGCG TCTCAGAAAT
ATCAGCCTGA GTTATAGTCT GCCTGCAAAA TGGGTCAGAG CTGCCAAATT CACCAGTGTA
AAACTCTATG CACAGGCGGC AAATCTTGCT ACCTGGACCA GCTATACCGG ATGGGATCCT
GAACAGGATA TCAGTGCGAT GGAATTCTTC AGATATCCAC CTTCAAAGAG CATCACTTTT
GGCGCAAACA TTAACTTCTA A
 
Protein sequence
MKHTLLLWLF VAISGMTAYA QTRTIKGKVT DSKDGAAVPY ATVRIQGTNK GTATDQSGNF 
SIDISGTQTL VISSVGFTSQ SVKPDNSNTV NVSLLADNTL TEYIATGYVN TNRVRKVSAV
AEVTAEKLGN TPLVDINQAL QGQAAGVFVG GASGQPGSVQ NVRIRGVGSI SASAAPLYVI
DGIIVDGRDV NNTGNLALQS NDLLANLNPN DVESINILKD ASATAMYGSR GANGVIVINT
KKGKAGVTTF GARAQYGSAK PSFGKSSLLT PAESYAYNRD VLALNDFTPA EIDEEFPASL
LATGFNWRDA GFRTAKMQDY GISASGGNEK TKLYISAGYN DQEGTLINSG LKRYTVISNV
SQKVNDRLDI AMNLNLSQGD ASSAMGGNFY SSPLLGAYFV SPFQSPYKAD GSLYTGLESD
FNAASGDNFL YSVYRNDKKL SNFRGLGGAT VSYRIFDWLK IQERVNLDMV NTEANLFYDP
TTGDGYNAAD PLKSGSVYNQ NVKVSTVTNQ FSLSGNFNIG EEHQLDYLAL TEYNRFKSRS
FSAEGIGIIG SQLKVLDITA TPQTVGGNAT EYTFLSYMGQ LNYSFRQKYN LSLGVRTDGS
SRFGVNTRYG TFFSVGASWR LIEEEFMKSQ ELFSDLKLRV SYGQTGNADF GSIDNFVARA
LYDYGTSYNG APGNAPNTIG NVDLTWEKNK SFNIGLDLAL LKGAITATID VYKRKTDGLL
LNTPVSSTSG FTTQMTNIGS LENKGIEALI STKNFDNRGG FSWRTELNIG MNRNKITSLF
MGRDVAGGNS TQLHRVGQPV QSWYLNEWAG VDPENGDPLW YTADGKTTNN INLAERRIVG
NSQPKFTGGL TNTFEYKGIG VSVFFYAVTG NKILNRTRIL GDADGAYFGY GYDKLTAENY
WRKPGDIAER PKPIPGGNKN ANSALSTRYL EDGSFLRLRN ISLSYSLPAK WVRAAKFTSV
KLYAQAANLA TWTSYTGWDP EQDISAMEFF RYPPSKSITF GANINF