Gene Cpin_1800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_1800 
Symbol 
ID8357951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp2186845 
End bp2190021 
Gene Length3177 bp 
Protein Length1058 aa 
Translation table11 
GC content47% 
IMG OID644963988 
ProductTonB-dependent receptor plug 
Protein accessionYP_003121497 
Protein GI256420844 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGACT TTTTATTCCA CAAGAAACTG TTATTGCCCG GAGCCTTGTT GTTACAAATG 
ATGGCATCCG CACAGCAACT GGCCGTTAAT AATTCCAACG CAACGCCGCC GGCCATTGCT
CAGGCAAAAG AAATTATCCT CAAGGGAAAA GTAATAGGGA CAGAAGAAGG TACCGGCCTG
CCCGGAGTAG TTGTACGTGT AAAAGTCGGC AACAAAGGGA CTACCACTAT GCCTGACGGC
TCCTATACAC TCAAGGTACA CGAGAATTCA ACCATCGTCG TATCCCTGAT CGGATATGTT
ACACAGGAGA TCCCTGTTAA CAAAAAAGAG AATATCACTG TTTATCTGGC CAAGGATGTG
AAAGCCCTGA CAGAAACAGT TATCATTGGT TATGGTACAC AGAAACGCGC CAATGTACTC
GGTGCTGTTG CTGCCGTGAA TGCCAGTGAT ATTGAAGACC TGCCGGTAGC CAATCTGGCA
ACTGCCCTGC AGAACAAAGT ACCTGGTGTG TCAGTGGCGC AATCTTCTGG TCGGCCGGGT
TCCTCTACCA GTCTGACCAT CCGTAATCCG GTGACCTGGG CTGCAACTGG CTCCTCTATT
GATCCATTGT ATGTAATCGA CGGTTTCCAG CTGACAAAAC AGGATTTTGA CAACCTGGAT
GCTACCCAGA TAGAAAGCAT TACTTTCCTG AAAGACGCGG CTGCTTCGAT CTATGGCGCC
CGTGGTGCGA ATGGTGTGGT ACTGGTAAAA ACCAAAATGG GCCGCCCGGG TAAACCCCGT
ATCAGCTATT CAGGTTCGCA TGGTATTTCT TCCGCTACCT ATATTCCGGA GATGCTGACG
GCTCACGATC ATGCTGTGCT GTTGAATAAT AAGTATACTG CGCTGAAAGA CGCGAATACA
AATAAATTCT ATACACCGCA GGAGCTGGAA TATCTGCAAA CGCATAACAA CAACTGGATT
GACAATGTAT GGAAAAGCTC TCACCTGAGC CGTCATACTA TCAATGTCAG CGGTGGTACA
GAAAGAATTA CCTTCTTCGC AGGTGGCAAC TACTATTCAG AAGATGGTAA CGTCGGCGAT
CTGAATGCGA CCAAATATGG TCTGCGTATG GGACTCAATA CCAAGATCAC CGACAACCTG
ACCGCTACGA TCTCGTTTGC AACAGATAAT TCAAAGATCA ATCGTCCTAC GCCCAAAACA
GTGCAGTCCG GCCTGACTGA ACAGTCCGAC CAGATGAGTG CTACCGTAAG CGCCCTTTTG
CTGACACCGG GATGGGTACC GATGTATATC GACGACAGGC CTGTATTTTC ATCCGTACCA
GGATGGCATC CGGGAGAACT GGTAAAAACA GGTAGTTACA GCCGCTCCCG TACACAGGGA
CAGACGATCA ATGCGGCCCT GGAATATAAA CTGCCGGCTG TGGAAGGCTT GTCTTTCCGT
GTGCAATATG CACGTAGTAA CAGGAATAAC TTTGGTAAAG AATTCTATGT GCCTTATTCC
CTGTACAACT TCGTAAAAGA AGGTACACAT GAGACGACAC AGAATGTGAT GTTTACCAAT
CAGCTGACTA CCAACAATCC GACTACCCTG ATCAAGAACG GTAACCTGAT GACGGAATCT
TATGGTAGTT CTTCCAGTTA TCAGCTGAAT GAGGCCATCA ACTATGCCCG CAGCTTCGGT
AAACATGATA TAGCGGTAGT ACTCGTAGCA GAACAGAGCG AATCTTCTTC TGACGCATTT
GATACCCGCC GTGAGCAGGT CGTAATACCA GGTATCGATG AACTGTTTGC TTTCAGCCAG
GACAAAAGTT TTTATGATAA CTCTGGGTCT TCCGGAGAAA CAGGCAGGAT GAGTTATGTC
AGCCGTATCA ATTATGCCTT TAACGACCGG TACCTGCTGG AAGCGACCTT CCGTGCAGAT
GCCTCTCCTA ACTTCCCGAA AAACTCAAGA TGGGGTTATT TCCCTTCTGT AGCGGTTGGT
TGGAGAGTGT CCCAGGAGAA ATTCTTCCAT GATAATGTGA AATTTATCAA CGACCTGAAG
ATCCGTTTCC AGGTTGGTCT TACTGGTAAC GATGCGGTGA AGAACTACCA GTACAAAGAG
CGTTATACAC AAACCACCGG TATGCTGTTT GGTAACACGA TGACCAGCGG TTTGAATAAC
AATGATATTC CTAACAGTAG TATTACCTGG GAGAAAGCCC TGTATAAGAA CCTGGGTTTT
GATGGTAGTT TCTTTGACCG CCGCTTCGAT TTCGCCATTG ACCTCTATCA CCGTTATAAC
TATGACATGC TGATGCAACC AAGCAATACA GTGCCTACCA GTTTTGGCGG TGGTATTGCT
GATCAGAACT ATGGTCGTCT GAAGTCATAT GGTATAGAAG CAGGACTGAC TTACAACGGC
AAAATCGGCA AAGAGTTCAA GTACTACACC ACCGTCAACT TCGGTATCAG TGATAATAAA
GTGATCCGCA AATACTACGG CGCTGGCGAT ACTGCCTGGA GAAACCCGAT CGGTCGTAGA
ACCGACAGCG GACTGGAAGG ATACAAAGCG GTAGGTATGC TGCGCACACA GGCAGATGTC
GATGCATTGC TGGCTAAAAA TCCAGACTGG ACGATTGATG GCGAAAAGCC GATTCCCGGA
TACATGAATT ATGAAGATAT CAATAGCGAT GGTAAGATCA ATGAAATGGA TAAGACCCGC
ATTGCACCAA GAGGTAGCAG TCTCTTTGGT GTAGGCTTCA ACCTGGGCGC ATCCTGGAGA
TCTTTCAAGT TCAGTGTGAA CATGTCGCTG CAGGTAGGTG GTCAGACGGT ATATGATAAG
ACAGCAAGAA CACCGGCGAA TGAAAACCAG CGTGCACTGG CTATCTGGAA AGATTCCTGG
TCACCGGAGA ATCCAAATGC GAAGTATCCG CTGATCAATG CACCACTGAT CAAAGAAATA
TCCGATATCT GGATGGTAAG CGCAACCACT ATGAGAGTGA ACAACATGAT GTTGTCTTAT
GGTCTTCCGC AGAACCTGGC CTCCCGCTGG AAAATACCTG ACCTGCGTGT GTTTGTGACA
GGTACGAATC TGTGGTCTAT TATCAACCAT CAGTCTTACA AAGATCCGGC TACCAACCTG
GCAGTAGATT ATCCTGCCCT GCGTACCTAC ACTTTTGGCC TGAATGTAAG TCTCTAA
 
Protein sequence
MKDFLFHKKL LLPGALLLQM MASAQQLAVN NSNATPPAIA QAKEIILKGK VIGTEEGTGL 
PGVVVRVKVG NKGTTTMPDG SYTLKVHENS TIVVSLIGYV TQEIPVNKKE NITVYLAKDV
KALTETVIIG YGTQKRANVL GAVAAVNASD IEDLPVANLA TALQNKVPGV SVAQSSGRPG
SSTSLTIRNP VTWAATGSSI DPLYVIDGFQ LTKQDFDNLD ATQIESITFL KDAAASIYGA
RGANGVVLVK TKMGRPGKPR ISYSGSHGIS SATYIPEMLT AHDHAVLLNN KYTALKDANT
NKFYTPQELE YLQTHNNNWI DNVWKSSHLS RHTINVSGGT ERITFFAGGN YYSEDGNVGD
LNATKYGLRM GLNTKITDNL TATISFATDN SKINRPTPKT VQSGLTEQSD QMSATVSALL
LTPGWVPMYI DDRPVFSSVP GWHPGELVKT GSYSRSRTQG QTINAALEYK LPAVEGLSFR
VQYARSNRNN FGKEFYVPYS LYNFVKEGTH ETTQNVMFTN QLTTNNPTTL IKNGNLMTES
YGSSSSYQLN EAINYARSFG KHDIAVVLVA EQSESSSDAF DTRREQVVIP GIDELFAFSQ
DKSFYDNSGS SGETGRMSYV SRINYAFNDR YLLEATFRAD ASPNFPKNSR WGYFPSVAVG
WRVSQEKFFH DNVKFINDLK IRFQVGLTGN DAVKNYQYKE RYTQTTGMLF GNTMTSGLNN
NDIPNSSITW EKALYKNLGF DGSFFDRRFD FAIDLYHRYN YDMLMQPSNT VPTSFGGGIA
DQNYGRLKSY GIEAGLTYNG KIGKEFKYYT TVNFGISDNK VIRKYYGAGD TAWRNPIGRR
TDSGLEGYKA VGMLRTQADV DALLAKNPDW TIDGEKPIPG YMNYEDINSD GKINEMDKTR
IAPRGSSLFG VGFNLGASWR SFKFSVNMSL QVGGQTVYDK TARTPANENQ RALAIWKDSW
SPENPNAKYP LINAPLIKEI SDIWMVSATT MRVNNMMLSY GLPQNLASRW KIPDLRVFVT
GTNLWSIINH QSYKDPATNL AVDYPALRTY TFGLNVSL