Gene Cpin_4849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_4849 
Symbol 
ID8361025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp6061233 
End bp6064391 
Gene Length3159 bp 
Protein Length1052 aa 
Translation table11 
GC content49% 
IMG OID644966999 
ProductTonB-dependent receptor plug 
Protein accessionYP_003124484 
Protein GI256423831 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.626865 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.00508751 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTCCGG CCCCCGGCAA ACAGAAAAGT GCGCTTCCGG TTTGGCTGAC CACAATTGTG 
GCCATGTTGT TGCTGACCGG CGCCTATGCC CAGAACAACC CACCAGTCAG AGGTAAGGTA
CTCGACGAGA CAGGGAAGCC CCTGTCTGGT GTTACCGTCG CTATTAAAAA CACTTCCAGA
GGTACTGTTA CCGGTACTGA TGGTAGTTAC AGTATCAGTG CTGCCGCAGG CGACGTACTG
TTATTCAGTT TTGTGGGGTA TACCACCGCA CAGGTAACGG TGGGCAGTGG CGCCAAGTAC
GAGATCAGCC TTGCTCCGAA TTCACAATCA CTGGAGCAGA TGGTGGTGAT CGGTTATGGT
GCACAGAAGA AAAGTTCCCT GACGGGTTCT GTGGCTTCTG TCAACAGTAA GACCATTGCC
GAATTGCCGG TAGTCAGCGT ACAACAGGCC CTGCAAGGTC GTGTGGCTGG TCTGACCGTT
ACTAATAACG GTACGCCTGG TTCCGATCCG ATCATCCGTA TCAGGGGTAT CAGTTCTATC
AGCTATGCTT CTGACCCGCT GTATGTGATA GACGGATTTC CGACTTCTAA CCTGGCCAGT
TTCGACAGTA GAGATGTGCA ATCGGTAGAA GTGCTGAAGG ATGCGAGTGC GGCAGCTATT
TATGGTTCCC GTGCCACCAA TGGCGTTATC ATCATTACCA CCAAGAAAGG TTCCCGGGAT
GGCAAACCGC ATGTCAACTT TGACTCTTAC GTGGGCGTAC AATCTGCCTG GAAGACCGTC
GACCTGCTTA ATACGCAACA ATACCTGCAA TATGAGCGTG CCTTGAATGG TGCAGCGGGT
ATTGCCAAAC CTCCGCGTCT GGAAGATGCT AATTTCAATC AACCGCTGTA TGACGGTACT
TCCCAGAGCT TTGCACAGAC AAATACAGAT TGGCAGGATG CTTATTTCAG AAAAGCACTG
ATCACACAGT CGAATGTATC TGTGAGCGGC GGTAATGAAG CCTCCCGTTA TTATATGTCC
GCCGGACACC TGAAGCAGGA TGGTATCGCG CAGGGTGTTA ATTATGAGCG TGGTAATTTC
CGTATCAACT CAGAACATAA TATCAGTAAG GCGTTTACCG TAGGAGAAAA CCTATTGCTG
TCTTATTCCA AACAGCGCTA TGACAATACT TCCGGTAACA GGACCCGCCT GGCTAATATT
ATCAGGGCAT TGCCCTATCT GCCGGTATAT GATCCGACGA CTAATGGCGG GTTCCGGAAT
GCGGAGAACA GTGTGGATGG TGCAGACCCT ACCAATCCGG TGGAAGATGC AATTCTTCTG
GGTAATGCAC ACCGCCAGGT ATTCAAGTTA CTGGGTACTG TGTATGCGCA GGTGAATCTT
ACGACCTGGC TGAATTTCCG CTCTACATTC GGGGCGGATT ATGTTTCGAA CTATCAGCAT
GAATTCCTGC CTATTTACAA TGACAAGGGT AGGAATGCGT CTGTCGCGAC GATCAATGAC
CAGCGCTCCA ACAGAACAAC TTTGTTATAT ACACAACAGC TGACTTTTGA CAAGACATTT
GGTCAACACC ATATCAATGC GGTGGCTGTA TATGAGAGGC AACAGGCGGA TAATTTCGGG
GAGACACAAT CCGGCAACCA GAGTACTAAT GACCGGGAAA CATTTGTGGG TGCTACGAAT
GTGACGGCTT TTTCCTCCCG TACGGCTACT TTGATACAGT CTTATATTGG TCGTGTCAGT
TATGATTTTG CCGGCAAATA TTTGCTTAGT GGCGCTATCC GTCGTGATGG ACTTTCCGTA
TGGGCGCCCG GTAAAAAGTT CCAGAGTTTC CCATCTGTAT CAGCAGGTTG GAAGATCGAC
CAGGAGCCTT TCCTGCAACC GGTGACTGCT CTCTCAGAAT TGAAGCTTCG TGGTGGATGG
GGTGTCACAG GTCTGAATGC AATCGGCGTC TTTCCTGCAT TGCAGAATTC CATTCTTTCC
AACGAGTATC CCTGGCAGGC GGTAGTACAA GCCAATGGTG CGAGTTATCC TTTTGGTAAT
ACCATCACAG TCGGTAATGC TTCTTATTAC AATCAGCTGG CCAGCAGCGG ACTGGAATGG
GAGAAAACCA AACAGTTGAA TATCGGACTG GACCTGGGTT TGTTTAATAA CCGTATCACT
TTTACTGCTG AATGGTACCG GAGACAGACA GATAACCTGA TCCTGACAAT TCCGACGCCT
TATAGCTTTG GTTTCGGGGG AACTGGTTCG GAATTGAATG CGGCTTCTAT GAGGAATAAC
GGCGTGGATC TGCAACTGGG CTATAATAAA ACCGGTGGCA ACTTTACCTG GAATCTGAGC
GGTAATATCG GGTTTATCAA GAACAGGATT CTTAGTCTGA ATACCCCCGG TGCTACGATC
GACGCCGGTG CAGATGCGGA CTTTGGCAAC GGCAACATGA CCCGTACAGT AGCCGGACAG
GCGATCCAGT CTTTTTATGG TTACGTGGTA GATGGCATTT TCCAGAGTCA GGACGAAGTG
AATAAGAGCC CGGTGCAGAT TGAAGGGTCT GATCCGGCAA AATCTACTGC TGCCGGTGAT
ATCAAATTCA AAGACCTGAA TGGAGATGGT AAGATCACTG CGGATGACCG TACCTTCCTC
GGTACCTATA TTCCTAAGTT CACTTATGCG CTGAACTACA GCGGTAGCTA TAAAAGCTTT
GATCTCTCCT TATTCTTCCA GGGAGTGCAG GGTAACAAAA TCTTCAATGG TACCCGTGTA
TTGCGTGAAG GTATGGCCCG TCTGTTTGGT GCAGGTGTGG AAGTACTGGA TGCGTGGACA
CCGGGCAATA CCAATACAGA TATCCCGAGG GCAGTCAGTG GTGATCCTAA CCAGAATGCC
CGCGTATCAG ACCGCTGGAT CGAGAATGGC TCTTATCTGC GTCTGAAGAA CGTTATTCTT
GGTTACTCAT TGCCGGCATC TGCGTTACGT ACGCTTACCC ATGGGGCAGT CAGCAATTTC
AGGGTGTATG TTTCCTCTCA GAACCTGCTG ACTTTTACCG GTTACAAGGG ATGGGATCCG
GAGATCGGTT CCAAGAATAC GACACTTACC AATGGTGTTG ATTATGGTCA GTATCCTTCC
GCAAGGTCAT TCCAGTTTGG CTTGCAGGTA GGTTTCTAA
 
Protein sequence
MRPAPGKQKS ALPVWLTTIV AMLLLTGAYA QNNPPVRGKV LDETGKPLSG VTVAIKNTSR 
GTVTGTDGSY SISAAAGDVL LFSFVGYTTA QVTVGSGAKY EISLAPNSQS LEQMVVIGYG
AQKKSSLTGS VASVNSKTIA ELPVVSVQQA LQGRVAGLTV TNNGTPGSDP IIRIRGISSI
SYASDPLYVI DGFPTSNLAS FDSRDVQSVE VLKDASAAAI YGSRATNGVI IITTKKGSRD
GKPHVNFDSY VGVQSAWKTV DLLNTQQYLQ YERALNGAAG IAKPPRLEDA NFNQPLYDGT
SQSFAQTNTD WQDAYFRKAL ITQSNVSVSG GNEASRYYMS AGHLKQDGIA QGVNYERGNF
RINSEHNISK AFTVGENLLL SYSKQRYDNT SGNRTRLANI IRALPYLPVY DPTTNGGFRN
AENSVDGADP TNPVEDAILL GNAHRQVFKL LGTVYAQVNL TTWLNFRSTF GADYVSNYQH
EFLPIYNDKG RNASVATIND QRSNRTTLLY TQQLTFDKTF GQHHINAVAV YERQQADNFG
ETQSGNQSTN DRETFVGATN VTAFSSRTAT LIQSYIGRVS YDFAGKYLLS GAIRRDGLSV
WAPGKKFQSF PSVSAGWKID QEPFLQPVTA LSELKLRGGW GVTGLNAIGV FPALQNSILS
NEYPWQAVVQ ANGASYPFGN TITVGNASYY NQLASSGLEW EKTKQLNIGL DLGLFNNRIT
FTAEWYRRQT DNLILTIPTP YSFGFGGTGS ELNAASMRNN GVDLQLGYNK TGGNFTWNLS
GNIGFIKNRI LSLNTPGATI DAGADADFGN GNMTRTVAGQ AIQSFYGYVV DGIFQSQDEV
NKSPVQIEGS DPAKSTAAGD IKFKDLNGDG KITADDRTFL GTYIPKFTYA LNYSGSYKSF
DLSLFFQGVQ GNKIFNGTRV LREGMARLFG AGVEVLDAWT PGNTNTDIPR AVSGDPNQNA
RVSDRWIENG SYLRLKNVIL GYSLPASALR TLTHGAVSNF RVYVSSQNLL TFTGYKGWDP
EIGSKNTTLT NGVDYGQYPS ARSFQFGLQV GF