Gene Cpin_5117 details

Gene Information

Locus tag: Cpin_5117
Symbol:
ID: 8361293
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 6388702
End bp: 6392100
Gene Length: 3399 bp
Protein Length: 1132 aa
Translation table: 11
GC content: 49%
IMG OID: 644967265
Product: Carbohydrate binding family 6
Protein accession: YP_003124750
Protein GI: 256424097
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
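
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end coordinates are inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 6388702
end_bp = 6392100
gene_length_bp = 3399
protein_length_aa = 1132

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: one terminal stop codon that is not part of the protein.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

consistent = (
    span_bp == gene_length_bp
    and gene_length_bp % 3 == 0
    and coding_codons == protein_length_aa
)
print(span_bp, total_codons, coding_codons, consistent)
```

Under these assumptions the fields agree: 6392100 - 6388702 + 1 = 3399 bp, which is 1133 codons, or 1132 amino acids once the stop codon is excluded.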


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.17685
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAA ACCTGCTGCT AACATTTCAT AAGCGGTTCT TTACGGCCTG CTTTGGAATC 
GCCCTGCTCG CAGCGAGCAC AGACCGTGCC TACAGCCAGG TGCCTACTGG CTTTACGCTG
AAAAAACTAA CCGACAATTC CATCGTTGAA GCTACTGCAA TGGCACATTC TGCTGATGGC
AGAATATTCA TGGCTGAACG TGGAGGTAAG GTGAAAGTTT ATCAGAATGG AACCGTATCT
ACCGTGTATA CCGCTCAAAC AGTAACCGAT GCAGAACAAG GTTTACTGGG TATCACACTT
CACCCTCAAT TCACCAGTAA TGGTCGTTGT TACATTTTCT ACACCAATAG GGAAATGACA
CGTCATTACC TGGACATCTT GTTTATCAAC CAGGCTAACG GTGTCGACTC TGTCAGAAGA
GTAATGGAAT TTGATCAGAT CATCAACGGC GCGCACAATG GTGGCGCGTT GTTATTCCGC
AGAAACCTGC TGTATGTAGC GATCGGCGAG AGCAATGAAG CGATCGAATC TCCGAAGCTG
ACGACTTACC GTGGTAAGAT CCTTCGTCTG ACAGAAGACG GACAACCAGC TCCGGGCAAT
CCTTATTATG ACACCCCTAA CGCCACCCGT CAGCAGAGAA GTATCTGGGC ACGCGGTATG
AGAAATCCGT GGAGGATGTC TCTGGACCCG GTGTCTCAAA GAATCTTTGT GGTGGATGTA
GGCGGTGACT ATGAAGAAAT TAACGATGTT ACCAATCCTG ATCCTGCCAG AGGATATAAC
TATGGCTGGG ACCAGAATCA CAAAACAGGT TATCAGCCGG ATACCACTAC GACGATCCCT
CCGGTTTATT TTTATGACCA CAGTGAAGAT AAAGGCGGAT GTGCTATCAC ATCAGGTGTG
TTCTTCAATC CTCCTGCTAC CAACTATCCG CCACAGTATC TGAATAAGTT CTTCTTCTCA
GACTGGTGTC GTGACTGGTA CAGGTTTGTG GACATCAATG GTCTGAAGCC TTCTTCTCAG
TTTACCGAGT TCTCAGCGAA GAACTTCACC AGGATCCTGG GTACCAGCGT TGGTATAGAT
GGTAACATTT ATTACATCTC TTATGCGGGA GATGGTAGTC TGTATAGAAT TGAATACAAT
AACAACCAGG CGCCATCTAT CGTCAATCAA CCTGAAAATA AAACCGTTAC AGCGGGAGAC
GCGGTGTCTT TCTCCGTTAC TGCTTCCGGT GGTACGCCAT TATCTTTCCA GTGGCAGAAA
AACGGCGTTA ACATTGCCGG CGCTACTGCT GCTACCTACA GCATTGCACA GACCACACAG
GCTGACTCAG GTCAGTACCG TTGTGTAGTG ACCAATCCGA TCAGCACCAT CAACAGTAAT
TCTGCGAAGC TGACCGTATT GCCATTCAAT GCAAGACCAG TTCCGAAGAT CCTTACACCG
GCAGCTACCC TGACCTGGAA CGTAGGTACC ATCGTTAACT ATTCTGCAAC AGCTACCGAT
GCTGAAGACG GTACCTTACC TGATTCCGCT TACACATGGG AGGCACGTTT CTATCATAAA
GACACCCCTA CCAGTGAGCA CTGGCATCCT GGTCCGACCC TGACGAAAGG CGTAAAAACA
GGCAGCTTCA CTGCTGACAA CCTTGGTGAG TCTTCTCCGA ATATCTGGTT CAGACTGATG
CTGACTGTAA AAGATTCTAA CGGCCGTACC GGCGTAGACT CCGTTGACGT TTATCCAAAC
AAAGTACAGG TTACTGCTGC CAGCAATATT CCTGGTATCA GTCTGGTACT GGGTTCAAAA
GAGGTAACAC CGTTCACCAA AACAATGGTG GTAAACTCCC TGACTACACT GCAGGCAGTT
ACCCCACAGT TGCTCGGCGA CACGACCTAC GATTTCGTGT CATGGGCACA TGGCGGCGAT
GCATTACAGT CCATCCGTGT ACCAGCAAAA GATACCGTTT TCCGTGCTAC CTATAAAGCA
GGTGCTTCCC GTCAGAATCC ATACCCTGAC CCGGCCGTAC CTTCCACTAT CCCAGGTAAA
ATCGAGATCG AAAACTTCGA CTATGGTGGC GAAGGCATTG CCTACCATGA CGAAAGCGCT
GCCAATCAGG GTAATCAATA CCGTACCACT GAAGGCGTAG ACCTCGAAAA CTGCGCTGAA
GGTGGTTTCA ATATCGGTTA TGTCAACAAC GGTGAATGGC TGGAATATAC GGCCAACGTA
ACCGTTACCG GTAAATATAC CTTCTCGGCC CGCATCGCTA ATCCGGGTAC TGCCAAGACC
CTCCACGTAG AAATGGATGG CGTTAACATC ACCGGCACCG TAACAGTACC AACAACCGGT
GGTTTCCAGG CATGGCAGAC CGTTTCAACT ACTACCACAC AGTCACTGCC TGCCGGCATA
CATGTGTTCC GTATTGTACT GGAAGCAAAT GACTTCAACG TTAACTACTT CACTTTCGAT
CTGGCAATCG GTAACGCACC AACCGTGAAC ATCACCGCTC CTGTGAATGG CGAAACCTTT
GTCACCAATT CCGATATCCT CCTGAAAGCA AATGCTGCAG ATTCAGACGG ATCTATCAGA
AAAGTGGAAT TCTTCCAGGG TGCTACTAAA ATCGGAGAAG ATACAACTGC TCCTTACCAG
TTCCTCTGGA CAGGCGTTGC CACCGGCGCT TACAGCATCA CTGCAAAAGC AACCGACAAC
ACACAGATGA CAGCTACCTC CACTCCTGTC GCCATCAACG TAACCGCTGC CGCTGTTGAA
AAAACAGTAC CAGGACATAT CGAAGCAGAA AGCTTTGATG CCATGTCCGG CATCCAGACA
GAAGGTTGCG GAGATACCGG CGGTGGTGAA AACATCGGCT GGGTAGATAC CGGCGACTGG
ATGGATTATT TCGTTAACGT TACCGCAGCA GGTAGCTATA ACGCATCCTT CCGCGTAGCC
AGCGCACCAG GTGGTGGTCA GCTGCAGCTG CAGGCAGGCG CTAATATCCT GACTACTGTT
GACGTACCAG CTACCGGCGG ATGGCAGGCA TGGACTACTA TCACTAAAAC AGTCTCCCTG
ACAGCAGGTA AACAGACATT ACGTGTATAT GCGTCACATG CAGACTTCAA CCTGAACTGG
ATTGAATTCG CTGCTACGAC ACAAGCGGCA AGAACATCAG CCGTAGTAGA ACAGAAACCT
TCTATCAGAA TGTATCCAAA CCCGGTTGTG AATCTGCTGA CAGTAGGTAA CGTGAAAGGC
GATGGTCTGT TCACTATTAC CAATGTCGCT ACCTCACAGA CGATCATCAT CAAGGCGACC
AATGGTATAC TTGACGTAAG CAACCTGACA CCAGGTGTTT ATGTACTTAA ATTCACCAAT
AATGGAAAAC CTGTAACGAA GAAATTCGTG AAGATGTAG
 
Protein sequence
MKKNLLLTFH KRFFTACFGI ALLAASTDRA YSQVPTGFTL KKLTDNSIVE ATAMAHSADG 
RIFMAERGGK VKVYQNGTVS TVYTAQTVTD AEQGLLGITL HPQFTSNGRC YIFYTNREMT
RHYLDILFIN QANGVDSVRR VMEFDQIING AHNGGALLFR RNLLYVAIGE SNEAIESPKL
TTYRGKILRL TEDGQPAPGN PYYDTPNATR QQRSIWARGM RNPWRMSLDP VSQRIFVVDV
GGDYEEINDV TNPDPARGYN YGWDQNHKTG YQPDTTTTIP PVYFYDHSED KGGCAITSGV
FFNPPATNYP PQYLNKFFFS DWCRDWYRFV DINGLKPSSQ FTEFSAKNFT RILGTSVGID
GNIYYISYAG DGSLYRIEYN NNQAPSIVNQ PENKTVTAGD AVSFSVTASG GTPLSFQWQK
NGVNIAGATA ATYSIAQTTQ ADSGQYRCVV TNPISTINSN SAKLTVLPFN ARPVPKILTP
AATLTWNVGT IVNYSATATD AEDGTLPDSA YTWEARFYHK DTPTSEHWHP GPTLTKGVKT
GSFTADNLGE SSPNIWFRLM LTVKDSNGRT GVDSVDVYPN KVQVTAASNI PGISLVLGSK
EVTPFTKTMV VNSLTTLQAV TPQLLGDTTY DFVSWAHGGD ALQSIRVPAK DTVFRATYKA
GASRQNPYPD PAVPSTIPGK IEIENFDYGG EGIAYHDESA ANQGNQYRTT EGVDLENCAE
GGFNIGYVNN GEWLEYTANV TVTGKYTFSA RIANPGTAKT LHVEMDGVNI TGTVTVPTTG
GFQAWQTVST TTTQSLPAGI HVFRIVLEAN DFNVNYFTFD LAIGNAPTVN ITAPVNGETF
VTNSDILLKA NAADSDGSIR KVEFFQGATK IGEDTTAPYQ FLWTGVATGA YSITAKATDN
TQMTATSTPV AINVTAAAVE KTVPGHIEAE SFDAMSGIQT EGCGDTGGGE NIGWVDTGDW
MDYFVNVTAA GSYNASFRVA SAPGGGQLQL QAGANILTTV DVPATGGWQA WTTITKTVSL
TAGKQTLRVY ASHADFNLNW IEFAATTQAA RTSAVVEQKP SIRMYPNPVV NLLTVGNVKG
DGLFTITNVA TSQTIIIKAT NGILDVSNLT PGVYVLKFTN NGKPVTKKFV KM
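
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the two relate, the Python below strips that layout, computes a GC fraction, and translates the DNA codon by codon. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical and written for this example. The codon table is the standard one; translation table 11 is assumed here to use the same codon-to-amino-acid assignments and to differ only in which start codons it permits.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 shares these assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    seq = clean_sequence(dna)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(dna):
    """Translate complete codons, stopping at the first stop codon."""
    seq = clean_sequence(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGAAGAAAA ACCTGCTGCT AACATTTCAT AAGCGGTTCT TTACGGCCTG CTTTGGAATC"
print(translate(first_line))
print(round(gc_fraction(first_line), 2))
```

Translating the first line reproduces the first twenty residues of the protein sequence (MKKNLLLTFH KRFFTACFGI). Running `gc_fraction` on the full gene sequence gives a value that can be compared with the GC content reported in the record.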