Gene Cpin_5064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_5064 
Symbol 
ID8361240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp6308873 
End bp6310141 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content49% 
IMG OID644967212 
ProductNHL repeat-containing protein 
Protein accessionYP_003124697 
Protein GI256424044 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00409782 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.00484678 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGAATAA TTTTCCTGCT GCTCTTCGGG GCCACTGCCT TTGCGCAGAA AAAAAGCCCC 
GAATTGCCCC CTCCGCACGC GACAAAATCC TACATGAAGT ATAGTAACGT AAAAGGGTGG
GAGAACAATG AAAAGCCGGT AGCGCCGGAA GGGTTTATGG TCAACATGTA TGCAGAGGGA
TTGCAGAATC CCAGATGGCT ATACGAACTG CCGAATGGAG ATCTGCTGGT GGCAGAGTCC
AATTCTCATT ATAAGTTCTT TAAGAAGATA GGGGCTGCGA TTGTGGGTGC TACGCGATCG
AATAGTATGA AGAAAAGTGC CGACAGGATC ACCTTACTGA GAGATGCGGA TCATGACGGT
ATCCCCGAGA CGAAGACCGT TTTCCTGGAA GGACTGAATC AGCCATTTGG AATGTTGCTG
GTAGGCGACC AGTTTTATGT GGCGAATACG GATGGATTGC TGCGGTTTCC ATATATGAAA
GATGCCACGA GTATTACGAC GACAGGAGAG AAGATTGCCG CTTTTCCTGC GGGAAAGGTC
AATCAGCACT GGACGCGGAA TATCATTGCC AATAAAGACA AGACGAAATT TTATGTAGCT
GTGGGCTGTG GTACGGATCA TGGTGAAAAG GGAATGGATA AGGAGGCTTT ACGCGCAAAT
ATCCTTGAAA TGAATCCGGA TGGTAGTGGT ATGCGGGTGT ATGCATCCGG ATTGAGGAAT
CCCGTAGGCA TGGATTGGGC GCCCGGTACC AATACGCTCT GGGTCGCTGT GAATGAAAGA
GATAAGCTGG GTAATGACCT GGTGCCCGAC TACATTACGG GTGTGAAGGA AGGTGCTTTC
TATGGATGGC CGTATAGTTA TTTCGGACAA CATCCGGATC CCCGTGTACC CGAGGCACCT
GCCGGGCTGA TTGAAAGTGC GATCGTGCCT GATTTCCCAT TAAATGCACA TACGGCCTCC
CTGGGATTGG CATTTTATAC CGGCAATGCT TTCCCGGAAA AGTATAAAAA TGGGGCGTTT
GTTACACAAC ATGGTTCCTG GAACCGTAAA CCGGTGTCGG GCTATAAAGT GCTGTTTGTA
CCGTTTAAAA ATGGCAAACC TACAGGACCA CAGGAAGACT TCCTGACCGG CTTTGTAAAA
GACAGCGTAA AGGGTGTGGT GCGGGGCAGA CCGGTAGGTA TTACCGTCTC CCAGACCGGA
GCGATGTTTA TTACAGACGA CCGTACCAAC AGGATCTGGC GTATCAGCTA TGCCGCTCCT
AAACTCTGA
 
Protein sequence
MRIIFLLLFG ATAFAQKKSP ELPPPHATKS YMKYSNVKGW ENNEKPVAPE GFMVNMYAEG 
LQNPRWLYEL PNGDLLVAES NSHYKFFKKI GAAIVGATRS NSMKKSADRI TLLRDADHDG
IPETKTVFLE GLNQPFGMLL VGDQFYVANT DGLLRFPYMK DATSITTTGE KIAAFPAGKV
NQHWTRNIIA NKDKTKFYVA VGCGTDHGEK GMDKEALRAN ILEMNPDGSG MRVYASGLRN
PVGMDWAPGT NTLWVAVNER DKLGNDLVPD YITGVKEGAF YGWPYSYFGQ HPDPRVPEAP
AGLIESAIVP DFPLNAHTAS LGLAFYTGNA FPEKYKNGAF VTQHGSWNRK PVSGYKVLFV
PFKNGKPTGP QEDFLTGFVK DSVKGVVRGR PVGITVSQTG AMFITDDRTN RIWRISYAAP
KL