Gene Cpin_5079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_5079 
Symbol 
ID8361255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp6323396 
End bp6325105 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content48% 
IMG OID644967227 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_003124712 
Protein GI256424059 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.623805 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAATA TTAATACGAG TGCTCAGGAC CGCACATTTG ATGCGATTGT GATAGGATCA 
GGTATCAGTG GAGGATGGGC AGCAAAAGAG TTCACGGAGA AAGGTTTGAA AACGCTGGTA
CTGGAAAGAG GCCGCGACGT AAAGCACCTG AAAGACTATC CCACAACAAA CAAATATCCC
TGGGAATTTC CACACGGCGG ACAAATACCG GAAGCGATCA AGGAAGAAGC GCCTGTCGTA
AGCCGTTGTT ATGCCTTTAA AGAGGATGCG ATGCATTTCT TTGTAAAGGA TAAAGAACAT
CCTTATGAGC AGGATAAACC ATTTGACTGG ATCCGTGGAT ACCAGGTCGG TGGTAAATCA
TTGCTCTGGG CAAGACAGAC ACAACGCTGG AGTGACTATG ACTTCGAAGG GCCTGCAAGG
GACGGCTTCG CAGTAGACTG GCCGATCCGC TATGCAGATA TCGCTCCCTG GTATAGCTAC
GTAGAGAAAT TTGTAGGTAT TTCAGGTAAT AAGGATGGTA TCGATAACCT CCCTGATGGC
GAATTCCTGC CACCTATGGA GCTGACTGCC GTTGAGCAAT ACTTTCAAAA GTTTGTAAAA
GACAATTATA AAGATCGTCA CGTTATATAT GGCCGCTGCG CGCATCTTTC CGAGCCGCAG
CAAATCCACA TCGAACAGGG TAGGGTACAA TGCCAGAAGA GAAATCTTTG TCAGCGTGGA
TGTCCCTTCG GCGGATATTT CAGCAGTAAC TCCTCTACAC TGCCATGGGC AGAAAAAACG
GGTAATCTGA CACTCCGCCC GCATTCCGTT GTACACTCTG TTATCTACGA TGAAAAGAAA
GGTAAGGCAA CCGGCGTGCG TGTGGTAGAC GCGAAAACCA AGGAAATGAC CGAGTACTAT
GCCCGTGTCA TCTTCGTAAA CGCATCTGCG ATCAACTCAA ACCTGATCCT GCTCAATTCC
ACTTCCAGCC GTTTCCCGAA TGGATTGGGT AATGACAGCG GGGTATTGGG TAAATACTTC
GCTTTCCATA ACTACCGCGC TACCATTTAC GCAGACCATG ATGGTCATAT GGACGTTACC
ACCGATGGCC GTCGTCCTAC CAGCGCATAC ATTCCCCGCT TCCGGAACGT GAAGAAACAG
GAGACAGACT TCCTGCGTGG ATATGCTGCC GGATTTGATA CTGGCCGCCG TAAATGGAAT
AGCCATGATG GTATTGGTAA GAGCCTGAAG GATAATCTGT TCAACGAAGA AATGGGTAAC
TGGTACGTAG GATCTCATAT GATGGGTGAA ACCATCCCGA AAGAGATCAG TCAGCTGACC
CTCGATAAAG ATAAAAAGGA TGAATGGGGG ATGCCTGTTA TCCATGTCAA TATCGGCTAC
GATGATAACG ATGAGAAGAT GGTGAAGGAC TTCCATGAGC AGATGACTGA AATGTACACC
AAAGCCGGTT TCACCAATAT CCGTACAGGC GATTCCAAAC AGGCGCCAGG GCTCGATATT
CATGAAATGG GTGGTGCGCG CATGGGGAAA GATCCGAAGA CATCCGTACT CAATAAATGG
AATCAGTTGC ACGACGTGAA TAACGTATTT GTGACCGATG GAGCTTGTAT GACTTCTACT
TCTACGCAGA ACCCATCACT GACTTATATG GCACTTACTG CCAGGGCAGT GGACTATGCG
GTTAGCCAGA TGAAGAAAGG TGAAATATAA
 
Protein sequence
MANINTSAQD RTFDAIVIGS GISGGWAAKE FTEKGLKTLV LERGRDVKHL KDYPTTNKYP 
WEFPHGGQIP EAIKEEAPVV SRCYAFKEDA MHFFVKDKEH PYEQDKPFDW IRGYQVGGKS
LLWARQTQRW SDYDFEGPAR DGFAVDWPIR YADIAPWYSY VEKFVGISGN KDGIDNLPDG
EFLPPMELTA VEQYFQKFVK DNYKDRHVIY GRCAHLSEPQ QIHIEQGRVQ CQKRNLCQRG
CPFGGYFSSN SSTLPWAEKT GNLTLRPHSV VHSVIYDEKK GKATGVRVVD AKTKEMTEYY
ARVIFVNASA INSNLILLNS TSSRFPNGLG NDSGVLGKYF AFHNYRATIY ADHDGHMDVT
TDGRRPTSAY IPRFRNVKKQ ETDFLRGYAA GFDTGRRKWN SHDGIGKSLK DNLFNEEMGN
WYVGSHMMGE TIPKEISQLT LDKDKKDEWG MPVIHVNIGY DDNDEKMVKD FHEQMTEMYT
KAGFTNIRTG DSKQAPGLDI HEMGGARMGK DPKTSVLNKW NQLHDVNNVF VTDGACMTST
STQNPSLTYM ALTARAVDYA VSQMKKGEI