Gene Cpin_5047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_5047 
Symbol 
ID8361223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp6294353 
End bp6295510 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content48% 
IMG OID644967196 
ProductAlcohol dehydrogenase GroES domain protein 
Protein accessionYP_003124681 
Protein GI256424028 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.557083 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.00205277 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGGCAG CTGTGTTCCA TAAAATCGGA GACATAAGTG TAGACAACGT GGAAGATCCG 
CGTATTGAAC ACCCGGAAGA CATCATTGTG AAAGTGACCT CAACTGCCAT CTGTGGCTCG
GACCTACATA TCTATGACGG GTTCTTTCCC CAGCTGAAAG ACCAGGTGAT GGGACATGAA
TTCATGGGTA TAGTAGAAGA TGCCGGATCA GGTGTCAGTA AATTGAAAAG AGGCGACCGG
GTTGTTGTTC CCTTCCCCAT TGCCTGTGGC CATTGTTATT TTTGCGATCA TCAGCTACCC
GTTCACTGCG AAAACAGTAA CAAAGAAAAC TATGGTCCGC AGGGAGATAT GACTTCCGGA
AAAGGCGGCG GACTATTTGG CTATACAGAT CTTTATGGCG CCTATGCCGG CGGACAGGCA
GAATATGCGC GTGTTCCCTT TGCTAATTTT GGTCCGCGGG TCGTACCGGA CAATCTTACA
GATGAACAGG CCCTCTTCCT TACGGACATC TTTCCTACCG GCTGGTCAGC TATTGACTGG
GCACAGCTAA GAGGAGGAGA AACAGTCGCG ATCTTCGGTG CAGGTCCTGT GGGATTGATG
GCACAGAAAG CCGCCTGGAT ACAGGGCGCT GGTCGGGTGA TCGCAATTGA TCCCGTGCAG
TACAGACTGG ACATGGCCAA AAACACCAAT GGCGTTGAGG TTATCAACTC TTCTGAATCA
GATCCTGTTC AGGCAATATA TGATCTCACA CATGGAAGGG GCGCAGATGT ATGTGTGGAT
GCTGTAGGCA TGGAAGCAGA CAGGAGCTTC CTTGAAAAAG TAAAGGCAGT GATCAATGTA
GAGAAAGGTA CGGCTAAAGT GCTTGAAAAC TGCTTCAAAG CAGTACGGCG TGGAGGCACC
GTGACAGTAG TAGGCGTCTA TGGCAGTCCA TACGATAATT TCCCTGTACA TCGCATTTTC
GACAAAGGGA TTACCATCAA AACCGGACAG GCGCCTGTGC AAAAGTATAT CGATCATCTG
ATGGAACTCG TGTCGTCAGG AAAGGTAACA CTGCATGATA TCATTACACA TAAGCTGCCA
TTATCTGCGG CAAGCAATGC ATACGATATT TTCAAGAAGA AAGAAGACGG TTGTGTGAAG
GTTGTATTAA AACCGTAA
 
Protein sequence
MKAAVFHKIG DISVDNVEDP RIEHPEDIIV KVTSTAICGS DLHIYDGFFP QLKDQVMGHE 
FMGIVEDAGS GVSKLKRGDR VVVPFPIACG HCYFCDHQLP VHCENSNKEN YGPQGDMTSG
KGGGLFGYTD LYGAYAGGQA EYARVPFANF GPRVVPDNLT DEQALFLTDI FPTGWSAIDW
AQLRGGETVA IFGAGPVGLM AQKAAWIQGA GRVIAIDPVQ YRLDMAKNTN GVEVINSSES
DPVQAIYDLT HGRGADVCVD AVGMEADRSF LEKVKAVINV EKGTAKVLEN CFKAVRRGGT
VTVVGVYGSP YDNFPVHRIF DKGITIKTGQ APVQKYIDHL MELVSSGKVT LHDIITHKLP
LSAASNAYDI FKKKEDGCVK VVLKP