Gene Cpin_5213 details

Gene Information

Locus tag: Cpin_5213
Symbol:
ID: 8361390
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 6587759
End bp: 6588823
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 49%
IMG OID: 644967361
Product: Alcohol dehydrogenase zinc-binding domain protein
Protein accession: YP_003124845
Protein GI: 256424192
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID: [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase
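
The coordinates and lengths in this record are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 6587759
end_bp = 6588823

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # matches the "Gene Length" field
print(protein_length, "aa")  # matches the "Protein Length" field
```

Under these assumptions the computed values agree with the reported 1065 bp and 354 aa.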


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000291101
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGACATT GTATGCATGC AGCGGTTAAG ACTGCTGACG GAAAGTTTGA CCTTCAGGAG 
GTGGACACAC CGCACATTGC GCGACCTGAT TGGGTAGTGG CCCGTGTGCG TGTGTCTGGT
ATCTGTGGTA CTGATCTGCG ACACTGGAAG AAAGCGGAAG CGCCACTCAC CGGGAAAATT
ATGGGGCATG AACTGGCAGG AGAAATCGTA GAAATCGGCA GTAACGTAAC CAACGTAAAG
GTAGGCGACA GGGTAGTGAT AGAGACACTG TTAGGTGATG AGACCTGTGA CTGGTGCCGG
GTGCAACAAT ATAATCTATG TCCTCACCTG TATGAGGTGC GCATGAAAAC CTTATCACAG
GCATTTGCAC AGTATGTAGC GGGACCTTCT GCAAAGTTCT ACCGCCTGCC CGATCATGTC
AGTTTTGAAG AGGCTACTTT ACTGGATACC TTCTCTGTCG GACTCCATGC GATGAATCTC
AGTGGTATAA AGTTGAATGA TAAGGTTGCT GTCATCGGTG CGGGACCTAT AGGTTTGGGA
CAGCTGCAAC TGGCTAAGCT GGCTGGTGCG GACGTTATCA TAACTGATGT AGTTGATTCA
GCGCTTGAGA TGGCCGGAGA ACTGGGCGCT GATGCAGTGG TAAATACGGA TAAGGAAGAT
GGATATCAAA AAGTGATGGA ATTTACGAAG GGCAGAGGAG TCGATATTGC TTTCGAATGT
GCCGGCGGTC CTTCAATGCC GGTGACCTTA CCACAGGCTG TATCTTTCAG CAGGATCGGC
GGTAAGGTTG TCATAGTAGG CGGCTTTGAT GCCGGCGTAA CAAATATCGG CCTCGAGTGG
CAACGGATAC AGATGTCGGA GATACAGCTG ATATCCAGTG CGAGCTATGC TTACCGGGAT
ATTTATCCGG AGATGCAGAT CTGCCTTGAT TTGCTGGCGA AGGGACAAAT GAATGCAAGG
AAAATGATTA CGCATAGTTT TCCACTATCT GAGATTAACA AGGCGTTTGA AGTAGCGGCA
GACAAAACGA AAACACATGC AATATTTGTT GCGCTTACTA TATAA
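
The gene sequence above is printed in space-separated blocks of ten bases, so the whitespace has to be stripped before any calculation. The following is a minimal sketch of how a GC percentage such as the one in the "GC content" field could be computed; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the rounding used by the source database is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, used here as a small example.
example = "ATGGGACATT GTATGCATGC"
print(gc_percent(example))  # 9 of 20 bases are G or C
```

Passing the full gene sequence to `gc_percent` gives the value to compare against the reported 49%.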
 
Protein sequence
MGHCMHAAVK TADGKFDLQE VDTPHIARPD WVVARVRVSG ICGTDLRHWK KAEAPLTGKI 
MGHELAGEIV EIGSNVTNVK VGDRVVIETL LGDETCDWCR VQQYNLCPHL YEVRMKTLSQ
AFAQYVAGPS AKFYRLPDHV SFEEATLLDT FSVGLHAMNL SGIKLNDKVA VIGAGPIGLG
QLQLAKLAGA DVIITDVVDS ALEMAGELGA DAVVNTDKED GYQKVMEFTK GRGVDIAFEC
AGGPSMPVTL PQAVSFSRIG GKVVIVGGFD AGVTNIGLEW QRIQMSEIQL ISSASYAYRD
IYPEMQICLD LLAKGQMNAR KMITHSFPLS EINKAFEVAA DKTKTHAIFV ALTI
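
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is a minimal pure-Python translator, not the database's own method. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts; since this gene begins with ATG, alternative start codons are not handled. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Amino acid assignments in the conventional TCAG codon order.
# Table 11 uses the same assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks unknown codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence.
print(translate("ATGGGACATT GTATGCATGC AGCGGTTAAG"))
```

The first thirty bases translate to the first ten residues of the protein sequence, and the final codons of the gene end in a TAA stop after the terminal residues.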