Gene Cpin_5420 details

Gene Information

Locus tag: Cpin_5420
Symbol:
ID: 8361597
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 6929856
End bp: 6930992
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 47%
IMG OID: 644967566
Product: 4-hydroxyphenylpyruvate dioxygenase
Protein accession: YP_003125050
Protein GI: 256424397
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase
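
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record: it assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 6929856
END_BP = 6930992
GENE_LENGTH_BP = 1137
PROTEIN_LENGTH_AA = 378


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should print True if the record is self-consistent.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 6930992 - 6929856 + 1 = 1137 bp, and 1137 / 3 = 379 codons, of which 378 encode amino acids.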


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.762295
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.240118
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATACAA CTACAGATGC TGCCGTTTCG GCCGTTCCCG CTAACGCGCA GGACTTTTTA 
CCATTGAATG GTACAGATTA CGTCGAATTT TACGTGGGTA ATGCCAAACA GGCTGCTCAT
TATTATAAAA CCGCATTTGG CTTCCAGTCA CTGGCTTATG CCGGTCCGGA AACAGGTGTA
AAAGACCGTG CTTCTTACGT ACTCGTACAG AATAAACTGC GTTTTGTACT GACCACGCCA
CTGAATCCTG ACAATGAGAT TGCACAGCAT ATACTGGAAC ACGGAGATGG TGTAAAAGTA
ATCGCACTGT GGGTGGATGA TGCCCGTGCT GCATTCGAAG AGACCGTTAA AAGAGGTGCA
AAACCATATC TGGAGCCTGT TACGGAAGAA GATGAATTTG GTGCTGTCGT ACGCAGTGGT
ATCCATATCT ATGGCGATAC AGTACATCTG TTCGTTGAAC GTAAGAACTA TAACGGTCCG
TTCCTGCCAG GTTATAAAGC ATGGAATCCG GCTTATCAGC CAACGGATAC CGGTCTGCAA
TATGTAGATC ATTGCGTGGG TAATGTGGGC TGGAATGAAA TGAATACCTG GGTAGATTTC
TATGAGCAGG TGATGGGCTT CCGCAATTTG CTGTCCTTCG ACGATAAAGA CATTTCTACC
GAGTATTCTG CCCTCATGAG TAAGGTCATG AGCAATGGCA ACGGACGTGT GAAATTCCCG
ATCAATGAGC CGGCAGAAGG TAAAAAGAAA TCCCAGATCG AAGAGTACCT GGATTTCTAT
CGTGGCGCAG GAGTACAGCA CGTTGCGATC GCTACCAACA ATATTATCCA GACAGTGACC
GATCTGCAGA ACAGAGGGGT TGAATTCCTG AAAGTACCGG AGTCTTACTA TGCTACATTG
CTGGACAGGG TAGGGCAGAT CGATGAAGAC CTGCTGCCGC TGAAACAGCT GGGTATCCTG
GTTGACCGCG ATGATGAAGG ATATCTGTTG CAGATCTTTA CAAAACCTGT ACAGGATCGT
CCGACAGTAT TCTTTGAAAT CATCCAGCGT AAAGGTGCTA AATCTTTCGG TAAAGGTAAT
TTCAAAGCGC TGTTTGAATC TATTGAAAGA GAACAGGCGC TGAGAGGCAA CCTGTGA
 
Protein sequence
MHTTTDAAVS AVPANAQDFL PLNGTDYVEF YVGNAKQAAH YYKTAFGFQS LAYAGPETGV 
KDRASYVLVQ NKLRFVLTTP LNPDNEIAQH ILEHGDGVKV IALWVDDARA AFEETVKRGA
KPYLEPVTEE DEFGAVVRSG IHIYGDTVHL FVERKNYNGP FLPGYKAWNP AYQPTDTGLQ
YVDHCVGNVG WNEMNTWVDF YEQVMGFRNL LSFDDKDIST EYSALMSKVM SNGNGRVKFP
INEPAEGKKK SQIEEYLDFY RGAGVQHVAI ATNNIIQTVT DLQNRGVEFL KVPESYYATL
LDRVGQIDED LLPLKQLGIL VDRDDEGYLL QIFTKPVQDR PTVFFEIIQR KGAKSFGKGN
FKALFESIER EQALRGNL
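
The protein sequence should follow from the gene sequence under translation table 11. A minimal sketch of that check is given below, assuming the sequence blocks are pasted in with their spaces and line breaks intact. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical and not part of the record. Table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (the usual NCBI layout).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGCATACAA CTACAGATGC TGCCGTTTCG"))  # MHTTTDAAVS
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence would test the whole record. `gc_percent` on the full gene sequence can be compared with the reported GC content in the same way.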