Gene Caci_3256 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3256 
Symbol 
ID8334609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp3591756 
End bp3592859 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content71% 
IMG OID644956401 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_003114004 
Protein GI256392440 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATCC TCGGCATAGA TCATGTCGAG TTCTACGTGG GAGACGCGCG CCAAGCCGCG 
TTCTTCCTGT GCACCGCGTT CGGCTTCCAC ATCGCCGGGC ACGGCGGCCC GGAGACCGAG
CTGCCGCGGC AGCGCAGCCT GCTGCTGGCC CACGGCGACA GCAGAGTCCT GCTGACCTCG
GCCCTGAGCA GCGGACACCC CGCGGACGGC TACGTCTCCC GGCACGGCGA CGGCGTCGGC
GTCATCGCCT TCGCCACGGT CGACGCCACC GGAGCCTACG AGGAGGCGGT GGCCGCCGGC
GCGACCGCCG TGGAGGCGCC GCGCACCTAC AAGGCCGACG GCGACACCGT CACCACCGCG
AGCGTCGGCG GATTCGGCGA CGTGGTGCAC CGCTTCGTCG AACGGCGCGG CGCGGCGTTC
TGGCCCGGCG CGATCGAGCC CCAGCCGGCT CCGGCGCGCA CCGAGCCCGA ACTGGTCCGT
GCCATCGACC ACGCCGCCGT GCTGGTCCCG GACGGGGAGC TGGAGCCGAC CGTCGAGTAC
TACCAGCGGG TCTTCGGCTT CAAGCTGATC TTCGAGGAGT ACGTCGAGGT CGGCGCGCAG
GGGATGTACT CCCAGGTGGT CCAGAGCCCG TCGGGCGGGG CGACGTTCAC CATCATCCAG
CCCGACATGA GCCGGGACAG CGGGCAGATC GACGACTTCC TGGCCTGGCA CGGCGGCGCC
GGCGTGCAGC ACCTGGCGCT GAGCACCGAC GACATCGTGG CCACGGTGAA GTCCTTCGCC
GCCAACGGCG CGGGCTTCGC CCAGACCCCC GGCGCGTACT ACGACGTGCT GCCCGAGCGC
ATCGGCGCCA CCGACCTGCC GGTGGACCAG CTGCGGCCGC TGGGCATCCT GGTCGACCGC
GACCACTGGG GGCAGATGTA CCAGATCTTC TCCAAGTCGA TGCACATCCG CCGGACCTTC
TTCTGGGAGC TCATCGAGCG GCACGGTGCC AAGACCTTCG GTACCAGCAA CATCCCCGCG
CTGTACGCGG CGAAGGAGCG CGAGCTCGCC GGGGTGCGGG ACTCCGTGGA TGTGGGAGCG
ATGGGAACGG CGGGACAACG ATGA
 
Protein sequence
MDILGIDHVE FYVGDARQAA FFLCTAFGFH IAGHGGPETE LPRQRSLLLA HGDSRVLLTS 
ALSSGHPADG YVSRHGDGVG VIAFATVDAT GAYEEAVAAG ATAVEAPRTY KADGDTVTTA
SVGGFGDVVH RFVERRGAAF WPGAIEPQPA PARTEPELVR AIDHAAVLVP DGELEPTVEY
YQRVFGFKLI FEEYVEVGAQ GMYSQVVQSP SGGATFTIIQ PDMSRDSGQI DDFLAWHGGA
GVQHLALSTD DIVATVKSFA ANGAGFAQTP GAYYDVLPER IGATDLPVDQ LRPLGILVDR
DHWGQMYQIF SKSMHIRRTF FWELIERHGA KTFGTSNIPA LYAAKERELA GVRDSVDVGA
MGTAGQR