Gene Information |
Locus tag | Acid345_2004 |
Symbol | |
ID | 4070910 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2402949 |
End bp | 2404034 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637984018 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_591079 |
Protein GI | 94969031 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
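The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the record does not state either convention explicitly.

```python
# Values copied from the Gene Information table above.
start_bp = 2402949
end_bp = 2404034

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1086 361
```

Both results match the Gene Length (1086 bp) and Protein Length (361 aa) fields.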
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.962257 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAATC CTCTTAAGCT CAAGAAGATC CATCACGTTG AATTTTGGGT AGGGAACGCC AAGCAGGCGG CCTACTATTA TCGCAAAGGC TTCGGATTCA ACCAGCTTGC GTATCGCGGT CTTGAGACCG GCTCACGCGA CGTCGCCTCG TACGTCCTCC AGCAGAACAA GTGCAACTTC GTCCTGACAA CGGCGATAAA CCCGGAGCAT CCTGCGGCCG ACCACGTTCG TCGGCATGGC GACGGCGTGA AAGACATCGC TCTGCACGTG GAAGACGCCG ATTTCGCCTT CGAAGAAGCC GTGAAGCGCG GCGCAAAGGC TGTCTTCGAG CCTAAAGATG TGATCGACAA TTACGGTACC GTTCGTTGGG CTGCGGTGCA CACCTACGGC GAAACCATCC ACTCGTTTAT TTCCTACAAG AATTATGGCG GTCCATTTCT GCCGGGATTT CAATCGGCGA TTGTCCCCGG CGAAGAGACC GGCATCCTGC TGGTGGACCA CATGGTCGGG AACGTCGAAC TCGGCAAGAT GAATGCCTGG GCCGATTTCT ACCACGATGT ATTCGGCTTC CATCGCTTCA TCAGCTTCGA CGACAAGGAC ATCTCGACCG AGTACTCGGC GCTGATGTCC ATCGTGATGT CGGACGATTC GTACTTCGTG AAGTTTCCGA TCAATGAGCC CGCACCGGGT AAGCGCAAGA GCCAGATTGA CGAGTACCTC GAAGCGTATG GCAGCCCCGG GGTACAGCAC ATCGCGCTGC GCGTAACCGA CGTCATTGAG ACCGTCAGCA AGCTCCAGAA GAACGGTATC GAGTTCCTGC GCGTCCCAGA CTCGTATTAC GACATTGTGC AGGAACGCGT CGGGCCGATT GACGAGCCGA TTGAGAAGAT CAAGCAGCTC GGTATCCTGA TCGACAAGGA CGATGAAGGC TACCTGCTGC AGATCTTCAG CAAGCCGGTG GAGGACCGTC CCACGGTGTT TTTTGAGATT ATTCAGCGCA AGGGAAGCCG CGGTTTCGGC AAAGGCAACT TTAAAGCACT GTTCGAGGCA TTGGAATTGG AACAGGCACG GCGCGGCAAC CTGTAA
|
Protein sequence | MQNPLKLKKI HHVEFWVGNA KQAAYYYRKG FGFNQLAYRG LETGSRDVAS YVLQQNKCNF VLTTAINPEH PAADHVRRHG DGVKDIALHV EDADFAFEEA VKRGAKAVFE PKDVIDNYGT VRWAAVHTYG ETIHSFISYK NYGGPFLPGF QSAIVPGEET GILLVDHMVG NVELGKMNAW ADFYHDVFGF HRFISFDDKD ISTEYSALMS IVMSDDSYFV KFPINEPAPG KRKSQIDEYL EAYGSPGVQH IALRVTDVIE TVSKLQKNGI EFLRVPDSYY DIVQERVGPI DEPIEKIKQL GILIDKDDEG YLLQIFSKPV EDRPTVFFEI IQRKGSRGFG KGNFKALFEA LELEQARRGN L
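The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch that translates DNA codon by codon. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (it differs in which start codons are permitted), so the standard 64-codon table is used here. The helper names `clean`, `translate`, and `gc_percent` are hypothetical and chosen for illustration; they are not part of any tool referenced by this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGCAAAATC CTCTTAAGCT CAAGAAGATC"
print(translate(first_blocks))  # MQNPLKLKKI
```

The output matches the first ten residues of the protein sequence. The full gene sequence can be passed to `translate` and `gc_percent` in the same way to compare against the listed protein sequence and GC content.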