Gene Acid345_2004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2004 
Symbol 
ID4070910 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2402949 
End bp2404034 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content56% 
IMG OID637984018 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_591079 
Protein GI94969031 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.962257 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAATC CTCTTAAGCT CAAGAAGATC CATCACGTTG AATTTTGGGT AGGGAACGCC 
AAGCAGGCGG CCTACTATTA TCGCAAAGGC TTCGGATTCA ACCAGCTTGC GTATCGCGGT
CTTGAGACCG GCTCACGCGA CGTCGCCTCG TACGTCCTCC AGCAGAACAA GTGCAACTTC
GTCCTGACAA CGGCGATAAA CCCGGAGCAT CCTGCGGCCG ACCACGTTCG TCGGCATGGC
GACGGCGTGA AAGACATCGC TCTGCACGTG GAAGACGCCG ATTTCGCCTT CGAAGAAGCC
GTGAAGCGCG GCGCAAAGGC TGTCTTCGAG CCTAAAGATG TGATCGACAA TTACGGTACC
GTTCGTTGGG CTGCGGTGCA CACCTACGGC GAAACCATCC ACTCGTTTAT TTCCTACAAG
AATTATGGCG GTCCATTTCT GCCGGGATTT CAATCGGCGA TTGTCCCCGG CGAAGAGACC
GGCATCCTGC TGGTGGACCA CATGGTCGGG AACGTCGAAC TCGGCAAGAT GAATGCCTGG
GCCGATTTCT ACCACGATGT ATTCGGCTTC CATCGCTTCA TCAGCTTCGA CGACAAGGAC
ATCTCGACCG AGTACTCGGC GCTGATGTCC ATCGTGATGT CGGACGATTC GTACTTCGTG
AAGTTTCCGA TCAATGAGCC CGCACCGGGT AAGCGCAAGA GCCAGATTGA CGAGTACCTC
GAAGCGTATG GCAGCCCCGG GGTACAGCAC ATCGCGCTGC GCGTAACCGA CGTCATTGAG
ACCGTCAGCA AGCTCCAGAA GAACGGTATC GAGTTCCTGC GCGTCCCAGA CTCGTATTAC
GACATTGTGC AGGAACGCGT CGGGCCGATT GACGAGCCGA TTGAGAAGAT CAAGCAGCTC
GGTATCCTGA TCGACAAGGA CGATGAAGGC TACCTGCTGC AGATCTTCAG CAAGCCGGTG
GAGGACCGTC CCACGGTGTT TTTTGAGATT ATTCAGCGCA AGGGAAGCCG CGGTTTCGGC
AAAGGCAACT TTAAAGCACT GTTCGAGGCA TTGGAATTGG AACAGGCACG GCGCGGCAAC
CTGTAA
 
Protein sequence
MQNPLKLKKI HHVEFWVGNA KQAAYYYRKG FGFNQLAYRG LETGSRDVAS YVLQQNKCNF 
VLTTAINPEH PAADHVRRHG DGVKDIALHV EDADFAFEEA VKRGAKAVFE PKDVIDNYGT
VRWAAVHTYG ETIHSFISYK NYGGPFLPGF QSAIVPGEET GILLVDHMVG NVELGKMNAW
ADFYHDVFGF HRFISFDDKD ISTEYSALMS IVMSDDSYFV KFPINEPAPG KRKSQIDEYL
EAYGSPGVQH IALRVTDVIE TVSKLQKNGI EFLRVPDSYY DIVQERVGPI DEPIEKIKQL
GILIDKDDEG YLLQIFSKPV EDRPTVFFEI IQRKGSRGFG KGNFKALFEA LELEQARRGN
L