Gene Caul_3653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3653 
Symbol 
ID5901108 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3941996 
End bp3943057 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content65% 
IMG OID641564164 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_001685278 
Protein GI167647615 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.117018 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTCA ATGCGCAAAA TCCCCTCGGC CTCGACGGCT TCGAGTTCGT CGAGTTCACC 
AGCCCCGATC CAGCCGCCAT GAAGGCCCTG TTCGAACAGC TGGGCTTCGT CGCCGCCAGC
CAGCATCCGA CCAAGGCCGT GACCCGCTAC AAGCAGGGCC GTATCAACCT GCTGGTCAAT
GAAGAGACGT CCGGCCAGGT CGCCGCGTTC CGCGCCGCCC ACGGCCCCTC GGCCAACGGC
ATGGCCTTCC GGGTCGAGAA CGTCGATCAG GCCTATGCCG AGGCCCTCAA GCGCGGCGCC
GTCGCGGCGG ACGCCGGCAA GACCGTGCTG GGCGAGGGCG CCAAGGTGCT GGAAGGCATC
GGCGGTTCGA TGCTGTACCT CGTCCCGGCC GAGGGCTCGG TCTATGACAG CTGGACCCCG
GTCCCCGGCG CGGCGGAAGC CGAAGCGGCC AACAACGTCG GCCTCGACCT GCTCGACCAC
CTGACCCACA ACGTCAAGCG CGGCCAGATG CGCACCTGGT CGACCTTCTA TCGCGACGTC
TTCGGCTTCG AGGAGCAGAA GTATTTCGAC ATCAAGGGCC AGGCCACCGG CCTGTTCAGC
CAGGCGATGA TCGCGCCAGA CAAGGCCATC CGCATCCCGC TGAACGAGAG CCAGGACGAC
CACAGTCAGA TCGAGGAGTT CCTCCGCCAG TACAACGGCG AAGGCATCCA GCACCTGGCC
CTGACCACGC CCGACATCTA CGACACCGTC GAGAAGCTGC GCGCCCGGGG CGTCAAGCTG
CAGGACACCA TCGAGACCTA TTACGAGCTG GTCGACAAGC GCGTGCCAGG CCACGGCGAG
GACCTGGAGC GCCTGAGGAA GAACCGCATC CTGCTGGACG GCAAGGTCGG CGAGGAAGGC
CTGCTGTTGC AGATCTTCAC CGAGAACCTG TTTGGGCCGA TCTTCTTCGA GATCATCCAG
CGCAAGGGCA ATGAAGGCTT CGGCAACGGC AACTTCCAGG CTCTGTTCGA GAGCATCGAG
CTGGATCAGA TCCGCCGCGG CGTGATCACG GTCGAGGCCT AG
 
Protein sequence
MTVNAQNPLG LDGFEFVEFT SPDPAAMKAL FEQLGFVAAS QHPTKAVTRY KQGRINLLVN 
EETSGQVAAF RAAHGPSANG MAFRVENVDQ AYAEALKRGA VAADAGKTVL GEGAKVLEGI
GGSMLYLVPA EGSVYDSWTP VPGAAEAEAA NNVGLDLLDH LTHNVKRGQM RTWSTFYRDV
FGFEEQKYFD IKGQATGLFS QAMIAPDKAI RIPLNESQDD HSQIEEFLRQ YNGEGIQHLA
LTTPDIYDTV EKLRARGVKL QDTIETYYEL VDKRVPGHGE DLERLRKNRI LLDGKVGEEG
LLLQIFTENL FGPIFFEIIQ RKGNEGFGNG NFQALFESIE LDQIRRGVIT VEA