Gene Aave_4191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAave_4191 
Symbol 
ID4666826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax citrulli AAC00-1 
KingdomBacteria 
Replicon accessionNC_008752 
Strand
Start bp4642813 
End bp4643949 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content65% 
IMG OID639825377 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_972505 
Protein GI120612827 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACCC CGCTGCCCCA CCCCGCCGTC CAGGACACCG CCGCCTGGGA GAACCCGATG 
GGTACCGACG GCTTCGAATT CATCGAATAC GCCGCTCCCG ATCCGCAGGC CATGGGCCGG
GTGTTCGAGG GCATGGGCTT CAAGCCCGTG GCGCGCCACC GCCACAAGAA CGTGACGCTC
TACCGCCAGG GCGAAATCAA CTTCATCATC AACGCCGAGC CCGACAGTTT CGCGCAGCGT
TTCGCGCGGC TGCACGGCCC CAGCGTCTGC GCCATCGCCT TCCGCGTGCA CGACGCCAAG
GCCGCCTACG AGCGCGCGCT GAACCTGGGT GCCTGGGGCT ACGCCGGCCA GGCCGGCCCG
GGCGAGCTGA ACATTCCCGC CATCAAGGGC ATCGGCGACA GCCTGATCTA CCTGGTGGAC
CGCTGGCGCG GCAAGAACGG CGCGCAACCG GGCGACATCG GCAACATCGG CTTCTTCGAC
GTCGATTTCG AGCCGCTGCC GGGCGTGACC GCCGAGGAGG CGCTGAATCC CAAGGGCCAC
GGCCTGACCT ACATCGACCA CCTGACGCAC AACGTGCACC GCGGCCGGAT GATCGAATGG
GCGAACTTCT ACGAGCGCCT GTTCAACTTC CGCGAGATCC GCTACTTCGA CATCGAAGGC
CAGGTCACCG GCGTGAAGAG CAAGGCCATG ACCAGCCCCT GCGGCAAGAT CCGCATCCCG
ATCAACGAAG AGGGCAAGGA AAAGGCCGGC CAGATCCAGG AATACCTGGA CATGTACAAC
GGCGAGGGCA TCCAGCACAT CGCCATGGGC TCGGACGACC TCTACGCCAC GGTGGACGCC
CTGCGCGGCT CCGGCGTGCG CCTGCTGGAC ACGATCGACA CCTACTACGA GCTGGTGGAC
AAGCGCATTC CCGGCCACGG CGAGAGCGTG GAAGAGCTGC ACAAGCGCAA GATCCTGATC
GACGGCAAGA AGGACGCGAT CCTGCTGCAG ATCTTCAGCG AAAACCAGCT CGGCCCGATC
TTCTTCGAGT TCATCCAGCG CAAGGGGGAC GACGGCTTCG GCAACGGCAA CTTCAAGGCG
CTGTTCGAGA GCATCGAGCT CGACCAGATG CGCCGCGGGG TGCTGCAGGG CGCCTGA
 
Protein sequence
MNTPLPHPAV QDTAAWENPM GTDGFEFIEY AAPDPQAMGR VFEGMGFKPV ARHRHKNVTL 
YRQGEINFII NAEPDSFAQR FARLHGPSVC AIAFRVHDAK AAYERALNLG AWGYAGQAGP
GELNIPAIKG IGDSLIYLVD RWRGKNGAQP GDIGNIGFFD VDFEPLPGVT AEEALNPKGH
GLTYIDHLTH NVHRGRMIEW ANFYERLFNF REIRYFDIEG QVTGVKSKAM TSPCGKIRIP
INEEGKEKAG QIQEYLDMYN GEGIQHIAMG SDDLYATVDA LRGSGVRLLD TIDTYYELVD
KRIPGHGESV EELHKRKILI DGKKDAILLQ IFSENQLGPI FFEFIQRKGD DGFGNGNFKA
LFESIELDQM RRGVLQGA