Gene Xaut_3333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagXaut_3333 
Symbol 
ID5422759 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameXanthobacter autotrophicus Py2 
KingdomBacteria 
Replicon accessionNC_009720 
Strand
Start bp3707547 
End bp3708914 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content67% 
IMG OID640882582 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_001418219 
Protein GI154247261 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGTC TGCCCCACCC CCACCTTCAG GGCGCGCCGG TGCGCCACAG CGCGCTCGTG 
CCCGGCTACA TGTCCGGCTT CGGCAATTCG TTCGAGACCG AGGCGCTGGA AGGCACCTTG
CCCATCGGCC GCAACTCGCC GCAGAAGATC AATTACGGGC TCTATGCCGA GCAGCTCTCC
GGCTCGCCCT TCACCGCGCC GCAGGCGGTG AACGAGCGCT CCTGGCTCTA CCGCATCCGG
CCCACGGTGA AGCATTCCGG TCGCTACCGC CGCGTGGACA AGGGCCTGAT CCGCACCGCG
CCCATGGCCC GCGACGAGAG CGAGTTGACG CTCGGCCAGT ACCGCTGGAG CGCGCTCCCG
CTACCGCAGG ACAAGCGCAC CTTCGTTTCG GGCCTTGCCA CCCTGACCAC GGCGGGGGAC
GCCGATGGCC AGAGCGGCAT GGCGGCCCAC ATGGCCTTCG TCACGGCGTC CATGGAAAAC
GACTATTTCT TCAACGCGGA CGGCGAATTG CTGGTGGTGG CGCAGCAGGG GGCGCTGCGC
TTCCGCACCG AATTCGGCGT CATCGACATC GCGCCCGGCG AGATCTGCGT GATCCCGCGC
GGCGTGATCT TCAAGGTGGA GCTGATCGAC GGGCCGGCCC GCGCCTATGT CTGCGAGAAT
TACGGCGCCA CCTTCACCCT GCCGGACCGT GGCCCCATCG GCGCCAATTG CCTTGCCAAC
CCGCGCGACT TCCTCACCCC CGTCGCCGCC TACGAGGACC GGGAGGAGCC CTCGCAGCTG
TTCGTGAAGT GGGGCGGCGA ATTGTTCGTC ACCGACATCG GCCAGTCGCC CCTCGACGTG
GTGGCCTGGC ACGGCAATTA CGCGCCGTAC AAATATGACC TGCGCACCTT CTCGCCCGTC
GGCGCGCTGA TGTTCGACCA TCCGGACCCG TCCATCTTCA CCGTGCTCAC CTCGCCGTCG
GGCACGCCGG GCACGGCCAA CATCGATTTC GTCATCTTCC CCGAGCGCTG GATGGTGGCG
GAGAATACGT TCCGCCCGCC GTGGTACCAC CGTAACATCA TGTCCGAATT CATGGGGCTC
ATCTTCGGCG TCTACGACGC CAAGCCCGAG GGCTTCGAGC CCGGCGGCTT CTCCCTGCAC
AACCTCATGC TGCCCCACGG GCCGGACGAG CAGGCCTTCG AGCACGCCTC CACCGGCGAG
CTGAAGCCGG TGAAGCTGGA GAATACGCTG GCCTTCATGT TCGAGACCCG CGTGGCCCAG
CGCGTCACCG CCTATGCCGC CGGCGTGCCG CAGCTCCAGG CCGATTATGT GGACTGCTGG
GCCGGCCTGA AGAAGCGCTT CGACCCTACC CGCAAGGATG CGTGGTGA
 
Protein sequence
MNSLPHPHLQ GAPVRHSALV PGYMSGFGNS FETEALEGTL PIGRNSPQKI NYGLYAEQLS 
GSPFTAPQAV NERSWLYRIR PTVKHSGRYR RVDKGLIRTA PMARDESELT LGQYRWSALP
LPQDKRTFVS GLATLTTAGD ADGQSGMAAH MAFVTASMEN DYFFNADGEL LVVAQQGALR
FRTEFGVIDI APGEICVIPR GVIFKVELID GPARAYVCEN YGATFTLPDR GPIGANCLAN
PRDFLTPVAA YEDREEPSQL FVKWGGELFV TDIGQSPLDV VAWHGNYAPY KYDLRTFSPV
GALMFDHPDP SIFTVLTSPS GTPGTANIDF VIFPERWMVA ENTFRPPWYH RNIMSEFMGL
IFGVYDAKPE GFEPGGFSLH NLMLPHGPDE QAFEHASTGE LKPVKLENTL AFMFETRVAQ
RVTAYAAGVP QLQADYVDCW AGLKKRFDPT RKDAW