Gene Noca_1295 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1295 
Symbol 
ID4598917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1368677 
End bp1369870 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content68% 
IMG OID639775889 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_922496 
Protein GI119715531 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.498463 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACTGA CACCGGACGA GCTCAAGGCC GACCTGACCC TCGACCAGCT CAAGGAGCTG 
GTCGGCCTCG TGGAGTACGA CGCCGCCAAC GACCCCTTCC CGGTGATCGC CCAGGACGCC
GTCTGCTTCG TGGTCGGCAA CGCCACCCAG ACCGCGTCCT TCTACCAGCT CGCGCTCGGG
ATGGAGCTCG AGGCCTACCG TGGCCCGGAG AACGGGTGTC GCGAGTCCAA GTCCTACGTG
CTGCGCTCGG GCAGCGCCCG GTTCGTCTTC ACCGGCGGGG TCACCCCGGA CAGCCCGGTG
CTCGACCACC ACCGCAGGCA CGGCGACGGG GTCGTCGACC TGGCGATGGA GGTGCCCGAC
GTCGACCGGT GCATCGAGCA CGCCCGGGCG ATGGGCGCCA CGATCCTGGT GGAGCCCCAC
GACGAGACCG ACGAGCACGG CACGGCCCGG CTCGCCGCGA TCGCGACGTA CGGCGAGACC
CGCCACACCC TGGTCGACCG CTCGCGCTAC TCCGGCCCCT ACCTGCCCGG CTACGCGCCG
GCCACGACCA CGGTGACCCG CCGCGAGGGC CGGCCCAAGC GCCTGTTCCA GGCGATCGAC
CACTGCGTCG GGAACGTCGA GCTCGGCCGC ATGGACGAGT GGGTGACGTT CTACAACAGG
GTGCTCGGCT TCACGAACAT GGCCGAGTTC ATCGGCGACG ACATCGCGAC CGACTACTCC
GCGCTGATGT CCAAGGTCGT CGCGAGCGGC AACCACCGGG TGAAGTTCCC GCTCAACGAG
CCCGCCGTGG CGAAGAAGAA GTCCCAGATC GACGAGTACC TCGAGTTCTA CGACGGTGCC
GGCTGCCAGC ACATCGCGCT GGCGACCAAC GACATCCTGC GCAGCGTCGA CGTCCTGCGC
GAGAACGGCA TCCAGTTCCT CGACACCCCG GACTCCTACT ACGACGACCC CGAGCTGCGC
GCCCGGATCG GCGAGGTGCG GGTGCCGATC GAGGAGCTGA AGAAGCGCAA GATCCTCGTC
GACCGCGACG AGGACGGCTA CCTGCTGCAG ATCTTCACCA AGCCGATGGG GGACCGGCCG
ACGGTCTTCT ACGAGTTCAT CGAACGGCAC GGCTCGCTCG GCTTCGGCAA GGGCAACTTC
AAGGCGCTGT TCGAGGCGAT CGAGCGCGAG CAGGAGCTCC GCGGCAACCT CTGA
 
Protein sequence
MTLTPDELKA DLTLDQLKEL VGLVEYDAAN DPFPVIAQDA VCFVVGNATQ TASFYQLALG 
MELEAYRGPE NGCRESKSYV LRSGSARFVF TGGVTPDSPV LDHHRRHGDG VVDLAMEVPD
VDRCIEHARA MGATILVEPH DETDEHGTAR LAAIATYGET RHTLVDRSRY SGPYLPGYAP
ATTTVTRREG RPKRLFQAID HCVGNVELGR MDEWVTFYNR VLGFTNMAEF IGDDIATDYS
ALMSKVVASG NHRVKFPLNE PAVAKKKSQI DEYLEFYDGA GCQHIALATN DILRSVDVLR
ENGIQFLDTP DSYYDDPELR ARIGEVRVPI EELKKRKILV DRDEDGYLLQ IFTKPMGDRP
TVFYEFIERH GSLGFGKGNF KALFEAIERE QELRGNL