Gene Namu_0528 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_0528 
Symbol 
ID8446111 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp587219 
End bp588424 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content70% 
IMG OID645039663 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_003199935 
Protein GI258650779 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTGT CGTTCGAGGA CACCCGGCGC GCAGCACCCG ATGAACCCAC CGACGAGGCC 
TTGCGCGACC TGGTCGGCCT GGTCGACCAC GACGTGACCA CCGACCCGTT CCCGGTGCTC
GGCATGGACG CCGTGGTGTT CGTCTGCGGC AACGCCACCC AGTCCGCCCA GTACTTCCAG
GCGGTGTGGG GGATGGACCC GTTCGCCTAC CGCGGCCCGG AGACCGGCCA CCGCGGCGAC
GTCGCCTACG CGCTCAAATC CGGGTCCGCC CGCTTCGTGC TGACCGGCGG CGTGGCCCCG
GACAGCCCGC TGCTGGACAT CCACCGTGAG CACGGTGACG CCGTCGTCGA CCTGGCCATG
GAGGTCGGTG ACGTCGACCG GTGCATCGAG CAGGCGCGGC GGCAGGGCGC GCGCATCCTG
GCCGAACCGC ACGACGAGAC CGACGAGCAC GGCACCGTCC GCAGCGCCGC CATCGCCACC
TACGGCCACG TCCGGCACAC GCTGATCGAC CGGCGCCGGT ACACCGGCCC CTACCTGCCC
GGCTTCCGCC CCGTCACGAC CACCGCGGTC CGGCCGGACG GCGCCCCGCG GCGGCTGTTC
CAGGCCCTGG ACCACGCGGT CGGCAACGTC GAACTCGGCC GCATGGACGA ATGGGTCGAC
TTCTACAACC GCGTCATGGG TTTCGTGAAC ATGGCCGAGT TCGTCGGCGA CGACATCGCC
ACCGAGTACT CGGCGCTGAT GAGCAAGGTC GTCGCCAACG GCAACCACCG GGTGAAGTTC
CCGCTGAACG AACCGGCGGC CGGCAAGCGC AAGTCGCAGA TCGACGAGTA CCTGGAGTTC
CATGCCGGCC CGGGATGCCA GCACCTGGCG CTGGCCACCG GCGACATCCT GGGCACGGTG
GACGCCCTGC GGGATCGGGG CGTGCGGTTC CTGGACACTC CGGACGCCTA CTACGACGAC
CCGGAGCTGG CGGCCCGGAT CGGGCCGGTG CGGGTGCCGG TCGCCGAACT CAAGAAGCGC
CGGATCCTGG TCGACCGGGA CGAGGACGGC TACCTGCTGC AGATCTTCAC CAAGCCGCTG
GGCGACCGGC CGACGATCTT CTTCGAGATC ATCGAGCGGC ACGGCTCTCT CGGCTTCGGA
AAGGGCAACT TCAAGGCGCT GTTCGAATCG ATCGAGCGGG AGCAGGCGGC CCGCGGGAAC
CTGTGA
 
Protein sequence
MTLSFEDTRR AAPDEPTDEA LRDLVGLVDH DVTTDPFPVL GMDAVVFVCG NATQSAQYFQ 
AVWGMDPFAY RGPETGHRGD VAYALKSGSA RFVLTGGVAP DSPLLDIHRE HGDAVVDLAM
EVGDVDRCIE QARRQGARIL AEPHDETDEH GTVRSAAIAT YGHVRHTLID RRRYTGPYLP
GFRPVTTTAV RPDGAPRRLF QALDHAVGNV ELGRMDEWVD FYNRVMGFVN MAEFVGDDIA
TEYSALMSKV VANGNHRVKF PLNEPAAGKR KSQIDEYLEF HAGPGCQHLA LATGDILGTV
DALRDRGVRF LDTPDAYYDD PELAARIGPV RVPVAELKKR RILVDRDEDG YLLQIFTKPL
GDRPTIFFEI IERHGSLGFG KGNFKALFES IEREQAARGN L