Gene TM1040_0582 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0582 
Symbol 
ID4076147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp619950 
End bp621050 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content56% 
IMG OID638005879 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_612577 
Protein GI99080423 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.764732 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000507477 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGACCTT TCCCTCATGA TGCCCCCAAA TCGGTGATCA GCGCTGAGAA CCCGGCCGGA 
ACCGATGGAT TTGAATTTGT AGAGTTTGCC AGCCCGAACC CCGAAGAACT GCGCGAGCTC
TTTGCCAAGA TGGGGTACGA GTTGGTCGGA CGTCACAAGA CCAAGCCGGG TATCGAGCTT
TGGCAGCAGG GGGACATCAC CTACATCCTC AATGCCGAGA AAGGCTCTTT TGCGGAAAAG
TTCGTTGAAC TTCACGGCCC CTGTGCCCCC TCGATGGGCT GGCGTGTGGT CGATGCGCAA
AAGGCGTTTG AGCACGCGGT GGCCAAGGGG GCGGAGCCCT ATGAAGGCGA TGACAAAACA
ATGGATGTGC CTGCAATCAA AGGGATTGGC GGCTCGCTCA TCTACTTCAT CGACCAGTAT
TACGACACCT CGCCCTATAA CGAGGAATTC GAGTGGCTGA AGCAGTCCAA ACCGCGCGGC
GTCGGTTTTT ATTACCTCGA TCACCTCACG CACAATGTCT TCAAAGGCAA CATGGACAAG
TGGTTCCACT TTTATGGCGA CCTGTTCAAC TTCAAGGAAA TCCGGTTCTT TGACATTCAG
GGCAAGTATA CCGGCCTCTT CAGCCGTGCC TTGACCTCGC CTTGCGGCCG CATTCGCATT
CCGATCAACG AGGACCGTGG CGAGACCGGG CAGATCGTTG CCTATCTCAA GAAGTACAAT
GGCGAAGGCA TCCAGCACAT CGCTGTGGGC GCGCGTGACA TCTATGATGC CACTGACGAG
ATCTCCGAAC GTGGCATCCA GTTCATGCCG GCCCCGCCTG CAACCTATTA CGACATGAGC
CACGACCGTG TCCAAGGCCA CGAAGAGCCG CTGGATCGTA TGAAAAAGCA CGGCATCCTC
ATCGACGGCG AAGGCGTGGT GGACGGGGGC GAGACACGCA TCCTGCTGCA GATCTTCTCA
AAAACGGTGA TCGGGCCGAT CTTCTTTGAG TTCATCCAGC GCAAAGGCGA TGACGGCTTT
GGCGAGGGCA ACTTCAAGGC GCTCTTTGAA TCGATCGAAC AGGAGCAAAT CAACAACGGT
GAAATCTCCG CTGCCGAGTG A
 
Protein sequence
MGPFPHDAPK SVISAENPAG TDGFEFVEFA SPNPEELREL FAKMGYELVG RHKTKPGIEL 
WQQGDITYIL NAEKGSFAEK FVELHGPCAP SMGWRVVDAQ KAFEHAVAKG AEPYEGDDKT
MDVPAIKGIG GSLIYFIDQY YDTSPYNEEF EWLKQSKPRG VGFYYLDHLT HNVFKGNMDK
WFHFYGDLFN FKEIRFFDIQ GKYTGLFSRA LTSPCGRIRI PINEDRGETG QIVAYLKKYN
GEGIQHIAVG ARDIYDATDE ISERGIQFMP APPATYYDMS HDRVQGHEEP LDRMKKHGIL
IDGEGVVDGG ETRILLQIFS KTVIGPIFFE FIQRKGDDGF GEGNFKALFE SIEQEQINNG
EISAAE