Gene TM1040_3467 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3467 
Symbol 
ID4075101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp491473 
End bp492453 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content62% 
IMG OID638004976 
Product3,4-dihydroxyphenylacetate 2,3-dioxygenase HpaD 
Protein accessionYP_611701 
Protein GI99078443 
COG category[R] General function prediction only 
COG ID[COG2514] Predicted ring-cleavage extradiol dioxygenase 
TIGRFAM ID[TIGR02295] 3,4-dihydroxyphenylacetate 2,3-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCATTC CCGCCGCAAA TCTTTATCCG CCCTTCAACA TCACCCGCCT CAGCCATGTG 
GAATACGGCG TCACCGACCT TGCGGCCTCC CGCACCTTCT ACGTCGACAT CCTCGGCCTG
CAGGTCACTC ATGAGGACGA CAGCCGGATT TATCTGCGCG CCATGGAAGA ACGCGGCCAC
CACTGCATCA TCCTGCGCCA ATCCGACCAC GCGGGCGTCG CGTGCCTCGG CTTCAAACTC
TATGACACGC CGGATCTTGA GAAGGCCGCC GCCTTTTTCG AGGGCAAAAG CCTGCCGGTG
GAGTGGATCG AACGCCCCTT CATGGGGCCG AGCTTGCGCA CCCGCGATCC ATGGGGTGTG
CCGCTGGAGT TCTACGTCAA GATGGACCGC CTCCCGCCGA TACATCAGCA GTACAGGCTC
TATAATGGCG TGAAACCCCT CCGCATCGAC CACTTCAACA TGTTTTCGGC CAATGTCGAC
GCGGCGGTGG CCTTCTACGG CGAGATGGGG TTTCGCGTCA CCGAATATAC CGAGGATGAC
GACTCCGGCC GCGTCTGGGC AGCCTGGATG CACCGCAAGG GCGGCGTGCA TGATGTGGCC
TTCACCAATG GAACCGGCCC GCGTCTGCAT CACACCGCCT TTTGGGTACC AACCCCGCTC
AACATCATCG ACCTCCTCGA TCTGATGTCG ACCACCGGCT ATGTCGCCAA TATCGAACGC
GGCCCCGGCC GCCACGGCAT TTCCAACGCG TTCTTCCTCT ATGTGCGCGA CCCCGACGGC
CACCGGATCG AAATCTATTG CTCGGACTAT CAGACCTGCG ATGCGGATCT GGAGCCGATC
AAATGGTCCC TCACCGACCC GCAGCGCCAG ACCCTCTGGG GCGCACCCGC ACCGCGCAGC
TGGTTCGAAG AAGGCTCCCT GTTCGACGGG GCCGAAACAC GCGACAGTGA TCTCAAAGCA
CAACCGATCA TCGCGCCGTA A
 
Protein sequence
MPIPAANLYP PFNITRLSHV EYGVTDLAAS RTFYVDILGL QVTHEDDSRI YLRAMEERGH 
HCIILRQSDH AGVACLGFKL YDTPDLEKAA AFFEGKSLPV EWIERPFMGP SLRTRDPWGV
PLEFYVKMDR LPPIHQQYRL YNGVKPLRID HFNMFSANVD AAVAFYGEMG FRVTEYTEDD
DSGRVWAAWM HRKGGVHDVA FTNGTGPRLH HTAFWVPTPL NIIDLLDLMS TTGYVANIER
GPGRHGISNA FFLYVRDPDG HRIEIYCSDY QTCDADLEPI KWSLTDPQRQ TLWGAPAPRS
WFEEGSLFDG AETRDSDLKA QPIIAP