Gene TM1040_2622 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2622 
Symbol 
ID4077925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2754850 
End bp2756106 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content63% 
IMG OID638007946 
Productfumarylacetoacetate hydrolase 
Protein accessionYP_614616 
Protein GI99082462 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID[TIGR01266] fumarylacetoacetase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.817563 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTTGA AAAAGTCCTG GGTGACGTCG GCCAATTCCG CTGACCACCC TTTTCCGCTG 
AACAACCTTC CCTATGGCGT GTTTTCAACC CCCGACAGTG ATCCGCGCTG TGGGGTCGCC
ATCGGCGATA TGATCCTCGA TCTTGCTGCG GCAGAGGCGG CGGGGCTGAT CGACCTCGCA
GATGAGCCAA TGTTCGAAAT GCCCTTCTGG AACGAGTTTA TGGAAGAAGG CCCCGCGGTC
TGGGCCGCGT TGCGCACCCG CCTGATCGCG CTTCTGGAAG AGGGCTCGGA CGCGCAAAAC
AAGGTCGAGC CTTGTCTCGT ACCGATGTCA GGCGCTGAGA TGCATATGCC AATCATGGTG
TCGGAATACA CCGATTTCTA TGCCGGGCGT CATCACGCCA CCAATGTCGG CACCATGTTT
CGGGGCGCCG AGAACGCGCT GCCGCCAAAC TGGCTGCATA TTCCGATCGG ATACAATGGT
CGTGCCTCCT CGGTTGTGGT CTCCGGCACC GATGTGCGCC GTCCCTGGGG TCAGCTCAAG
GGGCCAAATG ACGACGCACC CCGTTGGGCC CCCTGTGCGC GCTTTGACAT CGAACTGGAG
ATGGGAGCCA TCGTCGGGAC ACCCTCCGAC GGGCCGATCA CGGTGCAGGA CGCGGACGAT
CACATCTTTG GCTATGTGCT GCTGAACGAC TGGTCCGCCC GCGATATTCA GGCTTGGGAA
TACCAGCCGC TCGGCCCTTT TCAGGCCAAG GCGACCGCCA CCACCATCAG CCCCTGGATC
GTCACCAAGG CCGCATTGGA GCCCTTCCGC TGCGACACCC CCGCGCGGGA GGTCGAGCTT
CTCGATCACC TCAAGGACTG TGGCCCGATG CTCTATGACA TCGACCTTGC CGTGACCCTG
CGCCCCGAGG GGGGCGAAGA GGCCACCATC GCACGCACCA ACTACAAGGA AATGTACTAT
TCCGCCGCGC AGCAACTGGC CCATCACACC ACATCGGGCT GCCCGATGAA CGCGGGCGAC
CTGCTGGGTT CCGGCACCAT CTCCGGTCCG AACAAGGACG AACGCGGCTC GCTCCTCGAA
CTCAGCTGGG GTGGCAAGGA GCCTCTCACC CTGCCTTCGG GCGATACCCG CAGCTTTATC
GAGGATGGCG ACACGCTGAC CCTCAAAGGC GCGGCCAAGG GCGAGGGCTA CACCATCGGC
TTTGGCGACT GCACCGGCAC GGTGCTGCCC GCCCTCAGCG ATCCGTTTGC ACGCTGA
 
Protein sequence
MPLKKSWVTS ANSADHPFPL NNLPYGVFST PDSDPRCGVA IGDMILDLAA AEAAGLIDLA 
DEPMFEMPFW NEFMEEGPAV WAALRTRLIA LLEEGSDAQN KVEPCLVPMS GAEMHMPIMV
SEYTDFYAGR HHATNVGTMF RGAENALPPN WLHIPIGYNG RASSVVVSGT DVRRPWGQLK
GPNDDAPRWA PCARFDIELE MGAIVGTPSD GPITVQDADD HIFGYVLLND WSARDIQAWE
YQPLGPFQAK ATATTISPWI VTKAALEPFR CDTPAREVEL LDHLKDCGPM LYDIDLAVTL
RPEGGEEATI ARTNYKEMYY SAAQQLAHHT TSGCPMNAGD LLGSGTISGP NKDERGSLLE
LSWGGKEPLT LPSGDTRSFI EDGDTLTLKG AAKGEGYTIG FGDCTGTVLP ALSDPFAR