Gene TM1040_3724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3724 
SymbolpaaA 
ID4075431 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp784063 
End bp785055 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content60% 
IMG OID638005244 
Productphenylacetate-CoA oxygenase subunit PaaA 
Protein accessionYP_611953 
Protein GI99078695 
COG category[S] Function unknown 
COG ID[COG3396] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02156] phenylacetate-CoA oxygenase, PaaG subunit 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.802749 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACGCGC AGATGGTCAA ATCGACCGGC ACGGGAGTTA AATCTACCGA GGAAATGAGC 
GCCGAAGAGC GCGCGTTTCA GGCCCGTATC GATGCGGGCG AAAAAATCGA GCCGAAAGAC
TGGATGCCCG AGGGCTATCG CAAGACGCTG ATCCGCCAGA TCGGCCAGCA CGCGCATTCC
GAGATTGTCG GCCAGCTGCC CGAGGGCAAC TGGATCACCC GCGCACCGAC GCTGGAACGC
AAGGCGATCC TGCTGGCGAA AGTACAAGAC GAGGCGGGCC ACGGGCTCTA TCTCTACTGT
GCCGCTGAAA CGCTGGGCGT CAGCCGTGAC GAGATGACCG AGATGCTCTT GGACGGGCGG
ATGAAGTATT CGTCGATCTT CAACTATCCG ACCCTGACAT GGGCCGATAT GGGTGCTGTC
GGCTGGCTCG TGGATGGCGC GGCGATCATG AACCAGGTGC CGCTGCAGCG CACCTCCTTT
GGCCCCTATT CGCGTGCGAT GATCCGGGTG TGCAAGGAAG AGAGTTTTCA TCAGCGTCAG
GGCTTTGACA TCATGATGAA GATGGCGCAG GGCACGCCGC AGCAAAAAGC GATGGCTCAG
GATGCGCTCA ACCGCTTCTG GTATCCGGCG CTGATGATGT TCGGCCCCTC GGACAAGGAC
TCGGTGCATT CCGCGCAGTC GATGGCGTGG AAAATCAAGA TGAACACCAA TGACGAGCTG
CGCCAGAAGT TCGTCGATCA GACCGTGCCA CAGGCGGAAT ACCTCGGCCT AACCGTGCCG
GACGAGAACC TCAAATGGAA CGAGGAAAAG GGCGGCTACG ACTTTTCCGA GCCCGACTGG
GAAGAGTTCT TTGAGGTCAT CAAAGGCAAC GGCCCCTGCA ACACCGACCG CCTGGCCGCG
CGCAACAAGG CCTGGGACGA CGGCAAATGG GTGCGCGAGG GCATGATGGC CCACGCCGAA
AAGAAACGCG CCCGCAAGAT GGCGGCGGAG TAA
 
Protein sequence
MYAQMVKSTG TGVKSTEEMS AEERAFQARI DAGEKIEPKD WMPEGYRKTL IRQIGQHAHS 
EIVGQLPEGN WITRAPTLER KAILLAKVQD EAGHGLYLYC AAETLGVSRD EMTEMLLDGR
MKYSSIFNYP TLTWADMGAV GWLVDGAAIM NQVPLQRTSF GPYSRAMIRV CKEESFHQRQ
GFDIMMKMAQ GTPQQKAMAQ DALNRFWYPA LMMFGPSDKD SVHSAQSMAW KIKMNTNDEL
RQKFVDQTVP QAEYLGLTVP DENLKWNEEK GGYDFSEPDW EEFFEVIKGN GPCNTDRLAA
RNKAWDDGKW VREGMMAHAE KKRARKMAAE