Gene TM1040_0289 details


Gene Information

Locus tag: TM1040_0289
Symbol:
ID: 4077424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 295557
End bp: 296987
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 65%
IMG OID: 638005583
Product: multicopper oxidase, type 3
Protein accession: YP_612284
Protein GI: 99080130
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2132] Putative multicopper oxidases
TIGRFAM ID:
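
The coordinate and length fields above can be checked against one another with a little arithmetic. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 295557
end_bp = 296987

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1431
print(protein_length_aa)  # 476
```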


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACATCGA AATCCAAAAG GTGTGTCATG AACCCCAGCC GTCGCCATGT CCTTGCAGGT 
CTCACCGCAT GTGCCAGCCT GCCCGCCCTG CCCCGCTTTG CACAGGCCAG CACAGAGACA
AATGTGGAAT CAGCGGCGCA TCGTCTGACC CTTGCCCCGG CCCGCCAGCA GATTGCACCA
ACCGACTATC CCGCCACAGA TCTGTGGTGC GTCAACGGCA CAATGCCGGG CGAAACCCTG
CGCCGGACAC AGGGCGCGCG TCTGGTGGTT GAGGTCGACA ACCAGCTCCC GCAGGCCACA
AGCCTCCATT GGCACGGCAT CCGCATCGAC AATGCCATGG ACGGGGTGCC GGGCGTCACC
CAGGATCCGA TCCAGCCCGG CACGCGCTTC ACCTATGACT TCGCCCTGCC CGATGCGGGC
ACATATTGGT ATCATTCCCA CGCGCAATCG GTGGAGCAGG TCGAACGTGG CCTGCAAGGC
GCATTGATCA TCGAAGAACC CGACGCGCCC GACGTGGATC AGGACCTGCC GCTGGTGATC
GACGACCTGC GCGTCACAGC CGAGGCGGCG ATAGATGCCA ATTTCGCCGT GCCGCACGAT
CTGTCACACG CCGGGCGCAT GGGGAACATC CTGCTCTGCA ATGGCAAAAT GGTTGCCGAT
CATCCCGTCA AAACCGGAGA CCGCTTGCGC CTGCGCCTCA TCAACAGCGC CAATGCGCGT
ATCTTCACCC TCGGCCTGCA GGGCCTCGAG GGCTGGGTGA TGGCCTATGA CGGTATGCCG
CTGGCCACCC CTGAGGCGCT GCCGGACCGG CTCCTGCTTG CCCCCGCCCA GCGCGTCGAC
CTCTTTGTGG ATGTGACCGC AGCCGCAGGC GAAGACGCCT TCCTTGTACA GTTTGAACGC
GATGGTGGCT ACGAGCTTGC GCGCTTCCCG GTGGCTTCCG GCACCCGCGC CCGCCGCCCC
GCGCCCAGCG CTCTGCCCCC CAATCCCGAC TTCCCGATCG ACCTCCCGTC CGCGCGCAGC
GTCGATCTCA TCACCGAGGG CGGCGCGATG GCGTGGCTCA GCGGCGCCAG GTTCAAGGGC
GAGGATCTCT CCGGTCAGGA TCTCGCCCAA TTGGGGCAGT TCTGGGCGTT CAACGGCAGC
GCCGGCCGCC CGCCTGAGCC CTTTGTGACC GCGCGCCTCG GCGAGACCAT TCGCATCCGC
ATGGTCAACG ACACTCGCTT TCCGCATGCG ATGCACCTCC ATGGCATGCA TTTCTCCGAG
GTCCGTGCGG ACGGCAGCCT CGGCCCCCTG CGCGACACAC TCCTGATGCT GCAGGGAGAG
ACCCGCGAGA TCGCTTTTCA GGCGCACAAC GCAGGCGACT GGCTCTTTCA CTGCCATATG
TTGTCGCATC ACGCCGCGGG CATGGGCACA TGGGTGCGCG TCACAGCCTG A
 
Protein sequence
MTSKSKRCVM NPSRRHVLAG LTACASLPAL PRFAQASTET NVESAAHRLT LAPARQQIAP 
TDYPATDLWC VNGTMPGETL RRTQGARLVV EVDNQLPQAT SLHWHGIRID NAMDGVPGVT
QDPIQPGTRF TYDFALPDAG TYWYHSHAQS VEQVERGLQG ALIIEEPDAP DVDQDLPLVI
DDLRVTAEAA IDANFAVPHD LSHAGRMGNI LLCNGKMVAD HPVKTGDRLR LRLINSANAR
IFTLGLQGLE GWVMAYDGMP LATPEALPDR LLLAPAQRVD LFVDVTAAAG EDAFLVQFER
DGGYELARFP VASGTRARRP APSALPPNPD FPIDLPSARS VDLITEGGAM AWLSGARFKG
EDLSGQDLAQ LGQFWAFNGS AGRPPEPFVT ARLGETIRIR MVNDTRFPHA MHLHGMHFSE
VRADGSLGPL RDTLLMLQGE TREIAFQAHN AGDWLFHCHM LSHHAAGMGT WVRVTA
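
As a rough consistency check between the two sequences, the sketch below translates the first line of the gene sequence and compares it with the start of the protein sequence. It is a minimal Python example, not the method used to produce this record. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the code assumes that translation table 11 assigns amino acids to codons in the same way as the standard genetic code, differing only in which codons may act as start codons.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 shares these codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked):
    """Remove the spaces and line breaks that group residues in blocks of 10."""
    return "".join(blocked.split()).upper()


def translate(dna):
    """Translate complete codons from the first base, stopping at a stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence and the first two protein blocks from the record.
dna_start = clean("ATGACATCGA AATCCAAAAG GTGTGTCATG AACCCCAGCC GTCGCCATGT CCTTGCAGGT")
protein_start = clean("MTSKSKRCVM NPSRRHVLAG")

print(translate(dna_start))                   # MTSKSKRCVMNPSRRHVLAG
print(translate(dna_start) == protein_start)  # True
```

Passing the full cleaned gene sequence to `translate` and `gc_fraction` would give values to compare against the Protein Length and GC content fields.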