Gene TM1040_3362 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3362 
Symbol 
ID4075261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp373152 
End bp374306 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content58% 
IMG OID638004870 
Productalkanesulfonate monooxygenase 
Protein accessionYP_611596 
Protein GI99078338 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.460731 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTTG TCCCCGTGAC ATCCGCCGAT CTAGACGCCG TGGAAGTGTC CTGGTTCGCC 
GCGCTTTGCT CAGACGATTA CCAGTTTCTG GGCGTGCCTG ACGGCAACCT GCGTTCGTCC
TGGGCACATT GTTCAAACAT TGTGAAAGAG GCGGAACAAC AGGGGTTTCG GAACATCCTT
TGCCCTTCTT CCTATCAGGT CGGGCAGGAC ACGCTTAGCT TTGTTGCGGG CTGTGCGCCG
ATCACCGACA AGATCAATCT TCTGGCCGCT GTCCGCTGTG GAGAAATGCA GCCGATCATG
TTGGCGCGCA CGATTGCAAC GCTCGATCAC ATGCTCGAGG GGCGCTTGAC GGTCAATATC
ATCTCTTCCG ACTTCCCGGG CGAAAAAGCG GATAGCGATT ATCGCTACCA GCGCTCACGC
GAGGTCGTCG AAATCCTGAA GCAGGCCTGG ACCCGCGATG AGATCAACTA TCAGGGTGAG
GTCTACAGTT TCAGCGGTCT CACCACGGAC CCGGCCCGGC CGTATCAGAC CGGTGGCCCT
CTCCTGTACT TTGGCGGGTA CTCCCCAGCG GCATTGGAAC TCTGCGGTCA GCATTGCGAT
GTTTACCTCA TGTGGCCGGA GAAGATGGAA GAGCTCCAAG GGCGTATGCA GGCTGTCAAC
GCTGTGGCGG AAAAATACAA TCGCACATTG GATTATGGGC TGCGTGTTCA TACCATCGTG
CGCGACACCG AGGCTGAAGC CCGTGAATAT GCGGATTATA TCGTCTCCAA GCTGGAAGAC
GAGCGGGGCA GCGCCATCCG CGAGCGTGCC CTGGACGCCA AATCACTCGG CGTGAGCCAT
CAGGCGAAGA ACCGAGAAAT TGCAGATAGC CATGGCTTTA TCGAACCGAA CCTCTGGACT
GGCGTCGGGC GCGCCCGCTC CGGCTGTGGT GCCGCGCTGG TCGGCTCCAC CGATCAGGTC
ATGAGCAAGC TCGAAGACTA CCAGAAGATG GGCATTCGCG CGTTTATCTT CTCGGGCTAC
CCGCACCTCG AAGAAGCCAA GCATTTTGGT GCGCGGATCA TGCCGCATTT GAAAACCTGT
TCGTTGCCCG AGGCGCATGG CCGAGTTCCG TCCACAGCCC CGGCGACACC TCTTGCCGTA
GGAGAACGCC GCTAA
 
Protein sequence
MTVVPVTSAD LDAVEVSWFA ALCSDDYQFL GVPDGNLRSS WAHCSNIVKE AEQQGFRNIL 
CPSSYQVGQD TLSFVAGCAP ITDKINLLAA VRCGEMQPIM LARTIATLDH MLEGRLTVNI
ISSDFPGEKA DSDYRYQRSR EVVEILKQAW TRDEINYQGE VYSFSGLTTD PARPYQTGGP
LLYFGGYSPA ALELCGQHCD VYLMWPEKME ELQGRMQAVN AVAEKYNRTL DYGLRVHTIV
RDTEAEAREY ADYIVSKLED ERGSAIRERA LDAKSLGVSH QAKNREIADS HGFIEPNLWT
GVGRARSGCG AALVGSTDQV MSKLEDYQKM GIRAFIFSGY PHLEEAKHFG ARIMPHLKTC
SLPEAHGRVP STAPATPLAV GERR