Gene TM1040_3839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3839 
Symbol 
ID4074902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008042 
Strand
Start bp84443 
End bp85978 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content55% 
IMG OID638004496 
Productmethyltransferase type 11 
Protein accessionYP_611231 
Protein GI99077972 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0810259 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATT GGACAGCAGG GTATGTCGCG GATATAGGTT ATACTTACGG GTATTACACG 
GAACTGAACC CGGTGCGCAT TCAGCTGGCG TTCCTCAACG CAGGTCTTGC TCCGCCGAGG
GTTGGAGCGG CCTGTGAGCT TGGCTTTGGG CAGGGGCTTT CAACGAACAT CCACGCTGCG
GCATCGGTTA CTACGTGGCA TGGTACGGAT TTCAATCCTT CTCAGGCGGG CTTTGCGCAA
GAGCTGGCCA CGGATTGTGG GAACGGTGCC CAGCTCTTTG ATCAAGCCTT TGATGAGTTC
TGCGCACGAG AAGACCTGCC CGAGTTCGAC TTCATTGCCC TGCACGGCAT CTGGAGCTGG
ATCTCAGACG AGAACCGCGC CGTGATTGTG GACTTCTTAC GCCGCAAGCT GAAGGTCGGC
GGTGTTCTCT ACATCAGCTA CAACACCCTA CCGGGATGGG CGGGGTTTGC TCCTATGCGC
CATCTCATGA CGGAGCATGC CGCGCGCTTT GGCACCAATG GTCAGGGGAT CGTAAGCCAG
ATCAACGGTG CGATAGAGTT TGGCGCTCAT ATGTTCGACC TTGACGCTAG ATACACCAAG
GCAGTTGTTG GAGCCAAAGA GCGATTTGAC AAGCTCAAAG AGCAAGATCG ACATTATCTT
GCCCATGAGT ACTTTAACCA AGATTGGTTG CCGATGTACT TCGCAGACAT GGCTCGGTGG
TTGGAACCCG CCAAGCTGGA GTTTGCCTGT TCTGCCAACT ACCTTGATGC CGTTGATCTG
ATCAACCTGA CACAAGAGCA GCAGGACCAT CTAGCGGGCA TCCCTGACCC GCTATTTCGA
CAGACTGTGC GTGACTTTTT GGTTAATCAA CAGTTTCGGA AGGACTACTG GGTGCGGGGC
GCGCGTCAAC TCACGGCTTT AGATCGTTCT GAAGCTCTGC GCGAATGCCG CGTCATTTTG
ACAGTTGCAG TGGAGGATGT GCCATTGGAA GTGACGGGCA CGCTATTAAA AGCCACCCTG
CAGGATGCAA TCTACCAACC CATCTTGAAA GTGTTGGGTG ATCAAAAGCC TCATAGCCTG
GGGCAGATTG AAGCGGCTGT GGCCTCAGAG GGCGTGAACT TTGCACAGCT TCTCCAAGCG
ATCACTCTGT TGATAGGTGC AGGGTCTGTC GCGCCCGCGT CCGATGCGGA AGCCTCATCT
AAGCGTAAGA AGCAGGTGCA GCGCTTGAAC ACTAAGCTTA TGCTGAAGGC CCGTAGCAGC
AATGATCTTC GGTACCTTAC GTCTCCACTG ACTGGCGGAG CTATTACGGT TCCCCGGTTC
CAACAGCTCT TTTTGCTGGC CAAGCAGAAC GGCCAGAAAA TCCCTGAGGA TTGGGCGAAG
TGGGTATGGC AGGTGTTGGC TGCGCAAGGC CAAAGCCTTG TCAAAGGAGG GAAGACGCTT
CAGACGCCGG AAGAGAACCT CGCAGAACTC ACAACGCAGG CCGAAGAGTT TGAGACGAAA
GCTCTGCCCG TGCTCAAGGC CCTTCAAATA GCGTAG
 
Protein sequence
MSDWTAGYVA DIGYTYGYYT ELNPVRIQLA FLNAGLAPPR VGAACELGFG QGLSTNIHAA 
ASVTTWHGTD FNPSQAGFAQ ELATDCGNGA QLFDQAFDEF CAREDLPEFD FIALHGIWSW
ISDENRAVIV DFLRRKLKVG GVLYISYNTL PGWAGFAPMR HLMTEHAARF GTNGQGIVSQ
INGAIEFGAH MFDLDARYTK AVVGAKERFD KLKEQDRHYL AHEYFNQDWL PMYFADMARW
LEPAKLEFAC SANYLDAVDL INLTQEQQDH LAGIPDPLFR QTVRDFLVNQ QFRKDYWVRG
ARQLTALDRS EALRECRVIL TVAVEDVPLE VTGTLLKATL QDAIYQPILK VLGDQKPHSL
GQIEAAVASE GVNFAQLLQA ITLLIGAGSV APASDAEASS KRKKQVQRLN TKLMLKARSS
NDLRYLTSPL TGGAITVPRF QQLFLLAKQN GQKIPEDWAK WVWQVLAAQG QSLVKGGKTL
QTPEENLAEL TTQAEEFETK ALPVLKALQI A