Gene TM1040_2402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2402 
Symbol 
ID4076728 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2535708 
End bp2540267 
Gene Length4560 bp 
Protein Length1519 aa 
Translation table11 
GC content61% 
IMG OID638007724 
Productamino acid adenylation 
Protein accessionYP_614396 
Protein GI99082242 
COG category[C] Energy production and conversion
[I] Lipid transport and metabolism
[J] Translation, ribosomal structure and biogenesis
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0223] Methionyl-tRNA formyltransferase
[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
[COG1020] Non-ribosomal peptide synthetase modules and related proteins
[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTATTT CAACCGTCGT CATTGGCAAT GAATCCCTGC TGATCGGTTG TTGCGAGGCC 
TTGCTGGCAC GCGGTCATGA TATTCGCGCC GTCGTCAGCA CAGATGCTGA GATCACCTCT
TGGGCAGCCT CCAAGGGTTT GAACACACAT TCTAAGCCGC TAGAGATTGA CATTGAATTC
GACTGGCTTT TGTCGATTGC AAACCTGCAG GTCCTGCCAG AGGCGGTGAT CTCCAAAGCG
CGACTGGGGG CGGTCAACTT TCACGATGGC CCGCTGCCAG ACCGGGCGGG TCTCAATACG
CCAAATTGGG CCATTCTCGA AGGCGCTGAA GAGCATGGCA TCACGTGGCA CCTGATCGAA
GGCGGCGTCG ACGAGGGCGA TATTCTGGCC CAGCGCCGGT TTGCGATTGC ACCAGATGAG
ACGGCCTTTA GCCTCAACTC CAAATGCTAC GGTGCTGCAC TCGACAGCTT TGGCGAAGTG
CTCGAGCAAC TCGAGAGCGG CGCACTAGAG CGCACGCCGC AAGACCTCAG CCATCGCCGC
TATCATGCAC GCAAGGATCG CCCCGATGCG GCTGGCTTCT TGCGGTTTGA CCGCAACCGC
GACACGCTTT TGTGTCTGGT GCGGGGCCTT GATATGGGCG GCTATTGGAA CCCGTTGACC
ACGGCCAAAA TCCGCTTGCC GGGACAGGTC GCTGCGGTCG GTGCAGCGCA AGCCACCGAA
GGCGAGGGTA TTCCGGGGGA AGTTCTCGCG GTGAGCGCAA ATGCTCTGAC CGTCGCCTGC
GCCGATGGTG CAATAGAACT GCGCGACCTG AAGACCCTCT CGGGAGGGGC GTTGCAGCCC
TCTGCCCTCC TCGAAGTGGG TGATGTGCTG CCCACCCTCT CCGATCAAGA GGTGGAGGCC
CTCAATGCGG CGGCGCAGAC CTGCCAAAAG GCGGAACCTT ATTGGCGCCG CTCTTTGGCT
GAGATGCTAC CCTTGCCCGT GCCACTGCAG GGCGCGGAAC TCGACGTCGA GGCCACTGCC
GACATCCAGT TGCCGACGGA TTGTGCGCCC CAGACCCTGA TGGCCGCAAT GCTGGCGTGG
CTGTTGCAGA GCCTCGGTGA GACCGCAGGC TCAATCGCGC TCTCGCTACC GGATCTACAA
ACGTCAGACG ACTTGGGAGT GCTTTCAAAC TGGGTGCCGC TGAGCGCGGA GATCGACGAA
AAGATCAAAA CTGTAGAGGC CCGCGCGGCC GAGGTTCTGG ACCGCTTGGA CAACATGGGG
GGCTTTGCAA CCGACCTCAT CCTGCGCGCG CCGGAAGTCA CGATTGAAAC ACCTCCAGAG
GTTGCGCTGT GCCTTGGGAC GAACCTCGCC CCCGCACAGG CCAAACTGGC CCTGATCCAG
TCGGATAGCC GATTGCAGCT GCGCTCTTGC CAGCTATCAC CCGAGGCGCT GTCGATGCTG
GCTGCGCGGC TTCAGTGCCT CTTGAACACG CTGCCCGAAG ACGGTGCAGC GACGCTCTCC
ACGCTGCCAA TGCTGCCAGA TTCAGAGCGG ACGCGCCTGA TTGAGACATG GAACGATACC
GGTTGTGACT ACCCCACAGA TCAAACCATT CACGCCGCTT TTGAGGCGCA GGTGGCAAGA
ACCCCAGATG CGATAGCGGT CGTTTTTGAA GATCAACAGC TGACCTATCG CGCGTTGAAC
CAACGCGCTG ACGCGCTGGC CCGACATCTG CATGACCTCG GTGCAAAACC GGGCAGCCAT
GTGGGCGTGT ATCTGCGTCG CTCCATGGAT CTGGTGATTG CAACGCTTGG TATTTTGAAG
GCCGGCGCGG CCTATGTACC GCTTGATCCT ACCTACCCTG CCGACCGTAT CGCGCATTAC
ATCACCGACA GTCAGGCCGC GGTCATTGTC ACCCATGAAA GCCTCGCTGC GGAACTCCCC
GAAAGCGATG CCAGGCAGCT GCATCTCGAC GCGCTCGACC TCACAACCGA GGCGGACCCG
ATCCGGGCAG GTGCTGCCGA AGATCTCGCG TATCTGATCT ATACCTCAGG CTCCACGGGC
TTGCCAAAAG GCGTGATGGT GCGTCATCGC AATGTGGCGA ACTTCTTTAC CGCGATGGAT
GCGCGCATCC CCCATCAGCC GGGAGACGCC TGGCTCGCGG TGACAAGCCT CTCGTTTGAT
ATTTCGGTGC TGGAGCTTTT CTGGACGCTT GCGCGCGGCT TCAAACTGGT GCTCTCCAGC
GACGAAAGCC GCCTACAGCT GGCCAACGGT CCCATCGGGC TCAGCGATCG CAAGATGGAT
TTCAATCTCT ATTACTGGGG CAATGACGAC GGCGCGGGCG AAAAGAAATA CGAATTGCTT
CTAGAGGGCG CCAAGTTTGC GGATGCCCAT GGGTTCAATG CGGTCTGGAC GCCAGAGCGG
CACTTCCACG CCTTTGGTGG CCCCTACCCG AACCCCTCCG TGACCGGTGC CGCCGTGGCA
GCGGTGACCC GCAATATCGG TGTGCGGGCC GGCTCCTGTG TGGCGCCGCT TCATCACCCC
GCGCGCATCG CCGAAGAATG GGCGGTCATC GACAATCTGA CGGGCGGGCG CGCGGGTATT
GGCTTTGCCT CGGGCTGGCA GCCGGATGAC TTCATCTTGC GCCCCGAGAA CACGCCGCCT
GCCAACAAAC CAGCGCTGTT TGAAGCCCTT CAGACAGTGC GAAAGCTCTG GCGAGGCGAA
GAAGTCGCTT TCCCGCGCGC GGATGGTGGC CAGCATTCGG TGGTCACGCA ACCACGCCCG
GTTTCAGAGG AACTTGCCGT CTGGGTCACC ACGGCAGGCA ACCCGGAGAC ATGGCGCGAG
GCTGGGCGCG TGGGGGCCAA TGTGCTTACG CATCTTCTAG GCCAGAGTAT CGACGAGGTC
GGCGACAAGA TCGCGCTATA CCATGCCGCG CTGCGCGAGG CGGGGTATGA CCCGGCGGAT
TTCACCGTGA CACTCATGTT GCACACCTAT CTCGCCGAGA CCCGCGAGGC CGCCCGCGAG
GTGGCGCGCG AGCCGATGAA GGACTACCTG CGCTCTGCGG CTGGGCTCAT TAAACAGTAT
GCCTGGGCTT TTCCAGCGTT CAAACGCCCA AAGGGCGTCG ACAATCCCTT TGAAATGGAT
CTTGGCAGTC TCAGCAGCGA GGAGCTGGAA GCCATCCTCG ATTTCGCATT TGAGCGATAC
TTCGAAGAGT CAGGTCTCTT TGGCACGGTC GACGACGCAG TCGCGCGCGT CGAGGATCTC
AAGCGCATCG GCGTCGATGA GGTCGCCTGC CTGATTGATT ATGGGATTGC ACCGCATGTG
GTGCTTGAAG GGCTCAAACC CCTTGCAGAA GTCCTGCGCC GCTCCAATCG CGTGCAAGAG
TTGGCAAATG ATGACTTCTC GCTCGCCGCG CAGATCGTGC GTCACGGGGT GACGCATATG
CAATGCACCC CCTCGATGGC GCGCATGATT GCCTCTGATG CAGATGCAAG CCCCGCGCTC
GGGCGCTTGC GACACCTTCT CGTTGGCGGC GAGGCCTTGC CCGGCGATCT TGTTGCCGCC
CTGCGCGAGC AGACCAGGGC CACGCTTCAC AATATGTATG GCCCCACTGA GACTACGATC
TGGTCCACGG TTGAAACGCT GGAACGCGCC CCAACGGGTA TTGCCCCGAT CGGAACGCCT
GTGGCCAATA ACTCTGTCTA CGTTCTCGAC GCCAAGGGGC AACTCGCCCC CGTCGGGGCG
GCAGGTGAAC TCTACATCGG CGGCGCAGGC GTGACGGCCG GGTATTGGCG GCGCGACGCG
CTCACGGCCG AACGGTTCCC CGATGATCCC TTCCTGTCAG GGCAAAAGAT GTATCGTACC
GGGGATTTGG TGCGGTGGCG CGAGGACGGC AAGCTCGATT TTCTCGGGCG CACGGATCAT
CAGGTCAAGA TCCGCGGCCA GCGGATCGAA CTGGGCGAAA TCGAAACCGC GCTGACCGCA
GTCGAGGGCG TGACCGCTGC CGTTGTGATC CCACGCAAGG TCGGCTCAGA CGAGCGGCTG
GTTGGCTATG TCACCGCATC CGGCGTTTTC TCAGAGACCG AGGCCAAGGC ACAACTGTCG
CGGCAACTCG CCGGGGCAAT GGTGCCGTCA CACATAGTGG TGCTTGAGGC TTTCCCTCTG
ACGCCCAACA AGAAAATCGA CCGCAAGGCG CTGCCCGACC CAACCCCCAG CCGACAAGAA
GCACATGTAA TCTCTGCCGA ACCTCAGTCG CCCGTACAGC AGAAGATCTC GGAGATCTGG
AAGGCGCTCC TTGGCGTGCC GGCTGTTGGC AAAAGCGATA ATTTCTTTGC GCTCGGTGGG
CATTCCCTAT TGGCGGTTCA GGCTCATCGC GACATTCGCA GCGCGCTTGA GGTCGAGCGG
TTGTCGATCA CGGATATCTT CCGCTTTCCG ACCCTTGACG GGTTGGCCTC ACACCTGGAA
CACCTGCAGA CCGGCGACAC CACGGTTGCG GAGACGTCCC CTGCCGCCCC GGCAGGGCGT
GCAGACACCA TGTCCAAACG CCGCGCAATG CGCGCCAATC GCAAAGCCCG CTCTGGATGA
 
Protein sequence
MTISTVVIGN ESLLIGCCEA LLARGHDIRA VVSTDAEITS WAASKGLNTH SKPLEIDIEF 
DWLLSIANLQ VLPEAVISKA RLGAVNFHDG PLPDRAGLNT PNWAILEGAE EHGITWHLIE
GGVDEGDILA QRRFAIAPDE TAFSLNSKCY GAALDSFGEV LEQLESGALE RTPQDLSHRR
YHARKDRPDA AGFLRFDRNR DTLLCLVRGL DMGGYWNPLT TAKIRLPGQV AAVGAAQATE
GEGIPGEVLA VSANALTVAC ADGAIELRDL KTLSGGALQP SALLEVGDVL PTLSDQEVEA
LNAAAQTCQK AEPYWRRSLA EMLPLPVPLQ GAELDVEATA DIQLPTDCAP QTLMAAMLAW
LLQSLGETAG SIALSLPDLQ TSDDLGVLSN WVPLSAEIDE KIKTVEARAA EVLDRLDNMG
GFATDLILRA PEVTIETPPE VALCLGTNLA PAQAKLALIQ SDSRLQLRSC QLSPEALSML
AARLQCLLNT LPEDGAATLS TLPMLPDSER TRLIETWNDT GCDYPTDQTI HAAFEAQVAR
TPDAIAVVFE DQQLTYRALN QRADALARHL HDLGAKPGSH VGVYLRRSMD LVIATLGILK
AGAAYVPLDP TYPADRIAHY ITDSQAAVIV THESLAAELP ESDARQLHLD ALDLTTEADP
IRAGAAEDLA YLIYTSGSTG LPKGVMVRHR NVANFFTAMD ARIPHQPGDA WLAVTSLSFD
ISVLELFWTL ARGFKLVLSS DESRLQLANG PIGLSDRKMD FNLYYWGNDD GAGEKKYELL
LEGAKFADAH GFNAVWTPER HFHAFGGPYP NPSVTGAAVA AVTRNIGVRA GSCVAPLHHP
ARIAEEWAVI DNLTGGRAGI GFASGWQPDD FILRPENTPP ANKPALFEAL QTVRKLWRGE
EVAFPRADGG QHSVVTQPRP VSEELAVWVT TAGNPETWRE AGRVGANVLT HLLGQSIDEV
GDKIALYHAA LREAGYDPAD FTVTLMLHTY LAETREAARE VAREPMKDYL RSAAGLIKQY
AWAFPAFKRP KGVDNPFEMD LGSLSSEELE AILDFAFERY FEESGLFGTV DDAVARVEDL
KRIGVDEVAC LIDYGIAPHV VLEGLKPLAE VLRRSNRVQE LANDDFSLAA QIVRHGVTHM
QCTPSMARMI ASDADASPAL GRLRHLLVGG EALPGDLVAA LREQTRATLH NMYGPTETTI
WSTVETLERA PTGIAPIGTP VANNSVYVLD AKGQLAPVGA AGELYIGGAG VTAGYWRRDA
LTAERFPDDP FLSGQKMYRT GDLVRWREDG KLDFLGRTDH QVKIRGQRIE LGEIETALTA
VEGVTAAVVI PRKVGSDERL VGYVTASGVF SETEAKAQLS RQLAGAMVPS HIVVLEAFPL
TPNKKIDRKA LPDPTPSRQE AHVISAEPQS PVQQKISEIW KALLGVPAVG KSDNFFALGG
HSLLAVQAHR DIRSALEVER LSITDIFRFP TLDGLASHLE HLQTGDTTVA ETSPAAPAGR
ADTMSKRRAM RANRKARSG