Gene TM1040_2784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2784 
Symbol 
ID4076552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2944029 
End bp2945063 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content58% 
IMG OID638008109 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_614778 
Protein GI99082624 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.968242 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACAC GTCGTAATTT TCTAAAGACC ACCGCCTTGG GCGCTGCCGC TGCACCGCTT 
GCAGCGCCTG CGCTGGCGTC TGGTAAGATC ACATGGCGGA TGCAGACCTA CGCCGGTCCC
GCGCTTGCAG CGCATGTGAT CGACCCGGCG ATTGAAATGT TCAACAAGAT CGCAGGCGAC
CGCATGCAGA TCGAGCTTTT CTACGCCGAC CAGCTGGTCC CCACGGGTGA GCTGTTCCGT
GCCATGCAGA AAGGCACCAT CGACGCGGTA CAGTCTGATG ACGATTCCAT GGCGTCTCCG
ACAGAAGTGA CCGTTTTTGG CGGCTATTTC CCCTTTGCGT CGCGCTACTC GCTCGACGTG
CCGGTGCTGT TCAACCAGTA CGGCCTCAAC GAGATCTGGG ATGCGGAATA CTCCAAGGTG
GGCGTCAAGC ACATCTCCGC AGGCGCTTGG GATCCTTGCC ACTTTGCCAC CAAAGATCCG
ATCAACTCGC TTGAGGATCT CAAGGGCAAG CGCGTCTTCA CCTTCCCGAC TGCGGGCCGC
TTCCTGAGCC AGTTCGGCGT CGTGCCTGTC ACCCTGCCGT GGGAAGACAT CGAAGTTGCA
ATGCAGACCG GCGAGTTGGA TGGCGTTGCT TGGTCGGGCA TTACCGAAGA TTACACCGTG
GGTTGGGCCG ATGTGACCAA CTACTTCCTG ACCAACAACA TTTCCGGTGC ATGGGCAGGC
AGCTTCTTTG CCAACATGGA CCGTTGGAAC GAGCTGCCCG AAGATCTGCA GGCGCTGTTC
CGTGTCTGCA CCGACCAGTC GCATTACTAT CGCCAGTGGT GGTACTGGGG TGGCGAAGCC
TCCTTGCGCG TCAATGGCGA CAAGATGAAG CTGACCTCGA TCCCCGATGC AGAATGGCAG
CAGGTCGAAG ATGCCGCGGT AAAGTTCTGG GACGAGATCG CAGCCGAATC CGAAACCAAG
GCGAAGGTTG TCGAGATCTT CAAGAAGTAC AACGCCGATA TGGCGAAAGC CGGTCGTCCG
TATCGTTACG GCTGA
 
Protein sequence
MTTRRNFLKT TALGAAAAPL AAPALASGKI TWRMQTYAGP ALAAHVIDPA IEMFNKIAGD 
RMQIELFYAD QLVPTGELFR AMQKGTIDAV QSDDDSMASP TEVTVFGGYF PFASRYSLDV
PVLFNQYGLN EIWDAEYSKV GVKHISAGAW DPCHFATKDP INSLEDLKGK RVFTFPTAGR
FLSQFGVVPV TLPWEDIEVA MQTGELDGVA WSGITEDYTV GWADVTNYFL TNNISGAWAG
SFFANMDRWN ELPEDLQALF RVCTDQSHYY RQWWYWGGEA SLRVNGDKMK LTSIPDAEWQ
QVEDAAVKFW DEIAAESETK AKVVEIFKKY NADMAKAGRP YRYG