Gene TM1040_3421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3421 
Symbol 
ID4075595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp444126 
End bp445271 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content59% 
IMG OID638004930 
Productsecretion protein HlyD 
Protein accessionYP_611655 
Protein GI99078397 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0845] Membrane-fusion protein 
TIGRFAM ID[TIGR01730] RND family efflux transporter, MFP subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.159892 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.759602 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCTT CTATTGATCG CAGAACGGAT CAATTTTGCG CGTGGATCCG CCTTAACTGT 
CTGCTTATAT GCGCGTTGAC TGCGCTTCCT TTGTCTGCCG CAGCACAAGA CCAAAATGCC
GCGCCGCCAC CGCCTGCCGT CACCGTTGCA ATCATTGAGG AGCGAAACTT TCAAGAAGCC
GAAACTTTTT CGGGTCGCAT CGAGGCCATT CAATCCGTCG ATCTGATCGC GCGCGTGCAA
GGTTATCTTA GTGCGCGGCA CTTTGAAGAA GGGGCATTTG TCGAGAAAGG GCAGCCTCTC
TATACGCTCG ATCAGGACAT CTATCGCAAC ACGGTGCATC AGGCAGAGGC GGCACTAGCC
GTGGCACAAG CCACCGAAAC CCTGGCGCAG CAGAAGTTTG ATCGCCAAGA GGAACTGACC
CGGCGGGACG TGCAATCTCG GGCGCTCCTT GAGGAAGCTC AGGCCAATCT TGCCGTCAGC
CAGGCGAATG TTGCCGCCGC CCAGTCTCAG GTAGAAGCAG CGAGGATCAA CCTCGCCTAT
ACGGAGATCA GCGCGCCTAT TTCGGGGCTC ATCGGGCGAT CTGCAGTTGC CACAGGAGAT
CTGATCAGCC CACAATCCGG CCCGATGGCG ACCCTCGTGC AGTTTGATCC GATCTACGCG
AGCTTTCCGG TGCCTCAGCG CAGCATGATC GATTTTCGCA AACGGGGCGC GCGTAACGAG
GACGTGTTTG TCTCGCTCAC CTTGGCAGAT GGCTCTGTTT ATCCGCATCA CGGCGTGATC
ACCTTCACCG ATGTGAGCGC GGCCTCTTCC AGCGATGCGG TCATCGTCCG CGCGACGGTT
CCAAATCCGG ACAACCTCCT GATCAACAAC GGCCTTGTGG ATGTGCATCT GGTGGCCAAC
GCCGACAGCC GCGCGCTTGC CCTGCCAGCG CAGGCACTCT TGCTGGATCA GCAGGGAGCA
TATGTGCTGG TGGTTGACGG TGACGACAGA GTGCAGGCCC AACGGGTCGA AGTGGGCACC
CAGCGGGCCG GGTACCTGGA GGTCAAAGAC GGGCTGGAGG CAGGTGCGCG GGTCATTGTT
GAGGGCATCC AGAAGGCACG TCCGGGCAAC AGGGTTACCG TCTCGCTTGT GAACACAGAC
AACTAG
 
Protein sequence
MASSIDRRTD QFCAWIRLNC LLICALTALP LSAAAQDQNA APPPPAVTVA IIEERNFQEA 
ETFSGRIEAI QSVDLIARVQ GYLSARHFEE GAFVEKGQPL YTLDQDIYRN TVHQAEAALA
VAQATETLAQ QKFDRQEELT RRDVQSRALL EEAQANLAVS QANVAAAQSQ VEAARINLAY
TEISAPISGL IGRSAVATGD LISPQSGPMA TLVQFDPIYA SFPVPQRSMI DFRKRGARNE
DVFVSLTLAD GSVYPHHGVI TFTDVSAASS SDAVIVRATV PNPDNLLINN GLVDVHLVAN
ADSRALALPA QALLLDQQGA YVLVVDGDDR VQAQRVEVGT QRAGYLEVKD GLEAGARVIV
EGIQKARPGN RVTVSLVNTD N