Gene TM1040_3834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3834 
Symbol 
ID4074984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008042 
Strand
Start bp79812 
End bp81131 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content49% 
IMG OID638004492 
ProductType I secretion membrane fusion protein, HlyD 
Protein accessionYP_611227 
Protein GI99077968 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID[TIGR01843] type I secretion membrane fusion protein, HlyD family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.0519478 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0754142 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAACC TCCAAATGCG TGACAAAATT TCAAGATACT CGAGTCCGCA ACGCTTTAGC 
AGAATTGGCT ATGCAATCGT TATCCTGACC TTCGGGGTTC TTGGAGGTTG GGCGGCTTAC
GCACAAATTG ACTCAGCGAC TGTTGCACCG GGTATCGTCG AGCTTGAAGG TAATCGTAAG
GTGGTTCAAC ACCTTGAAGG TGGGATCATT CAGACGATCC ATGTCAAAGA AGCTGAAAGT
GTTTCCCAAG GCGATGTTCT CGTTACTCTC GAAAGTGTAG AGGCTCGCTC AAATGTTGAA
CGGTTCCGAA ACCGTTTAGA AGAAGCCATG GCAATTGAGG CCCGGCTCTT AGCCGAGCAA
GCACTTAGCG AAGAAATCGA TTACCCTGAG AGCCTAACGT CAAATCCCTC TCCGGCACTC
AAGAATGTAC TGGATCTTCA GAGCACGATT CTTGCAGATC GTTTGGCTAT CTTTCGCTCT
GAGCAGGAAA TCCTGCAGTT CCGGATTGAG CAGCTAGAAG GCCAAAAGGC AGGTCTTGCT
CTTCAAAAAG ACGCATATGA GCGACGATTG AAACTGCAGA GCGAGCTTGT TGAACGTTTG
ACCCGCGGTG CGGAGAGCGG TGTCATTGAA AACAACGTTC TCACTGGGCG AAAAGATGCA
TTGATTCAAG TCGAGGCCTC CCTAGGGGAA GCAATATCAG ATGAAGCTCA AGTGGGGGTT
GCTATTTCAG AGGCACGCCT CAATCGGCTT AAGTTGTCGC AGGAGTTCAA AGAAAGGGCC
AACCGAGAGC TTCGAGATAT TCAGACCGAC CTCAAGGAAA TACGTGAAAA TCTCACGGTA
GCCCAAGATA TCTATTATCG TACCGAGATC CGGGCGCCAA GTGATGGAGT GGTACAAGAT
ATCCGTGTGA CAACTGAAGG GTCTGTTATT CGACCAGGCG AAATTCTGAT GGAGATTGTT
CCACCTGATG ACAATTTACT TATCGCAAGC CGTGTATCGC CGCTGGACAT TGACAATGTT
GTGCCAGGTC AAGAGAGCGA AGTACGATTC TCTGCCTTTA AGGCGAAACT GACACCTGTT
GTGTTGGGCT ATGTGGAAAG TGTCTCTCAA GATATCATCA CGCCAGAGCG TAGTGATGAG
GAACCCTATT ATTTGGCTCG AGTTCGTGTT CCAGAGGAGA ATATGAGCGA AGAGATGCGA
CAAGGCCTGA CAGCCGGAAT GCCTGCCGAT GTTGTTATCG TAAATGGTGA AAGAAGTGTC
TTGAACTATC TTGTCTCCCC TCTGACAGAC GCGATCGCTT TGTCGTTGAA AGAAGAATGA
 
Protein sequence
MDNLQMRDKI SRYSSPQRFS RIGYAIVILT FGVLGGWAAY AQIDSATVAP GIVELEGNRK 
VVQHLEGGII QTIHVKEAES VSQGDVLVTL ESVEARSNVE RFRNRLEEAM AIEARLLAEQ
ALSEEIDYPE SLTSNPSPAL KNVLDLQSTI LADRLAIFRS EQEILQFRIE QLEGQKAGLA
LQKDAYERRL KLQSELVERL TRGAESGVIE NNVLTGRKDA LIQVEASLGE AISDEAQVGV
AISEARLNRL KLSQEFKERA NRELRDIQTD LKEIRENLTV AQDIYYRTEI RAPSDGVVQD
IRVTTEGSVI RPGEILMEIV PPDDNLLIAS RVSPLDIDNV VPGQESEVRF SAFKAKLTPV
VLGYVESVSQ DIITPERSDE EPYYLARVRV PEENMSEEMR QGLTAGMPAD VVIVNGERSV
LNYLVSPLTD AIALSLKEE