Gene EcSMS35_A0011 details

Gene Information

Locus tag: EcSMS35_A0011
Symbol: traD
ID: 6106602
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010488
Strand:
Start bp: 11124
End bp: 13295
Gene Length: 2172 bp
Protein Length: 723 aa
Translation table: 11
GC content: 53%
IMG OID: 641614758
Product: conjugal transfer protein TraD
Protein accession: YP_001739899
Protein GI: 170650889
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3505] Type IV secretory pathway, VirD4 components
TIGRFAM ID: [TIGR02759] type IV conjugative transfer system coupling protein TraD
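
The coordinate and length fields above are consistent with each other, which can be checked with a short calculation. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 11124
end_bp = 13295

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 2172 723
```

Under these assumptions the result matches the listed gene length of 2172 bp and protein length of 723 aa.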


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.547678
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTTTA ACGCAAAGGA TATGACCCAG GGCGGTCAGA TTGCGTCCAT GCGTATCCGC 
ATGTTCAGCC AGATCGCCAA TATCATGCTT TACTGCCTGT TTATTTTTTT CTGGATACTC
GTTGGTCTGG TTTTATGGGT AAAAATAAGC TGGCAGACGT TTGTGAACGG CTGTATTTAC
TGGTGGTGTA CCACGCTGGA AGGCATGCGG GATTTAATCA AGTCCCAGCC GGTATATGAG
ATCCAGTATT ACGGCAAAAC CTTCCGGATG AACGCTGCTC AGGTACTGCA TGATAAATAT
ATGATCTGGT GCGGAGAGCA GCTATGGTCC GCATTCGTTC TGGCCACAGT TGTGGCACTG
GTTATTTGCC TGATCACCTT CTTTGTTGTC TCCTGGATTC TGGGGCGTCA GGGTAAACAA
CAGAGCGAAA ATGAAGTCAC AGGTGGTCGT CAGCTGACAG ACAATCCGAA AGACGTTGCC
CGGATGCTGA AAAAAGACGG CAAGGATTCC GATATCCGGA TTGGCGACCT GCCGATTATC
CGGGATTCTG AAATCCAGAA CTTCTGTCTG CACGGCACGG TCGGGGCCGG TAAGTCGGAA
GTTATCCGTC GTCTGGCCAA CTACGCCCGT AAGCGTGGAG ATATGGTGGT GATTTATGAC
CGTTCAGGGG AATTTGTTAA AAGTTACTAT GATCCCTCCA TCGATAAAAT CCTGAATCCG
CTGGATGCCC GCTGTGCCGC CTGGGATTTA TGGAAGGAGT GTCTGACACA GCCGGATTTT
GATAATACGG CAAATACCCT GATCCCGATG GGCACAAAAG AGGACCCGTT CTGGCAGGGT
TCAGGACGTA CCATTTTTGC GGAAGCGGCG TACCTGATGC GTAATGACCC CAACCGCAGC
TACAGCAAAC TGGTGGACAC TCTGCTTTCC ATCAAAATCG AAAAACTGCG TACCTTCCTG
CGTAATTCAC CGGCGGCCAA CCTGGTGGAA GAGAAAATCG AGAAAACGGC GATTTCCATC
CGTGCTGTGC TGACCAACTA CGTGAAAGCC ATCCGTTACC TGCAGGGGAT TGAGCATAAC
GGTGAGCCCT TCACCATCCG TGACTGGATG CGTGGTGTCC GGGAAGATCA GAAAAACGGC
TGGCTGTTTA TCTCATCGAA CGCCGACACC CATGCCTCCC TGAAGCCTGT GATCTCCATG
TGGCTGTCCA TTGCCATTCG TGGTCTGCTG GCAATGGGAG AAAACCGTAA CCGTCGTGTG
TGGTTTTTCT GTGACGAGTT ACCCACGTTA CACAAACTGC CTGACCTGGT GGAGATCCTG
CCGGAAGCCC GTAAGTTCGG TGGCTGTTAT GTGTTTGGTA TCCAGTCCTA TGCCCAGCTG
GAAGATATCT ACGGTGAGAA AGCGGCTGCC ACGCTGTTTG ACGTCATGAA CACCCGTGCC
TTTTTCCGTT CTCCCAGCCA TAAGATTGCA GAGTTCGCTG CAGGTGAGAT TGGTGAGAAA
GAGCACCTGA AAGCCAGCGA GCAGTATTCC TACGGTGCTG ATCCGGTACG TGACGGGGTA
TCGACCGGTA AGGATATGGA GCGCCAGACG CTGGTCAGTT ATTCCGACAT TCAGTCTCTG
CCGGATCTGA CCTGTTATGT CACCCTGCCC GGACCGTATC CGGCAGTAAA ACTCTCTCTG
AAATATCAGA CACGACCGAA GGTCGCTCCG GAGTTTATTC CGCGTGACAT CAACCCGGAA
ATGGAGAACC GCCTGAGTGC CGTACTTGCC GCAAGGGAAG CAGAAGGTCG TCAGATGGCC
AGCCTCTTCG AACCGGATGT CCCGGAGGTT GTTTCCGGAG AAGACGTGAC TCAGGCTGAA
CAGCCGCAAC AGCCGCAACA GCCGCAACAG CCGGTGTCTC CTGCCATCAA CGATAAGAAG
TCAGATTCAG GTGTGAATGT TCCGGCAGGG GGGATCGAGC AGGAGCTGAA AATGAAACCG
GAAGAAGAGA TGGAACAGCA ACTGCCACCC GGGATCAGTG AATCCGGTGA AGTGGTGGAT
ATGGCCGCTT ATGAGGCATG GCAACAGGAA AATCATCCGG ACACCCAGCA GCAGATGCAG
CGTCGTGAAG AGGTGAACAT TAATGTGCAC CGGGAGCGCG GGGAGGATGT TGAGCCGGGA
GATGATTTCT GA
 
Protein sequence
MSFNAKDMTQ GGQIASMRIR MFSQIANIML YCLFIFFWIL VGLVLWVKIS WQTFVNGCIY 
WWCTTLEGMR DLIKSQPVYE IQYYGKTFRM NAAQVLHDKY MIWCGEQLWS AFVLATVVAL
VICLITFFVV SWILGRQGKQ QSENEVTGGR QLTDNPKDVA RMLKKDGKDS DIRIGDLPII
RDSEIQNFCL HGTVGAGKSE VIRRLANYAR KRGDMVVIYD RSGEFVKSYY DPSIDKILNP
LDARCAAWDL WKECLTQPDF DNTANTLIPM GTKEDPFWQG SGRTIFAEAA YLMRNDPNRS
YSKLVDTLLS IKIEKLRTFL RNSPAANLVE EKIEKTAISI RAVLTNYVKA IRYLQGIEHN
GEPFTIRDWM RGVREDQKNG WLFISSNADT HASLKPVISM WLSIAIRGLL AMGENRNRRV
WFFCDELPTL HKLPDLVEIL PEARKFGGCY VFGIQSYAQL EDIYGEKAAA TLFDVMNTRA
FFRSPSHKIA EFAAGEIGEK EHLKASEQYS YGADPVRDGV STGKDMERQT LVSYSDIQSL
PDLTCYVTLP GPYPAVKLSL KYQTRPKVAP EFIPRDINPE MENRLSAVLA AREAEGRQMA
SLFEPDVPEV VSGEDVTQAE QPQQPQQPQQ PVSPAINDKK SDSGVNVPAG GIEQELKMKP
EEEMEQQLPP GISESGEVVD MAAYEAWQQE NHPDTQQQMQ RREEVNINVH RERGEDVEPG
DDF
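
Both sequences above are displayed in space-separated groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done and how a GC percentage might then be computed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sample is only the first line of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence above, used as a small sample.
sample = "ATGAGTTTTA ACGCAAAGGA TATGACCCAG GGCGGTCAGA TTGCGTCCAT GCGTATCCGC"
dna = clean_sequence(sample)

print(len(dna))                   # 60
print(round(gc_percent(dna), 1))  # GC percentage of the sample only
```

Applying the same two functions to the full gene sequence would allow a comparison with the GC content listed in the Gene Information section.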