Gene ECH74115_B0003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_B0003 
SymbolgspD 
ID6966391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011350 
Strand
Start bp82210 
End bp84177 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content50% 
IMG OID643384019 
Productgeneral secretion pathway protein D 
Protein accessionYP_002268498 
Protein GI209395621 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1450] Type II secretory pathway, component PulD 
TIGRFAM ID[TIGR02517] general secretion pathway protein D 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0438473 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTTCAGGA AATGGTTGAA TGGGCGTTTG CCGGTACTTG TGTTCACTAC AGTAATTTTG 
GGGGCCATTC CAGGGTGGGG GGCTGAATTT TCGGCCAACT TTAAAGATAC GGATATTCAG
GAGTTCATAA ATACTGTCAG TAAAAATTTA CACAAAACGG TAATAATTAA TCCTGACGTG
CAGGGAACCA TCACTGTACG CAGCTACGAT ATGCTGAACG AGGAACAATA TTATCAGTTC
TTTCTCAGTG TGCTGGACGT TTATGGTTTT GCTGTGGTCG ATATGCACAA CGGTATACTG
AAAGTAGTGC GCTCAAAAGA TGCCAAAACG TCGGCGGTGC CGGTAGCTAG TGATGTCAGT
CCCGGGACTG GTGATGAGGT TGTTACCCGG GTGGTCCCCG TAAGTAACGT GGCAGCCAGA
GATCTGGCGC CTTTGCTGCG TCAGCTCAAT GATAATGCTG GCGCAGGAAG CGTGGTGCAT
TATGAACCTT CTAATGTTTT GTTGATGACC GGACGTGCTG CAGTGATGAA ACGGTTGATG
GAGATTGTTG AACGTGTGGA TAAGGTGGGT AATCGCAGCG TTGCCACGGT CCCGCTCACC
TACGCGTCCG CAACAGACGT AGCCAGACTT GTTACGGAAC TGACTAAAGA AACAGATAAG
ACAGCTATAC CTGCTTGGAT GACGGCGAAA CTGGTTGCAG ACGAGAGGAC AAACTCAGTG
CTCGTCAGCG GAGAGCCAAT CTCCCAACAG CGTATCATCT CCATAATTAA GCAACTGGAT
CGTCAGGAGG ATGTTCAGGG TAATACTAAG GTGATTTACC TGAAATATGC GAAGGCGAAG
GATTTAGTGG AAGTCCTGAC AGGTATCAGC AGCAGTATTG AAAACGACTC TAAAAAGAGT
CCGTCAACGG AAGCCTTGCG CAAAGGAGTG ACGATTAAAT CCCACGAACA AACCAATGCC
CTGATCCTGA CGGGGGCCCC TGACGTCATC CGCGACCTTG AAAATGTGAT TTCGCAGTTG
GATATTCGTC GTCCTCAGGT CCTGGTGGAG GCCATCATTG CTGAAATACA GGATGCTGAC
GGGCTGAACC TTGGGATCCA GTGGGTGAAT AAACATGCCG GTGTGGCGCA GTTCACCAGT
ACCGGTTTAC CTATTACCAC GATGGTTCAG ACTCGTCAGA ACGAAATCTT AGACAGCGAT
CAGAGCAATG CCCTGAGCAT GTTTAACGGA ATTGCAGCGG GGTTTTATCA GGGAAACTGG
GCGATGCTGT TGACGGCGCT CTCCACAAGT AGCAAGAATG ATATCTTGGC GACCCCCAGT
ATTGTCACGC TGGACAATAT GGAGGCCACT TTCAATGTTG GTCAGGAGGT CCCGGTACTT
TCGGGCTCAC AGACAACCTC TGGGGACAAT ATTTTTAACA CGGTCGAGCG CAAAACGGTG
GGGATCAAAC TCAGGGTAAA ACCCCAGATC AACGAGGGTG ATTCCGTGTT ACTGGAGATA
GAACAGGAGG TGTCCGGTGT GGCGGACACT GCAGTAGCCA CCACTACTGA CTTGGGAGCA
ACCTTCAACA CCCGAACAGT GACCAATGCC ATGCTGGTCG GGAATGGCGA AACGGTGGTG
GTCGGAGGAT TACTGGATAA GTCGATCAGG GGGAGTGAGA GTAAAGTGCC ACTGCTGGGG
GATATCCCGG TACTGGGGCA TCTTTTTCGC GCAAAAAGCG AACAGACAGC TAAGCGTAAT
CTGATGCTGT TCATTCGGCC AACTATTATT CGTGAGCGCG ACGGATTTCG TCATGCTTCG
GCCGAAAAAT ACCAGTCGTT TAATCAGGAA CAGGTGCAGT CGCGTGGCAA AGAAACAACG
GCGCTGACGC TGAATGAGGA ACAGCTCAGG CTGTCCCCCG ATCAAGACGA TACGGCTTTC
CGGAAGGTGA AAGCGGCGAT TGCTGCGTTT TATGCGCAGG AGATGTAA
 
Protein sequence
MFRKWLNGRL PVLVFTTVIL GAIPGWGAEF SANFKDTDIQ EFINTVSKNL HKTVIINPDV 
QGTITVRSYD MLNEEQYYQF FLSVLDVYGF AVVDMHNGIL KVVRSKDAKT SAVPVASDVS
PGTGDEVVTR VVPVSNVAAR DLAPLLRQLN DNAGAGSVVH YEPSNVLLMT GRAAVMKRLM
EIVERVDKVG NRSVATVPLT YASATDVARL VTELTKETDK TAIPAWMTAK LVADERTNSV
LVSGEPISQQ RIISIIKQLD RQEDVQGNTK VIYLKYAKAK DLVEVLTGIS SSIENDSKKS
PSTEALRKGV TIKSHEQTNA LILTGAPDVI RDLENVISQL DIRRPQVLVE AIIAEIQDAD
GLNLGIQWVN KHAGVAQFTS TGLPITTMVQ TRQNEILDSD QSNALSMFNG IAAGFYQGNW
AMLLTALSTS SKNDILATPS IVTLDNMEAT FNVGQEVPVL SGSQTTSGDN IFNTVERKTV
GIKLRVKPQI NEGDSVLLEI EQEVSGVADT AVATTTDLGA TFNTRTVTNA MLVGNGETVV
VGGLLDKSIR GSESKVPLLG DIPVLGHLFR AKSEQTAKRN LMLFIRPTII RERDGFRHAS
AEKYQSFNQE QVQSRGKETT ALTLNEEQLR LSPDQDDTAF RKVKAAIAAF YAQEM