Gene ECH74115_4918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4918 
Symbol 
ID6969036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4556193 
End bp4557248 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content48% 
IMG OID643388603 
Productprotein LpfD 
Protein accessionYP_002273030 
Protein GI209397444 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3539] P pilus assembly protein, pilin FimA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCGG CAATAGCATT ATCCTTATTA GGATGCGTAT TCGGATTTTC TGGCAAGGCA 
TTTGCCGGAG ATGCGTGGGG GCCATGTACC CCCGCGGATG GCACGACTTA TCACTATAAT
GTCGATGTGG ATGTTGGTAT ACCTGATGCG GCAAAAAACG TCGCAGGAAC GGTTTTACCG
GATGTCCTTA ACTGGTCTAA TGGACAAAAC GTATCGCTAA TTTGTGAATG TCCTGATTCT
TATAAGAATG AAAAAGATAC TCTAGTGCAG GGGGTGAGTA TGCTTCCCCC TTCTGGCCGT
ACGGTTGACA GTATGAAATA TTATACTTTG ACGGAGGAAC TGGAGGTTGC GACGAACATT
CGCATCAGTA CCAGTGTGTA TGGGTTTGTT CCTTTCAAAA ACCAGCAGGC GCTGCAAACA
ACCGGATGCA ACAAAGTCAT TACTACGCCC TATATGGGCG GCGCAGGTCT ACTTTCCTTT
GCTATTACTA AACCCTTTAT CGGCGATTCC GTCATCCCTC TGACGTTAAT TGCCGAACTG
TATGCCTCGA AAACAAATAA AGATTACGGA ACAATACCCA TATCTTCCGT ATCTATTCAG
GGGCGTGTCA CCGTAACCCA GGATTGCGAA ATTAAACCCG GCACGGTGCT GGATGTGCCA
TTTGGTGAAT TCCCCTCCTC GGCGTTTAAA AACAGGCAAG GGCAAATGCC TGAAGGTGCG
ACAGAGCAAG AAATTAACCT CAGCTTCGAT TGCAACAATA TCTCTGATGG TATCAAGGTT
GCGTTGCGCC TTGAGGGGGC AACCAATGCT GACGATCCCC GAGCGGTGGA TATGGGCAAC
CCCGATATTG GCGTACTGGT GAAAGATTCC AGCGGCAAAA TTTTGGTACC GAATGATTCA
AGCAGCACCA CGTTATTAAA TTTGAGCAGC CTGGATTCTA AAACTCACCG CAATGCCGCG
ATTCGACTGC TGGCTTTACC GATTAGCACG ACGGGTAAAG CGCCTAAAGG CGGGACGTTT
GAAGGCGTAA CGACCATATA TCTTGAAATG GAGTGA
 
Protein sequence
MKAAIALSLL GCVFGFSGKA FAGDAWGPCT PADGTTYHYN VDVDVGIPDA AKNVAGTVLP 
DVLNWSNGQN VSLICECPDS YKNEKDTLVQ GVSMLPPSGR TVDSMKYYTL TEELEVATNI
RISTSVYGFV PFKNQQALQT TGCNKVITTP YMGGAGLLSF AITKPFIGDS VIPLTLIAEL
YASKTNKDYG TIPISSVSIQ GRVTVTQDCE IKPGTVLDVP FGEFPSSAFK NRQGQMPEGA
TEQEINLSFD CNNISDGIKV ALRLEGATNA DDPRAVDMGN PDIGVLVKDS SGKILVPNDS
SSTTLLNLSS LDSKTHRNAA IRLLALPIST TGKAPKGGTF EGVTTIYLEM E