Gene ECH74115_1459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1459 
SymbolflgI 
ID6969444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1438967 
End bp1440064 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content54% 
IMG OID643385432 
Productflagellar basal body P-ring protein 
Protein accessionYP_002269926 
Protein GI209398849 
COG category[N] Cell motility 
COG ID[COG1706] Flagellar basal-body P-ring protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.31008 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.000149159 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGATTAAAT TTCTCTCTGC ATTAATTCTT CTACTGGTTA CGACGGCGGC TCAGGCTGAG 
CGTATTCGCG ATCTCACCAG TGTTCAGGGG GTAAGGCAAA ACTCACTGAT TGGCTATGGT
CTGGTGGTGG GGCTGGATGG CACCGGTGAC CAGACAACCC AGACGCCGTT TACCACACAA
ACGCTTAATA ACATGCTCTC ACAGCTGGGA ATTACCGTTC CGACGGGCAC CAATATGCAG
CTAAAAAACG TTGCTGCGGT AATGGTGACG GCGTCACTTC CACCGTTTGG ACGTCAGGGG
CAAACCATTG ACGTGGTGGT TTCTTCCATG GGAAATGCCA AAAGCCTGCG TGGCGGCACA
TTGTTGATGA CTCCGCTTAA GGGCGTTGAC AGTCAGGTGT ATGCGCTGGC GCAGGGCAAT
ATTCTGGTTG GCGGCGCAGG AGCCTCCGCT GGCGGTAGCA GTGTTCAGGT GAACCAACTG
AACGGTGGAC GGATCACCAA TGGTGCAGTT ATTGAACGTG AATTACCCAG CCAGTTTGGC
GTCGGGAATA CCCTTAATTT GCAACTTAAC GACGAAGATT TCAGCATGGC GCAGCAAATC
GCTGACACCA TCAACCGCGT GCGTGGATAT GGCAGCGCCA CCGCGTTGGA TGCGCGGACT
ATTCAGGTGC GCGTACCGAG TGGCAACAGT TCCCAGGTCC GTTTCCTTGC CGATATCCAG
AATATGCAGG TTAATGTCAC CCCGCAGGAC GCTAAAGTAG TGATTAACTC GCGCACCGGT
TCGGTGGTGA TGAATCGCGA AGTGACTCTC GACAGCTGCG CGATAGCGCA GGGAAATCTC
TCAGTAACAG TCAATCGTCA GGCCAATGTC AGCCAACCAG ATACACCGTT TGGTGGCGGA
CAGACCGTGG TAACGCCACA AACGCAGATC GACTTACGCC AGAGCGGCGG TTCGCTGCAA
AGCGTACGTT CCAGCGCCAG CCTCAATAAC GTGGTGCGCG CGCTCAATGC GCTGGGCGCT
ACGCCGATGG ATCTGATGTC TATTTTGCAA TCAATGCAAA GTGCGGGATG TCTGCGGGCA
AAACTGGAAA TCATCTGA
 
Protein sequence
MIKFLSALIL LLVTTAAQAE RIRDLTSVQG VRQNSLIGYG LVVGLDGTGD QTTQTPFTTQ 
TLNNMLSQLG ITVPTGTNMQ LKNVAAVMVT ASLPPFGRQG QTIDVVVSSM GNAKSLRGGT
LLMTPLKGVD SQVYALAQGN ILVGGAGASA GGSSVQVNQL NGGRITNGAV IERELPSQFG
VGNTLNLQLN DEDFSMAQQI ADTINRVRGY GSATALDART IQVRVPSGNS SQVRFLADIQ
NMQVNVTPQD AKVVINSRTG SVVMNREVTL DSCAIAQGNL SVTVNRQANV SQPDTPFGGG
QTVVTPQTQI DLRQSGGSLQ SVRSSASLNN VVRALNALGA TPMDLMSILQ SMQSAGCLRA
KLEII