Gene EcSMS35_2048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2048 
SymbolflgI 
ID6143125 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2069960 
End bp2071057 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content54% 
IMG OID641616924 
Productflagellar basal body P-ring protein 
Protein accessionYP_001744100 
Protein GI170681975 
COG category[N] Cell motility 
COG ID[COG1706] Flagellar basal-body P-ring protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.159552 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.112526 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTAAAT TTCTCTCTGC ATTAATTCTT CTACTGGTTA CGACGGCGGT TCAGGCTGAG 
CGTATTCGCG ATCTCACCAG TGTTCAGGGG GTAAGGCAAA ACTCACTGAT TGGCTATGGC
CTGGTGGTGG GGCTGGATGG CACCGGTGAC CAGACAACTC AGACGCCGTT TACCACACAA
ACGCTTAATA ACATGCTCTC ACAGCTGGGA ATTACCGTTC CGACGGGCAC CAATATGCAG
CTAAAAAACG TCGCTGCGGT AATGGTGACG GCGTCACTTC CACCGTTTGG ACGCCAGGGG
CAAACCATTG ACGTGGTGGT TTCTTCCATG GGAAATGCCA AAAGTCTGCG TGGCGGTACG
TTATTGATGA CACCGCTTAA GGGCGTTGAC AGTCAGGTGT ATGCGCTGGC ACAGGGCAAT
ATTCTGGTTG GCGGCGCAGG GGCCTCCGCT GGCGGTAGCA GTGTTCAGGT GAACCAACTG
AACGGTGGAC GGATCACCAA TGGTGCGGTT ATTGAACGTG AATTGCCCAG CCAGTTTGGC
GTCGGGAATA CCCTTAATTT GCAACTTAAC GACGAAGATT TCAGTATGGC GCAGCAAATC
GCTGACACCA TCAACCGCGT GCGTGGATAT GGCAGCGCCA CCGCGTTAGA TGCGCGGACT
ATTCAGGTGC GCGTACCGAG TGGCAACAGT TCCCAGGTCC GTTTCCTTGC CGATATCCAG
AATATGCAGG TTAATGTTAC CCCGCAGGAC GCTAAAGTAG TGATTAACTC GCGCACTGGT
TCGGTGGTGA TGAATCGCGA AGTGACCCTC GACAGCTGCG CGGTAGCGCA GGGGAATCTC
TCGGTTACGG TAAATCGGCA GGCCAATGTC AGCCAGCCAG ATACACCGTT TGGTGGCGGG
CAGACTGTGG TTACTCCACA AACGCAGATC GATTTACGCC AGAGCGGCGG TTCGCTGCAA
AGCGTACGTT CCAGCGCCAG CCTCAATAAC GTGGTGCGTG CGCTCAATGC GCTGGGCGCT
ACGCCGATGG ATCTGATGTC CATACTGCAA TCAATGCAAA GTGCGGGATG TCTGCGGGCA
AAACTGGAAA TCATCTGA
 
Protein sequence
MIKFLSALIL LLVTTAVQAE RIRDLTSVQG VRQNSLIGYG LVVGLDGTGD QTTQTPFTTQ 
TLNNMLSQLG ITVPTGTNMQ LKNVAAVMVT ASLPPFGRQG QTIDVVVSSM GNAKSLRGGT
LLMTPLKGVD SQVYALAQGN ILVGGAGASA GGSSVQVNQL NGGRITNGAV IERELPSQFG
VGNTLNLQLN DEDFSMAQQI ADTINRVRGY GSATALDART IQVRVPSGNS SQVRFLADIQ
NMQVNVTPQD AKVVINSRTG SVVMNREVTL DSCAVAQGNL SVTVNRQANV SQPDTPFGGG
QTVVTPQTQI DLRQSGGSLQ SVRSSASLNN VVRALNALGA TPMDLMSILQ SMQSAGCLRA
KLEII