Gene EcSMS35_0272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0272 
SymbollfgK 
ID6143853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp282527 
End bp283903 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content58% 
IMG OID641615170 
Productlateral flagellar hook associated protein 1 
Protein accessionYP_001742379 
Protein GI170680262 
COG category[N] Cell motility 
COG ID[COG1256] Flagellar hook-associated protein 
TIGRFAM ID[TIGR02492] flagellar hook-associated protein FlgK 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.508781 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATGA TTAACATCGG CTACAGCGGC GCATCAACCG CGCAGGTAGA GCTGAACGTC 
ACGGCGCAAA ACACCGCCAA CGCCATGACC ACGGGCTACA CCCGTCAGGT GGCGGAGATC
AGCACCATCG GCGCCAGCGG TGGTTCGCCG AACAGCGCCG GTAACGGCGT GCAGGTCGAC
AGCATTCGCC GCGTCTCTAA CCAGTATCAG GTGAATCAGG TGTGGTATGC CGCCAGCGAT
TACGGCTATT ACAGCACCCA GCAGGGGTAT CTCACGCAAC TGGAAGCGGT ACTGAGCGAC
GATAACAGCA GCCTGAGCGG CGGCTTCGAT AACTTCTTCG CCGCCCTCAA CGAAGCGACC
ACCAGCCCCG ATGATTCCGC CCTGCGCGAA CAGGTGATCA GCGAAGCCGG GGCGCTGTCG
TTGCGTATCG ACAACACGCT GGATTACATC GACTCGCAAA GCACGGAAAT CATCAGCCAG
CAGCAGGCAA TGGTGTCGCA AATCAACACG CTTACCAGCG GCATCGCCAG CTATAACCAG
CAAATCGCCC AGGCCGAAGC CAACGGCGAT AACGCCTCCG CGCTGTACGA CGCCCGCGAT
CAGATGGTGG AAGAACTGAG CGGGATGATG GATGTGCAGG TCAATATCGA CGACCAGGGC
AACTACAACG TCACCCTGAA AAACGGTCAA CCGCTGGTGA GCGGGCAGCA AAGCTCGACC
ATCGCGCTGG AAACCAACGC CGATGGCACG CCGACCATGT CGCTGACTTT CGCTGGCACC
ACCTCGACGA TGACTACCGA CACCGGCGGT TCATTAGGCG CACTGTTTGA TTATCAAAAC
GACGTGCTGA CGCCGCTGAC CGACACCATC AACAGCATGG CGTTGCAGTT TGCCGATGCG
GTCAACAACC AGCTGGCGCA GGGCTACGAT CTCAACGGTA ACCCCGGCGA GCCGCTGTTT
ATTTATGACG CCAGCAACGC CGATGGCCCG CTGACCGTTA ACCCGGATAT CACCGCCGAT
GAGCTGGCGT TCTCCAGTTC GCCGGATGAA AGCGGTAACA GCGACAACCT TCAGGCGCTG
ATCAACATCT CCACCGAACC GCTGGAGATA GCCAACCTTG GCAGCGTGAC GGTCGGGCAG
GCGTGCTCGT CGATCATCAG CAACATCGGC ATTTACAGCC AGCAAAACCA GACGGAAGTC
GATGCCGCGT CCAATGTTTA TTCTGAGGCG CAAAACCAGC AGAGCAGCGT CAGCGGCGTC
AGCATGGACG AAGAAGCGGT GAACCTGATC ACCTATCAAC AAATTTATGA AGCTAATCTG
AAAGTCATTT CCGCCGGGGC CGAGATTTTC GATTCGGTGC TGGAAATGTG CAGCTAA
 
Protein sequence
MDMINIGYSG ASTAQVELNV TAQNTANAMT TGYTRQVAEI STIGASGGSP NSAGNGVQVD 
SIRRVSNQYQ VNQVWYAASD YGYYSTQQGY LTQLEAVLSD DNSSLSGGFD NFFAALNEAT
TSPDDSALRE QVISEAGALS LRIDNTLDYI DSQSTEIISQ QQAMVSQINT LTSGIASYNQ
QIAQAEANGD NASALYDARD QMVEELSGMM DVQVNIDDQG NYNVTLKNGQ PLVSGQQSST
IALETNADGT PTMSLTFAGT TSTMTTDTGG SLGALFDYQN DVLTPLTDTI NSMALQFADA
VNNQLAQGYD LNGNPGEPLF IYDASNADGP LTVNPDITAD ELAFSSSPDE SGNSDNLQAL
INISTEPLEI ANLGSVTVGQ ACSSIISNIG IYSQQNQTEV DAASNVYSEA QNQQSSVSGV
SMDEEAVNLI TYQQIYEANL KVISAGAEIF DSVLEMCS