Gene EcSMS35_0277 details


Gene Information

Locus tag: EcSMS35_0277
Symbol: lafB
ID: 6142780
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 288349
End bp: 289665
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 56%
IMG OID: 641615175
Product: lateral flagellar hook associated protein 2
Protein accession: YP_001742384
Protein GI: 170680645
COG category: [N] Cell motility
COG ID: [COG1345] Flagellar capping protein
TIGRFAM ID:
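
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 288349
END_BP = 289665


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int, has_stop_codon: bool = True) -> int:
    """Length in aa, assuming an unspliced CDS made of whole codons."""
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("gene length is not a multiple of 3")
    # The stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


print(gene_length(START_BP, END_BP))                  # 1317
print(protein_length(gene_length(START_BP, END_BP)))  # 438
```

Both results agree with the listed Gene Length (1317 bp) and Protein Length (438 aa).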


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0170554
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value: 0.892607
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAACC CAAGAACCAT TGCGCAAGAA ATCGCCTATG CGGATGTTGC CACTCAGGCG 
GCAAACTTGC AGGAGAAGCA GAGCGAGCTG GATGCGGAAA GCAGCGGCCT GGACTCGCTC
AGCTCAGCGT TGAGCGATTT TCAGAGCGCC GTTGACGCGC TGAACAGCGA TACTGACGGC
CCGGTGACGT TTGCCGCCAC CAGCAATAAT GACTCGGCGA CCGTTTCCGC CAATTCCCAG
GCGCAGGCGG GAAGCTACTC ATTTTTCGTT GAGCAACTGG CACAGGGGCA GCAAACCACG
TTCAGTATGG GCGACGACGC CTTTTCCGCT ACCGGCACCT TCGAACTGAC GATGGGCGGC
AGCACCATGG ATATCGATCT GTCGGCGGCG GATCAAAACG GTGACGGCGA CGGTTTTATC
GACGCCAGTG AACTGGTGAA TGCGATTAAC GACTCTGATG ACAATCCGGG CGTGTCGGCG
GCACTGGTAA AAACCGACGG CACCACCACG ATTATGCTCA CCTCCGATAG CACCGGGGCG
CAGAGCGCAT TTTCGGTGAG CGTAACGGGG CATGACGCCA GCAACGACAG CGCCAGCGCG
CCTGTTGCTA CGGATGTTTC CTCCGCCCAG GACGCGATTA TTCATCTCGG CAGCGCCACG
GGGCCAGAGA TCACTAATAG CAGCAATACC TTTGATGATG TGATCCCCGG CGTCACCATG
ACTTTCACCG AAGTCAGCGA TTCTGACAGC GATCTCACTA CCTTTAACAT CAGCGAAGAT
TCCAGCGCCA GCCAGGAGAA AGTACAGACC TTTGTCGATG CATATAACAC CTTGATTGAT
ACGGTTGATT CGCTGACCAC CCACGGTGAT GACAGTACCA GCGCCGGGGT ATTTGCAGGC
GACGCCGGAC TCAGCTCACT GGCAAACCAG CTCGATGACA TCGCCCATGC CAGCTACAAC
GGCGTGTCGA TTGTCGACTA TGGCATTACG CTCGATTCTC ACGGCCATTT ACAGATTGAT
TCAGACCAGT TCAACGACGC GATGAAGAGC GATCCCGACG GCCTGACCTC TATTTTCGTC
GGCGATAACA GCATGGTGGC GCAGATGGAC AGCCTTATTG ACACCTATAC CGACTCCAGC
AACGGCATCA TCACCCTGCG CCAGCAGAAC ATTGACGATC AGATGAGCAA AATTCAGGAC
GAAGGCGATC AGCTTACCGA TACCTATAAC GCCAACTACG ACCGTTATCT GGAGGAGTAC
ACCAACACGC TGGTTGAGGT GTACACCATG AAAGCCAGCA TGGCGGCATT CGCGTAA
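
The GC content field in the record can be recomputed from a sequence laid out like the one above. This is a minimal sketch, assuming the sequence is pasted as plain text with spaces and line breaks between blocks of ten bases; the helper name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First block of ten bases from the gene sequence above.
print(gc_content("ATGATCAACC"))  # 40.0
```

Applied to the full gene sequence, the result would be expected to round to the GC content listed in the record.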
 
Protein sequence
MINPRTIAQE IAYADVATQA ANLQEKQSEL DAESSGLDSL SSALSDFQSA VDALNSDTDG 
PVTFAATSNN DSATVSANSQ AQAGSYSFFV EQLAQGQQTT FSMGDDAFSA TGTFELTMGG
STMDIDLSAA DQNGDGDGFI DASELVNAIN DSDDNPGVSA ALVKTDGTTT IMLTSDSTGA
QSAFSVSVTG HDASNDSASA PVATDVSSAQ DAIIHLGSAT GPEITNSSNT FDDVIPGVTM
TFTEVSDSDS DLTTFNISED SSASQEKVQT FVDAYNTLID TVDSLTTHGD DSTSAGVFAG
DAGLSSLANQ LDDIAHASYN GVSIVDYGIT LDSHGHLQID SDQFNDAMKS DPDGLTSIFV
GDNSMVAQMD SLIDTYTDSS NGIITLRQQN IDDQMSKIQD EGDQLTDTYN ANYDRYLEEY
TNTLVEVYTM KASMAAFA
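
The protein sequence follows from the gene sequence by codon translation. The sketch below assumes the amino acid assignments of NCBI translation table 11 (the table listed in the record), which are the same as those of the standard code; it does not handle the alternative start codons that table 11 permits, which is not an issue here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest)
# paired with the amino acid string for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGATCAACC CAAGAACCAT TGCGCAAGAA"))  # MINPRTIAQE
```

The output matches the first ten residues of the protein sequence listed above.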