Gene EcSMS35_1307 details


Gene Information

Locus tag: EcSMS35_1307
Symbol: flhB
ID: 6144489
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1294901
End bp: 1296049
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 55%
IMG OID: 641616185
Product: flagellar biosynthesis protein FlhB
Protein accession: YP_001743365
Protein GI: 170683078
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1377] Flagellar biosynthesis pathway, component FlhB
TIGRFAM ID: [TIGR00328] flagellar biosynthetic protein FlhB
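
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 1294901
end_bp = 1296049

# An inclusive span covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # 1149 bp
print(protein_length, "aa")   # 382 aa
```

Both values agree with the Gene Length and Protein Length fields of the record.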


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.0569687
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCTGACG AGAGCGACGA CAAAACAGAA GCCCCCACAC CTCACCGACT AGAAAAAGCG 
CGGGAAGAGG GGCAAATCCC GCGTTCCCGT GAACTGACCT CGCTGCTGAT TTTGTTAGTG
GGCGTGTGTG TTATCTGGTT TGGCGGTGTG TCGCTGGCCC GTCGATTGTC GGGCATGCTC
TCCGCTGGGC TGCATTTTGA TCACAGTATT ATCAATGACC CGAATTTGAT CCTCGGGCAG
ATTATTCTGC TGATCAGAGA AGCCATGCTG GCGCTGCTAC CGCTGATTAG CGGCGTGGTG
CTGGTGGCGA TTATTTCTCC GGTTATGCTG GGGGGGCTGG TATTTAGCGG CAAATCCTTG
CAGCCGAAGT TTTCCAAACT CAACCCGCTA CCGGGCATTA AACGGATGTT CTCGGCACAG
ACTGGCGCGG AGTTGCTTAA GGCAATTTTG AAAACCATCC TGGTTGGCAG CGTGACGGGG
TTTTTTCTCT GGCATCACTG GCCGCAGATG ATGCGCTTAA TGGCCGAGTC TCCGATTACC
GCCATGGGTA ATGCGATGGA TCTGGTAGGG CTATGCGCAC TGCTGGTGGT GCTTGGTGTT
ATTCCGATGG TGGGATTTGA CGTCTTTTTC CAAATCTTCA GCCACCTGAA AAAGCTGCGA
ATGTCGCGGC AGGATATTCG TGATGAGTTC AAACAAAGCG AAGGCGACCC CCATGTTAAA
GGGCGGATAC GTCAGATGCA GCGAGCTGCC GCGCGGCGTC GGATGATGGC CGATGTGCCG
AAAGCGGATG TCATTGTCAA TAACCCAACC CACTATTCGG TAGCGTTGCA GTATGACGAA
AACAAAATGA GCGCACCGAA AGTGGTCGCT AAAGGTGCAG GACTGGTCGC GCTGCGCATT
CGTGAAATTG GTGCTGAAAA TAACGTCCCT ACGCTTGAAG CGCCGCCGCT GGCGCGAGCG
CTGTATCGAC ATGCGGAGAT CGGTCAACAA ATCCCGGGTC AACTGTACGC CGCGGTAGCG
GAAGTGCTGG CCTGGGTCTG GCAACTGAAA CGCTGGCGTC TGGCTGGAGG ACAGCGCCCT
GTACAACCTA CTCATCTTCC GGTGCCGGAA GCCCTGGATT TTATTAACGA GAAACCGTCC
CATGAGTAA
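
A GC percentage such as the one reported for this gene can be recomputed from the sequence block. The helper below is a minimal sketch: `gc_percent` is a hypothetical name, and it assumes the reported figure is the share of G and C bases over the whole gene sequence, with the grouping spaces and line breaks ignored.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T letters of seq."""
    # Keep only nucleotide letters so spaces and newlines are ignored.
    letters = [base for base in seq.upper() if base in "ACGT"]
    if not letters:
        raise ValueError("no nucleotide letters found")
    gc_count = sum(base in "GC" for base in letters)
    return 100.0 * gc_count / len(letters)


# First row of the gene sequence, copied with its grouping spaces.
first_row = "GTGTCTGACG AGAGCGACGA CAAAACAGAA GCCCCCACAC CTCACCGACT AGAAAAAGCG"
print(f"{gc_percent(first_row):.1f}% GC in the first 60 bases")
```

To compare against the record, pass the entire gene sequence block rather than a single row; one row on its own need not match the gene-wide value.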
 
Protein sequence
MSDESDDKTE APTPHRLEKA REEGQIPRSR ELTSLLILLV GVCVIWFGGV SLARRLSGML 
SAGLHFDHSI INDPNLILGQ IILLIREAML ALLPLISGVV LVAIISPVML GGLVFSGKSL
QPKFSKLNPL PGIKRMFSAQ TGAELLKAIL KTILVGSVTG FFLWHHWPQM MRLMAESPIT
AMGNAMDLVG LCALLVVLGV IPMVGFDVFF QIFSHLKKLR MSRQDIRDEF KQSEGDPHVK
GRIRQMQRAA ARRRMMADVP KADVIVNNPT HYSVALQYDE NKMSAPKVVA KGAGLVALRI
REIGAENNVP TLEAPPLARA LYRHAEIGQQ IPGQLYAAVA EVLAWVWQLK RWRLAGGQRP
VQPTHLPVPE ALDFINEKPS HE
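
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The code below is an illustrative sketch, not the database's own method. It assumes that translation table 11 uses the standard amino acid assignments, that an alternative start codon such as GTG in the first position is read as methionine, and that the start codon set listed in `START_CODONS` is the one defined for table 11. All names are hypothetical.

```python
from itertools import product

# Standard amino acid assignments, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed start codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            amino_acid = "M"  # initiator codon is read as methionine
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record.
print(translate("GTGTCTGACG AGAGCGACGA CAAAACAGAA GCCCCCACAC CTCACCGACT AGAAAAAGCG"))
# MSDESDDKTEAPTPHRLEKA
print(translate("CATGAGTAA"))
# HE
```

The two outputs match the first 20 residues and the last 2 residues of the protein sequence shown in the record, which is consistent with a GTG start codon and a TAA stop codon.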