Gene EcSMS35_1258 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1258 
SymbolfliC 
ID6144795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1255771 
End bp1257408 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content48% 
IMG OID641616136 
Productflagellin 
Protein accessionYP_001743319 
Protein GI170682489 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.0516151 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACAAG TCATTAATAC CAACAGCCTC TCGCTGATCA CTCAAAATAA TATCAACAAG 
AACCAGTCTG CGCTGTCGAG TTCTATCGAG CGTCTGTCTT CTGGCTTGCG TATTAACAGC
GCGAAGGATG ACGCAGCGGG TCAGGCGATT GCTAACCGTT TTACTTCTAA CATTAAAGGC
CTGACTCAGG CGGCCCGTAA CGCCAACGAC GGTATCTCCG TTGCGCAGAC CACTGAAGGC
GCGCTGTCCG AAATCAACAA CAACTTACAG CGTATCCGTG AACTGACGGT TCAGGCTTCT
ACCGGGACTA ACTCCGATTC GGATCTGGAC TCCATTCAGG ACGAAATCAA ATCCCGTCTG
GACGAAATTG ACCGCGTATC TGGCCAGACC CAGTTCAACG GCGTGAACGT ACTGGCGAAA
GACGGTTCAA TGAAAATTCA GGTTGGTGCG AATGACGGCC AGACTATCAC TATTGATCTG
AAGAAAATTG ACTCTGATAC GCTGGGGCTG AGTGGGTTTA ATGTGAATGG TAGCGCAGAT
AAGGCAAGTG TCGCGGCGAC AGCTGACGGA ATGGTTAAAG ACGGATATAT CAAAGGGTTA
ACTTCATCTG ACGGCAGCAC TGCATATACT AAAACTACAG CAAATACTGC AGCAAAAGGA
TCTGATATTC TTGCGGCGCT TAAGACTGGC GATAAAATTA CCGCAACAGG TGCAAATAGC
CTTGCTGATA ATGCGACATC GACAACTTAT ACTTATAATG CAACCAGCAA TACCTTCTCC
TATACGGCTG ACGGTGTAAA CCAAACGAAT GCTGCAGCAA ATCTCATACC TGCAGCAGGG
AAAACGACAG CTGCATCAGT TACTATTGGT GGGACAGCAC AGAATGTAAA TATTGATGAT
TCGGGCAATA TTACTTCAAG TGATGGCGAT CAACTTTATC TGGATTCAAC AGGTAACCTG
ACTAAAAACC AGGCCGGCAA CCCGAACAAA GCAACCGTTT CTGGGCTTCT CGGAAATACG
GATGCGAAAG GTACTGCTGT TAAAACAACC ATCAAGACAG AGGCTGGTGT AACAGTTACA
GCTGAAGGTA ATACAGGTAC TGTAAAAATT GAAGGTGCTA CTGTTTCAGC ATCTGCATTT
ACGGGCATTG CATATTCCGC CAACACCGGT GGGAATACTT ATGCTGTTGC CGCAAATAAT
ACTACAAATG GTTTCCTGGC GGGGGATGCC TTAACCCAGG ATGCTCAAAC TGTTTCAACC
TACTACTCGC AAGCCGATGG CACGGTCACG AATAGCGCAG GCAAAGAAAT CTATAAAGAC
GCTGATGGTG TCTACAGCAC AGAGAATAAA ACATCGAAGA CGTCCGATCC ATTGGCTGCG
CTTGACGACG CAATCAGCTC CATCGACAAA TTCCGTTCAT CCCTGGGTGC TATCCAGAAC
CGTCTGGATT CCGCGGTCAC CAACCTGAAC AACACCACTA CCAACCTGTC CGAAGCGCAG
TCCCGTATTC AGGACGCCGA CTATGCGACC GAAGTGTCCA ACATGTCGAA AGCGCAGATC
ATCCAGCAGG CCGGTAACTC CGTGCTGGCA AAAGCTAACC AGGTACCACA GCAGGTTCTG
TCTCTGCTGC AGGGTTAA
 
Protein sequence
MAQVINTNSL SLITQNNINK NQSALSSSIE RLSSGLRINS AKDDAAGQAI ANRFTSNIKG 
LTQAARNAND GISVAQTTEG ALSEINNNLQ RIRELTVQAS TGTNSDSDLD SIQDEIKSRL
DEIDRVSGQT QFNGVNVLAK DGSMKIQVGA NDGQTITIDL KKIDSDTLGL SGFNVNGSAD
KASVAATADG MVKDGYIKGL TSSDGSTAYT KTTANTAAKG SDILAALKTG DKITATGANS
LADNATSTTY TYNATSNTFS YTADGVNQTN AAANLIPAAG KTTAASVTIG GTAQNVNIDD
SGNITSSDGD QLYLDSTGNL TKNQAGNPNK ATVSGLLGNT DAKGTAVKTT IKTEAGVTVT
AEGNTGTVKI EGATVSASAF TGIAYSANTG GNTYAVAANN TTNGFLAGDA LTQDAQTVST
YYSQADGTVT NSAGKEIYKD ADGVYSTENK TSKTSDPLAA LDDAISSIDK FRSSLGAIQN
RLDSAVTNLN NTTTNLSEAQ SRIQDADYAT EVSNMSKAQI IQQAGNSVLA KANQVPQQVL
SLLQG