Gene EcE24377A_1199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_1199 
SymbolflgE 
ID5590183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp1210041 
End bp1211246 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content52% 
IMG OID640924898 
Productflagellar hook protein FlgE 
Protein accessionYP_001462310 
Protein GI157157539 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE 
TIGRFAM ID[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCTTTT CTCAAGCGGT TAGCGGATTA AACGCTGCCG CCACCAACCT CGATGTTATT 
GGCAACAATA TCGCCAACTC CGCCACCTAC GGCTTTAAAT CAGGCACGGC CTCTTTTGCC
GATATGTTTG CCGGTTCGAA AGTGGGACTG GGGGTAAAAG TTGCCGGTAT CACTCAGGAC
TTTACCGATG GCACGACCAC CAACACCGGG CGAGGTCTGG ACGTTGCTAT CAGCCAGAAC
GGTTTTTTCC GTCTGGTAGA CAGCAACGGT TCGGTGTTCT ACAGCCGTAA CGGACAATTT
AAGCTGGATG AAAACCGTAA CCTGGTGAAT ATGCAAGGTT TACAGCTGAC GGGTTACCCG
GCAACCGGTA CGCCGCCGAC TATTCAGCAA GGGGCGAATC CGACCAATAT TTCGATCCCG
AATACCCTGA TGGCAGCGAA AACTACCACC ACGGCATCGA TGCAGATCAA CCTGAATTCC
AGTGATCCGC TTCCTACTGT TACGCCATTC AGCGCCAGCA ATGCGGATAG CTATAACAAA
AAAGGTTCGG TGACTGTTTT CGACAGTCAG GGTAATGCTC ATGACATGAG CGTCTACTTT
GTGAAGACCG GGGATAATAA CTGGCAGGTC TACACCCAGG ATAGCAGTGA TCCAACAGGT
ACAGCCGAGC CTGCAATGAA GCTGGTGTTT AATGCCAATG GCGTTCTGAC CTCAAATCCA
ACAGAGAATA TTACCACTGG CGCAATTAAC GGCGCAGATC CCGCCACGTT TAGCCTGAGC
TTCCTCAACT CCATGCAGCA AAATACCGGC GCTAACAATA TTGTGGCAAC CACCCAGAAT
GGCTACAAAC CGGGCGATCT GGTGAGTTAT CAAATCAATG ATGACGGTAC GGTTGTCGGC
AACTATTCCA ACGAACAAAC ACAACTGCTG GGGCAGATTG TACTGGCGAA CTTTGCCAAC
AACGAAGGTC TGGCATCCGA AGGCGACAAC GTCTGGTCTG CGACGCAATC TTCTGGCGTG
GCGCTGTTGG GGACAGCCGG GACGGGCAAC TTTGGCACCC TGACCAACGG TGCGTTGGAA
GCGTCCAACG TCGATCTCAG TAAAGAACTG GTCAATATGA TCGTTGCCCA GCGTAACTAT
CAGTCTAACG CCCAGACCAT CAAAACCCAG GACCAGATCC TCAACACGCT GGTTAACTTA
CGCTAA
 
Protein sequence
MAFSQAVSGL NAAATNLDVI GNNIANSATY GFKSGTASFA DMFAGSKVGL GVKVAGITQD 
FTDGTTTNTG RGLDVAISQN GFFRLVDSNG SVFYSRNGQF KLDENRNLVN MQGLQLTGYP
ATGTPPTIQQ GANPTNISIP NTLMAAKTTT TASMQINLNS SDPLPTVTPF SASNADSYNK
KGSVTVFDSQ GNAHDMSVYF VKTGDNNWQV YTQDSSDPTG TAEPAMKLVF NANGVLTSNP
TENITTGAIN GADPATFSLS FLNSMQQNTG ANNIVATTQN GYKPGDLVSY QINDDGTVVG
NYSNEQTQLL GQIVLANFAN NEGLASEGDN VWSATQSSGV ALLGTAGTGN FGTLTNGALE
ASNVDLSKEL VNMIVAQRNY QSNAQTIKTQ DQILNTLVNL R