Gene ECD_01072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_01072 
SymbolflgE 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp1135144 
End bp1136349 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content52% 
IMG OID 
Productflagellar hook protein E 
Protein accessionACT42967 
Protein GI253977297 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCTTTT CTCAAGCGGT TAGCGGATTA AACGCTGCCG CCACCAACCT CGATGTTATT 
GGCAACAATA TCGCCAACTC CGCCACCTAC GGCTTTAAAT CAGGCACGGC CTCTTTTGCC
GATATGTTTG CCGGTTCGAA AGTGGGACTG GGGGTAAAAG TTGCCGGTAT CACTCAGGAC
TTTACCGATG GCACGACCAC CAACACCGGG CGAGGTCTGG ACGTTGCTAT CAGCCAGAAC
GGTTTTTTCC GTCTGGTAGA CAGCAACGGT TCGGTGTTCT ACAGCCGTAA CGGACAATTT
AAGCTGGATG AAAACCGTAA CCTGGTGAAT ATGCAAGGTT TACAGCTGAC GGGTTACCCG
GCAACCGGTA CGCCGCCGAC TATTCAGCAA GGGGCGAATC CGACCAATAT TTCGATCCCG
AATACCCTGA TGGCAGCGAA AACTACCACC ACGGCATCGA TGCAGATCAA CCTGAATTCC
AGCGATCCGC TTCCTACTGT TACGCCATTC AGCGCCAGCA ATGCGGATAG CTATAACAAA
AAAGGTTCGG TGACTGTTTT CGACAGTCAG GGTAATGCTC ATGACATGAG CGTCTACTTT
GTGAAGACCG GGGATAATAA CTGGCAGGTC TACACCCAGG ATAGCAGTGA TCCAACAGGT
ACAGCCGAGC CTGCAATGAA GCTGGTGTTT AATGCCAATG GCGTTCTGAC CTCAAATCCA
ACAGAGAATA TTACCACTGG CGCAATTAAC GGCGCAGATC CCGCCACGTT TAGCCTGAGC
TTCCTCAACT CCATGCAGCA AAATACCGGC GCTAACAATA TTGTGGCAAC CACCCAGAAT
GGCTACAAAC CGGGCGATCT GGTGAGTTAT CAAATCAATG ATGACGGTAC GGTTGTCGGC
AACTATTCCA ACGAACAAAC ACAACTGCTG GGGCAGATTG TACTGGCGAA CTTTGCCAAC
AACGAAGGTC TGGCATCCGA AGGCGACAAC GTCTGGTCTG CGACGCAATC TTCTGGCGTG
GCGCTGTTGG GGACAGCCGG GACGGGAAAC TTTGGCACCC TGACCAACGG TGCGCTGGAA
GCGTCCAACG TCGATCTCAG TAAAGAACTG GTCAATATGA TCGTTGCCCA GCGTAACTAT
CAGTCTAACG CCCAGACCAT CAAAACCCAG GACCAGATCC TCAACACGCT GGTTAACTTA
CGCTAA
 
Protein sequence
MAFSQAVSGL NAAATNLDVI GNNIANSATY GFKSGTASFA DMFAGSKVGL GVKVAGITQD 
FTDGTTTNTG RGLDVAISQN GFFRLVDSNG SVFYSRNGQF KLDENRNLVN MQGLQLTGYP
ATGTPPTIQQ GANPTNISIP NTLMAAKTTT TASMQINLNS SDPLPTVTPF SASNADSYNK
KGSVTVFDSQ GNAHDMSVYF VKTGDNNWQV YTQDSSDPTG TAEPAMKLVF NANGVLTSNP
TENITTGAIN GADPATFSLS FLNSMQQNTG ANNIVATTQN GYKPGDLVSY QINDDGTVVG
NYSNEQTQLL GQIVLANFAN NEGLASEGDN VWSATQSSGV ALLGTAGTGN FGTLTNGALE
ASNVDLSKEL VNMIVAQRNY QSNAQTIKTQ DQILNTLVNL R