Gene EcolC_2524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2524 
SymbolflgE 
ID6067440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2776126 
End bp2777334 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content52% 
IMG OID641601930 
Productflagellar hook protein FlgE 
Protein accessionYP_001725482 
Protein GI170020528 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE 
TIGRFAM ID[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.353481 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00930832 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCTTTT CTCAAGCGGT TAGCGGATTA AACGCTGCCG CCACCAACCT CGATGTTATT 
GGCAACAATA TCGCCAACTC CGCCACCTAC GGCTTTAAAT CAGGCACGGC CTCTTTTGCC
GATATGTTTG CCGGTTCGAA AGTGGGACTG GGGGTAAAAG TTGCCGGTAT CACTCAGGAC
TTTACCGATG GCACGACCAC CAACACCGGG CGCGGTCTGG ACGTTGCTAT CAGCCAGAAC
GGTTTTTTCC GTCTGGTAGA TAGCAACGGT TCGGTGTTCT ACAGCCGTAA CGGACAATTT
AAGCTGGATG AAAACCGTAA CCTGGTGAAT ATGCAAGGTT TACAGCTGAC GGGTTACCCG
GCAACCGGTA CGCCGCCGAC TATTCAGCAA GGGGCGAATC CGACTAATAT TTCGATCCCG
AATACCCTGA TGGCAGCGAA AACTACCACT ACGGCGTCGA TGCAGATCAA CCTGAATTCC
AGCGATCCGC TTCCCTCTGT TAACGCATTT GATGCCAGCA ATGCGGATAG CTATAACAAA
AAAGGTTCGG TGACTGTTTT CGACAGTCAG GGTAATGCTC ATGACATGAG CGTCTACTTT
GTGAAGACCG GGGATAATAA CTGGCAGGTC TACACCCAGG ATAGCAGTGA TCCAAACAGC
ATTGCGAAGA CAGCGACAAC ACTGAAATTT AATGCTAATG GCACATTAGT GGATGGTGCG
ATGGCGAATA ATATCGCAAC CGGCGCAATT AACGGCGCAG AACCCGCCAC GTTTAGCCTG
AGCTTCCTCA ACTCCATGCA GCAAAATACC GGCGCTAACA ACATTGTGGC AACCACCCAG
AATGGCTACA AACCGGGCGA TCTGGTGAGT TATCAAATCA ATGATGACGG TACGGTTGTC
GGCAACTATT CCAACGAACA AACCCAACTG CTGGGGCAGA TTGTACTGGC GAACTTTGCC
AACAACGAAG GTCTGGCATC CGAAGGCGAC AACGTCTGGT CTGCGACGCA ATCTTCTGGC
GTGGCGCTGT TGGGGACAGC CGGGACGGGC AACTTTGGCA CCCTGACCAA CGGTGCGCTG
GAAGCGTCCA ACGTCGATCT CAGTAAAGAA CTGGTCAATA TGATCGTTGC CCAGCGTAAC
TATCAGTCTA ACGCCCAGAC CATCAAAACC CAGGACCAGA TCCTCAACAC GCTGGTTAAC
TTACGCTAA
 
Protein sequence
MAFSQAVSGL NAAATNLDVI GNNIANSATY GFKSGTASFA DMFAGSKVGL GVKVAGITQD 
FTDGTTTNTG RGLDVAISQN GFFRLVDSNG SVFYSRNGQF KLDENRNLVN MQGLQLTGYP
ATGTPPTIQQ GANPTNISIP NTLMAAKTTT TASMQINLNS SDPLPSVNAF DASNADSYNK
KGSVTVFDSQ GNAHDMSVYF VKTGDNNWQV YTQDSSDPNS IAKTATTLKF NANGTLVDGA
MANNIATGAI NGAEPATFSL SFLNSMQQNT GANNIVATTQ NGYKPGDLVS YQINDDGTVV
GNYSNEQTQL LGQIVLANFA NNEGLASEGD NVWSATQSSG VALLGTAGTG NFGTLTNGAL
EASNVDLSKE LVNMIVAQRN YQSNAQTIKT QDQILNTLVN LR