Gene EcHS_A1199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1199 
SymbolflgE 
ID5592587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1196254 
End bp1197462 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content52% 
IMG OID640920358 
Productflagellar hook protein FlgE 
Protein accessionYP_001457921 
Protein GI157160603 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE 
TIGRFAM ID[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clones70 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCTTTT CTCAAGCGGT TAGCGGATTA AACGCTGCCG CCACCAACCT CGATGTTATT 
GGCAACAATA TCGCCAACTC CGCCACCTAC GGCTTTAAAT CAGGCACGGC CTCTTTTGCC
GATATGTTTG CCGGTTCGAA AGTGGGACTG GGGGTAAAAG TTGCCGGTAT CACTCAGGAC
TTTACCGATG GCACGACCAC CAACACCGGG CGCGGTCTGG ACGTTGCTAT CAGCCAGAAC
GGTTTTTTCC GTCTGGTAGA TAGCAACGGT TCGGTGTTCT ACAGCCGTAA CGGACAATTT
AAGCTGGATG AAAACCGTAA CCTGGTGAAT ATGCAAGGTT TACAGCTGAC GGGTTACCCG
GCAACCGGTA CGCCGCCGAC TATTCAGCAA GGGGCGAATC CGACTAATAT TTCGATCCCG
AATACCCTGA TGGCAGCGAA AACTACCACT ACGGCGTCGA TGCAGATCAA CCTGAATTCC
AGCGATCCGC TTCCCTCTGT TAACGCATTT GATGCCAGCA ATGCGGATAG CTATAACAAA
AAAGGTTCGG TGACTGTTTT CGACAGTCAG GGTAATGCTC ATGACATGAG CGTCTACTTT
GTGAAGACCG GGGATAATAA CTGGCAGGTC TACACCCAGG ATAGCAGTGA TCCAAACAGC
ATTGCGAAGA CAGCGACAAC ACTGAAATTT AATGCTAATG GCACATTAGT GGATGGTGCG
ATGGCGAATA ATATCGCAAC CGGCGCAATT AACGGCGCAG AACCCGCCAC GTTTAGCCTG
AGCTTCCTCA ACTCCATGCA GCAAAATACC GGCGCTAACA ACATTGTGGC AACCACCCAG
AATGGCTACA AACCGGGCGA TCTGGTGAGT TATCAAATCA ATGATGACGG TACGGTTGTC
GGCAACTATT CCAACGAACA AACCCAACTG CTGGGGCAGA TTGTACTGGC GAACTTTGCC
AACAACGAAG GTCTGGCATC CGAAGGCGAC AACGTCTGGT CTGCGACGCA ATCTTCTGGC
GTGGCGCTGT TGGGGACAGC CGGGACGGGC AACTTTGGCA CCCTGACCAA CGGTGCGCTG
GAAGCGTCCA ACGTCGATCT CAGTAAAGAA CTGGTCAATA TGATCGTTGC CCAGCGTAAC
TATCAGTCTA ACGCCCAGAC CATCAAAACC CAGGACCAGA TCCTCAACAC GCTGGTTAAC
TTACGCTAA
 
Protein sequence
MAFSQAVSGL NAAATNLDVI GNNIANSATY GFKSGTASFA DMFAGSKVGL GVKVAGITQD 
FTDGTTTNTG RGLDVAISQN GFFRLVDSNG SVFYSRNGQF KLDENRNLVN MQGLQLTGYP
ATGTPPTIQQ GANPTNISIP NTLMAAKTTT TASMQINLNS SDPLPSVNAF DASNADSYNK
KGSVTVFDSQ GNAHDMSVYF VKTGDNNWQV YTQDSSDPNS IAKTATTLKF NANGTLVDGA
MANNIATGAI NGAEPATFSL SFLNSMQQNT GANNIVATTQ NGYKPGDLVS YQINDDGTVV
GNYSNEQTQL LGQIVLANFA NNEGLASEGD NVWSATQSSG VALLGTAGTG NFGTLTNGAL
EASNVDLSKE LVNMIVAQRN YQSNAQTIKT QDQILNTLVN LR