Gene Information |
Locus tag | EcHS_A1199 |
Symbol | flgE |
ID | 5592587 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1196254 |
End bp | 1197462 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640920358 |
Product | flagellar hook protein FlgE |
Protein accession | YP_001457921 |
Protein GI | 157160603 |
COG category | [N] Cell motility |
COG ID | [COG1749] Flagellar hook protein FlgE |
TIGRFAM ID | [TIGR03506] flagellar hook-basal body proteins |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 70 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCTTTT CTCAAGCGGT TAGCGGATTA AACGCTGCCG CCACCAACCT CGATGTTATT GGCAACAATA TCGCCAACTC CGCCACCTAC GGCTTTAAAT CAGGCACGGC CTCTTTTGCC GATATGTTTG CCGGTTCGAA AGTGGGACTG GGGGTAAAAG TTGCCGGTAT CACTCAGGAC TTTACCGATG GCACGACCAC CAACACCGGG CGCGGTCTGG ACGTTGCTAT CAGCCAGAAC GGTTTTTTCC GTCTGGTAGA TAGCAACGGT TCGGTGTTCT ACAGCCGTAA CGGACAATTT AAGCTGGATG AAAACCGTAA CCTGGTGAAT ATGCAAGGTT TACAGCTGAC GGGTTACCCG GCAACCGGTA CGCCGCCGAC TATTCAGCAA GGGGCGAATC CGACTAATAT TTCGATCCCG AATACCCTGA TGGCAGCGAA AACTACCACT ACGGCGTCGA TGCAGATCAA CCTGAATTCC AGCGATCCGC TTCCCTCTGT TAACGCATTT GATGCCAGCA ATGCGGATAG CTATAACAAA AAAGGTTCGG TGACTGTTTT CGACAGTCAG GGTAATGCTC ATGACATGAG CGTCTACTTT GTGAAGACCG GGGATAATAA CTGGCAGGTC TACACCCAGG ATAGCAGTGA TCCAAACAGC ATTGCGAAGA CAGCGACAAC ACTGAAATTT AATGCTAATG GCACATTAGT GGATGGTGCG ATGGCGAATA ATATCGCAAC CGGCGCAATT AACGGCGCAG AACCCGCCAC GTTTAGCCTG AGCTTCCTCA ACTCCATGCA GCAAAATACC GGCGCTAACA ACATTGTGGC AACCACCCAG AATGGCTACA AACCGGGCGA TCTGGTGAGT TATCAAATCA ATGATGACGG TACGGTTGTC GGCAACTATT CCAACGAACA AACCCAACTG CTGGGGCAGA TTGTACTGGC GAACTTTGCC AACAACGAAG GTCTGGCATC CGAAGGCGAC AACGTCTGGT CTGCGACGCA ATCTTCTGGC GTGGCGCTGT TGGGGACAGC CGGGACGGGC AACTTTGGCA CCCTGACCAA CGGTGCGCTG GAAGCGTCCA ACGTCGATCT CAGTAAAGAA CTGGTCAATA TGATCGTTGC CCAGCGTAAC TATCAGTCTA ACGCCCAGAC CATCAAAACC CAGGACCAGA TCCTCAACAC GCTGGTTAAC TTACGCTAA
|
Protein sequence | MAFSQAVSGL NAAATNLDVI GNNIANSATY GFKSGTASFA DMFAGSKVGL GVKVAGITQD FTDGTTTNTG RGLDVAISQN GFFRLVDSNG SVFYSRNGQF KLDENRNLVN MQGLQLTGYP ATGTPPTIQQ GANPTNISIP NTLMAAKTTT TASMQINLNS SDPLPSVNAF DASNADSYNK KGSVTVFDSQ GNAHDMSVYF VKTGDNNWQV YTQDSSDPNS IAKTATTLKF NANGTLVDGA MANNIATGAI NGAEPATFSL SFLNSMQQNT GANNIVATTQ NGYKPGDLVS YQINDDGTVV GNYSNEQTQL LGQIVLANFA NNEGLASEGD NVWSATQSSG VALLGTAGTG NFGTLTNGAL EASNVDLSKE LVNMIVAQRN YQSNAQTIKT QDQILNTLVN LR
|
| |
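The coordinate and length fields in the record above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence includes its terminal stop codon (the gene sequence shown ends in TAA). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS that includes one terminal stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for locus tag EcHS_A1199 (flgE).
start_bp, end_bp = 1196254, 1197462
length_bp = gene_length(start_bp, end_bp)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)
```

Under these assumptions the computed values agree with the Gene Length (1209 bp) and Protein Length (402 aa) fields listed in the record.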