Gene Information |
Locus tag | SbBS512_E2249 |
Symbol | flgE |
ID | 6271105 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2049353 |
End bp | 2050561 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641726268 |
Product | flagellar hook protein FlgE |
Protein accession | YP_001880752 |
Protein GI | 187734244 |
COG category | [N] Cell motility |
COG ID | [COG1749] Flagellar hook protein FlgE |
TIGRFAM ID | [TIGR03506] flagellar hook-basal body proteins |
| |
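The coordinate and length fields above are mutually consistent, which can be checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for SbBS512_E2249.
length = gene_length_bp(2049353, 2050561)
print(length, protein_length_aa(length))  # 1209 402
```

Under these assumptions the result matches the listed Gene Length (1209 bp) and Protein Length (402 aa).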
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.467124 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCTTTT CTCAAGCGGT TAGCGGATTA AACGCTGCCG CCACTAACCT CGATGTTATT GGCAACAATA TCGTCAACTC CGCCACCTAC GGCTTTAAAT CAGGCACGGC CTCTTTTGCC GATATGTTTG CCGGTTCGAA AGTGGGACTG GGGGTAAAAG TTGCCGGTAT CACTCAGGAC TTTACAGATG GCACGACCAC CAACACCGGG CGAGGTCTGG ACGTTGCTAT CAGCCAGAAC GGTTTTTTCC GTCTGGTAGA CAGCAACGGT TCGGTGTTCT ACAGCCGTAA CGGACAATTT AAGCTGGATG AAAACCGTAA CCTGGTGAAT ATGCAAGGTT TACAGCTGAC GGGTTACCCG GCAACCGGTA CGCCGCCGAC TATTCAGCAA GGGGCGAATC CGACCAATAT TTCGATCCCG AATACCCTGA TGGCAGCGAA AACTACCACC ACGGCGTCGA TGCAGATCAA CCTGAATTCC AGTGATCCGC TTCCTACTGT TACGCCATTC AGCGCCAGCA ATGCGGATAG CTATAACAAA AAAGGTTCGG TGACTGTTTT CGACAGTCAG GGTAATGCTC ATGACATGAG CGTCTACTTT GTGAAGACCG GGGATAATAA CTGGCAGGTC TACACCCAGG ATAGCAGTGA TCCAAACAGC ATTGCGAAGA CAGCGACAAC ACTGGAATTT AATGCTAATG GCACATTAGT GGATGGTGCG ATGGCGAATA ATATCGCAAC CGGCGCAATT AACGGTGCAG ACCCCGCCAC GTTTAGTCTG AGCTTCCTCA ACTCCATGCA GCAAAATACC GGCGCTAACA ATATTGTGGC AACCCCCCAG AACGGCTACA AACCGGGTGA TCTGGTGAGT TATCAAATCA ATGATGACGG TACGGTTGTC GGCAACTATT CCAACGAACA AACTCAACTG CTGGGGCAGA TTGTACTGGC GAACTTTGCC AACAACGAAG GTCTGGCATC CGAAGGCGAC AACGTCTGGT CTGCGACGCA ATCTTCTGGC GTGGCGCTGT TGGGGACAGC CGGGACGGGA AACTTTGGCA CCCTGACCAA CGGTGCGCTG GAAGCGTCCA ACGTCGATCT CAGTAAAGAA CTGGTCAATA TGATCGTTGC CCAGCGTAAC TATCAGTCTA ACGCCCAGAC CATCAAAACC CAGGACCAGA TCCTCAACAC GCTGGTTAAC TTACGCTAA
|
Protein sequence | MAFSQAVSGL NAAATNLDVI GNNIVNSATY GFKSGTASFA DMFAGSKVGL GVKVAGITQD FTDGTTTNTG RGLDVAISQN GFFRLVDSNG SVFYSRNGQF KLDENRNLVN MQGLQLTGYP ATGTPPTIQQ GANPTNISIP NTLMAAKTTT TASMQINLNS SDPLPTVTPF SASNADSYNK KGSVTVFDSQ GNAHDMSVYF VKTGDNNWQV YTQDSSDPNS IAKTATTLEF NANGTLVDGA MANNIATGAI NGADPATFSL SFLNSMQQNT GANNIVATPQ NGYKPGDLVS YQINDDGTVV GNYSNEQTQL LGQIVLANFA NNEGLASEGD NVWSATQSSG VALLGTAGTG NFGTLTNGAL EASNVDLSKE LVNMIVAQRN YQSNAQTIKT QDQILNTLVN LR
|
| |
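The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, assuming the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand) and that the amino acid assignments of translation table 11 match the standard code. The helper names `clean`, `gc_fraction` and `translate` are illustrative, not taken from any library.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 is assumed to share these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        # Raises ValueError on ambiguous bases such as N.
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence listed above.
print(translate("ATGGCCTTTT CTCAAGCGGT TAGCGGATTA"))  # MAFSQAVSGL
print(translate("TTACGCTAA"))                         # LR
```

Passing the full gene sequence to `translate` should yield the listed protein sequence, and `gc_fraction` on the same input can be compared against the GC content field.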