Gene SbBS512_E2249 details


Gene Information

Locus tag: SbBS512_E2249
Symbol: flgE
ID: 6271105
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 2049353
End bp: 2050561
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 51%
IMG OID: 641726268
Product: flagellar hook protein FlgE
Protein accession: YP_001880752
Protein GI: 187734244
COG category: [N] Cell motility
COG ID: [COG1749] Flagellar hook protein FlgE
TIGRFAM ID: [TIGR03506] flagellar hook-basal body proteins
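
The coordinates and lengths listed above agree with one another. The following is a minimal Python sketch of that arithmetic. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the page states neither convention explicitly. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2049353, 2050561)
print(length_bp)                  # 1209
print(protein_length(length_bp))  # 402
```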


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.467124
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCTTTT CTCAAGCGGT TAGCGGATTA AACGCTGCCG CCACTAACCT CGATGTTATT 
GGCAACAATA TCGTCAACTC CGCCACCTAC GGCTTTAAAT CAGGCACGGC CTCTTTTGCC
GATATGTTTG CCGGTTCGAA AGTGGGACTG GGGGTAAAAG TTGCCGGTAT CACTCAGGAC
TTTACAGATG GCACGACCAC CAACACCGGG CGAGGTCTGG ACGTTGCTAT CAGCCAGAAC
GGTTTTTTCC GTCTGGTAGA CAGCAACGGT TCGGTGTTCT ACAGCCGTAA CGGACAATTT
AAGCTGGATG AAAACCGTAA CCTGGTGAAT ATGCAAGGTT TACAGCTGAC GGGTTACCCG
GCAACCGGTA CGCCGCCGAC TATTCAGCAA GGGGCGAATC CGACCAATAT TTCGATCCCG
AATACCCTGA TGGCAGCGAA AACTACCACC ACGGCGTCGA TGCAGATCAA CCTGAATTCC
AGTGATCCGC TTCCTACTGT TACGCCATTC AGCGCCAGCA ATGCGGATAG CTATAACAAA
AAAGGTTCGG TGACTGTTTT CGACAGTCAG GGTAATGCTC ATGACATGAG CGTCTACTTT
GTGAAGACCG GGGATAATAA CTGGCAGGTC TACACCCAGG ATAGCAGTGA TCCAAACAGC
ATTGCGAAGA CAGCGACAAC ACTGGAATTT AATGCTAATG GCACATTAGT GGATGGTGCG
ATGGCGAATA ATATCGCAAC CGGCGCAATT AACGGTGCAG ACCCCGCCAC GTTTAGTCTG
AGCTTCCTCA ACTCCATGCA GCAAAATACC GGCGCTAACA ATATTGTGGC AACCCCCCAG
AACGGCTACA AACCGGGTGA TCTGGTGAGT TATCAAATCA ATGATGACGG TACGGTTGTC
GGCAACTATT CCAACGAACA AACTCAACTG CTGGGGCAGA TTGTACTGGC GAACTTTGCC
AACAACGAAG GTCTGGCATC CGAAGGCGAC AACGTCTGGT CTGCGACGCA ATCTTCTGGC
GTGGCGCTGT TGGGGACAGC CGGGACGGGA AACTTTGGCA CCCTGACCAA CGGTGCGCTG
GAAGCGTCCA ACGTCGATCT CAGTAAAGAA CTGGTCAATA TGATCGTTGC CCAGCGTAAC
TATCAGTCTA ACGCCCAGAC CATCAAAACC CAGGACCAGA TCCTCAACAC GCTGGTTAAC
TTACGCTAA
 
Protein sequence
MAFSQAVSGL NAAATNLDVI GNNIVNSATY GFKSGTASFA DMFAGSKVGL GVKVAGITQD 
FTDGTTTNTG RGLDVAISQN GFFRLVDSNG SVFYSRNGQF KLDENRNLVN MQGLQLTGYP
ATGTPPTIQQ GANPTNISIP NTLMAAKTTT TASMQINLNS SDPLPTVTPF SASNADSYNK
KGSVTVFDSQ GNAHDMSVYF VKTGDNNWQV YTQDSSDPNS IAKTATTLEF NANGTLVDGA
MANNIATGAI NGADPATFSL SFLNSMQQNT GANNIVATPQ NGYKPGDLVS YQINDDGTVV
GNYSNEQTQL LGQIVLANFA NNEGLASEGD NVWSATQSSG VALLGTAGTG NFGTLTNGAL
EASNVDLSKE LVNMIVAQRN YQSNAQTIKT QDQILNTLVN LR
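
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal Python sketch, not the database's own method. It relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code (the two differ in permitted start codons, which this sketch does not handle). The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence shown above (60 nt, 20 codons).
first_row = "ATGGCCTTTT CTCAAGCGGT TAGCGGATTA AACGCTGCCG CCACTAACCT CGATGTTATT"
print(translate(first_row))  # MAFSQAVSGLNAAATNLDVI
```

Passing the full gene sequence in the same way should reproduce the protein sequence listed above, with the grouping spaces removed.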