Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C1289 |
Symbol | flgE |
ID | 6491200 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 1264154 |
End bp | 1265365 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642741526 |
Product | flagellar hook protein FlgE |
Protein accession | YP_002045176 |
Protein GI | 194449419 |
COG category | [N] Cell motility |
COG ID | [COG1749] Flagellar hook protein FlgE |
TIGRFAM ID | [TIGR03506] fagellar hook-basal body proteins |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.209767 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 0.0125362 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTTTT CTCAAGCGGT TAGCGGCCTG AACGCTGCGG CCACCAACCT TGATGTTATC GGTAATAACA TCGCCAACTC CGCCACCTAT GGCTTTAAGT CCGGTACGGC ATCATTTGCC GATATGTTCG CCGGTTCCAA AGTGGGGCTG GGCGTAAAAG TGGCGGGGAT TACCCAGGAT TTTACCGACG GTACGACAAC GAACACCGGG CGCGGGCTGG ATGTCGCGAT TAGCCAGAAC GGTTTTTTCC GCCTGGTAGA CAGCAACGGT TCCGTGTTCT ATAGCCGCAA CGGCCAGTTC AAACTGGACG AGAACCGTAA CCTGGTCAAT ATGCAGGGGA TGCAGTTGAC CGGCTATCCG GCCACCGGTA CGCCGCCGAC CATTCAGCAG GGGGCGAATC CTGCGCCGAT CACCATTCCG AACACGCTGA TGGCGGCGAA ATCGACCACC ACCGCGTCAA TGCAGATCAA CCTGAACTCA ACGGACCCTG TACCGTCTAA AACGCCCTTT AGCGTGAGTG ATGCGGATTC GTATAACAAA AAAGGCACCG TCACCGTTTA TGACAGCCAG GGTAATGCCC ATGACATGAA CGTCTATTTT GTGAAAACCA AAGATAATGA ATGGGCCGTG TACACCCATG ACAGCAGCGA TCCTGCAGCC ACAGCGCCGG CGGCACCATC AACCACTCTG GTATTTAACG CAAACGGGAC ACTGCAATCC GGCGGTACGG TGAACATCAC CACCGGTACG ATTAATGGCG CGACAGCGGC CACCTTCTCC CTCAGCTTCC TTAACTCCAT GCAGCAGAAC ACCGGGGCTA ACAACATCGT TGCCACCAAT CAAAACGGCT ATAAGCCGGG CGACCTGGTG AGCTACCAGA TTAACAACGA CGGCACCGTA GTTGGCAACT ACTCCAACGA GCAGGAGCAG GTGCTGGGGC AGATTGTGCT GGCTAACTTC GCCAACAACG AAGGTCTGGC ATCCCAGGGC GATAACGTCT GGGCGGCGAC GCAGGCTTCC GGGGTTGCGC TGCTGGGGAC TGCCGGTTCC GGCAACTTCG GTAAGCTGAC GAACGGCGCG CTGGAAGCCT CTAACGTGGA TTTGAGTAAA GAGCTGGTGA ATATGATCGT CGCGCAGCGT AACTACCAGT CGAATGCGCA GACCATCAAA ACCCAGGACC AGATCCTCAA TACGCTGGTT AACCTGCGCT AA
|
Protein sequence | MSFSQAVSGL NAAATNLDVI GNNIANSATY GFKSGTASFA DMFAGSKVGL GVKVAGITQD FTDGTTTNTG RGLDVAISQN GFFRLVDSNG SVFYSRNGQF KLDENRNLVN MQGMQLTGYP ATGTPPTIQQ GANPAPITIP NTLMAAKSTT TASMQINLNS TDPVPSKTPF SVSDADSYNK KGTVTVYDSQ GNAHDMNVYF VKTKDNEWAV YTHDSSDPAA TAPAAPSTTL VFNANGTLQS GGTVNITTGT INGATAATFS LSFLNSMQQN TGANNIVATN QNGYKPGDLV SYQINNDGTV VGNYSNEQEQ VLGQIVLANF ANNEGLASQG DNVWAATQAS GVALLGTAGS GNFGKLTNGA LEASNVDLSK ELVNMIVAQR NYQSNAQTIK TQDQILNTLV NLR
|
| |