Gene SeHA_C1289 details


Gene Information

Locus tag: SeHA_C1289
Symbol: flgE
ID: 6491200
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 1264154
End bp: 1265365
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 56%
IMG OID: 642741526
Product: flagellar hook protein FlgE
Protein accession: YP_002045176
Protein GI: 194449419
COG category: [N] Cell motility
COG ID: [COG1749] Flagellar hook protein FlgE
TIGRFAM ID: [TIGR03506] flagellar hook-basal body proteins
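
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1264154
end_bp = 1265365
gene_length_bp = 1212
protein_length_aa = 403

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminal stop codon.
expected_cds_length = (protein_length_aa + 1) * 3

print(span == gene_length_bp)                 # True
print(expected_cds_length == gene_length_bp)  # True
```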


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.209767
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value: 0.0125362
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTTTTT CTCAAGCGGT TAGCGGCCTG AACGCTGCGG CCACCAACCT TGATGTTATC 
GGTAATAACA TCGCCAACTC CGCCACCTAT GGCTTTAAGT CCGGTACGGC ATCATTTGCC
GATATGTTCG CCGGTTCCAA AGTGGGGCTG GGCGTAAAAG TGGCGGGGAT TACCCAGGAT
TTTACCGACG GTACGACAAC GAACACCGGG CGCGGGCTGG ATGTCGCGAT TAGCCAGAAC
GGTTTTTTCC GCCTGGTAGA CAGCAACGGT TCCGTGTTCT ATAGCCGCAA CGGCCAGTTC
AAACTGGACG AGAACCGTAA CCTGGTCAAT ATGCAGGGGA TGCAGTTGAC CGGCTATCCG
GCCACCGGTA CGCCGCCGAC CATTCAGCAG GGGGCGAATC CTGCGCCGAT CACCATTCCG
AACACGCTGA TGGCGGCGAA ATCGACCACC ACCGCGTCAA TGCAGATCAA CCTGAACTCA
ACGGACCCTG TACCGTCTAA AACGCCCTTT AGCGTGAGTG ATGCGGATTC GTATAACAAA
AAAGGCACCG TCACCGTTTA TGACAGCCAG GGTAATGCCC ATGACATGAA CGTCTATTTT
GTGAAAACCA AAGATAATGA ATGGGCCGTG TACACCCATG ACAGCAGCGA TCCTGCAGCC
ACAGCGCCGG CGGCACCATC AACCACTCTG GTATTTAACG CAAACGGGAC ACTGCAATCC
GGCGGTACGG TGAACATCAC CACCGGTACG ATTAATGGCG CGACAGCGGC CACCTTCTCC
CTCAGCTTCC TTAACTCCAT GCAGCAGAAC ACCGGGGCTA ACAACATCGT TGCCACCAAT
CAAAACGGCT ATAAGCCGGG CGACCTGGTG AGCTACCAGA TTAACAACGA CGGCACCGTA
GTTGGCAACT ACTCCAACGA GCAGGAGCAG GTGCTGGGGC AGATTGTGCT GGCTAACTTC
GCCAACAACG AAGGTCTGGC ATCCCAGGGC GATAACGTCT GGGCGGCGAC GCAGGCTTCC
GGGGTTGCGC TGCTGGGGAC TGCCGGTTCC GGCAACTTCG GTAAGCTGAC GAACGGCGCG
CTGGAAGCCT CTAACGTGGA TTTGAGTAAA GAGCTGGTGA ATATGATCGT CGCGCAGCGT
AACTACCAGT CGAATGCGCA GACCATCAAA ACCCAGGACC AGATCCTCAA TACGCTGGTT
AACCTGCGCT AA
 
Protein sequence
MSFSQAVSGL NAAATNLDVI GNNIANSATY GFKSGTASFA DMFAGSKVGL GVKVAGITQD 
FTDGTTTNTG RGLDVAISQN GFFRLVDSNG SVFYSRNGQF KLDENRNLVN MQGMQLTGYP
ATGTPPTIQQ GANPAPITIP NTLMAAKSTT TASMQINLNS TDPVPSKTPF SVSDADSYNK
KGTVTVYDSQ GNAHDMNVYF VKTKDNEWAV YTHDSSDPAA TAPAAPSTTL VFNANGTLQS
GGTVNITTGT INGATAATFS LSFLNSMQQN TGANNIVATN QNGYKPGDLV SYQINNDGTV
VGNYSNEQEQ VLGQIVLANF ANNEGLASQG DNVWAATQAS GVALLGTAGS GNFGKLTNGA
LEASNVDLSK ELVNMIVAQR NYQSNAQTIK TQDQILNTLV NLR
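
The protein sequence is the translation of the gene sequence under translation table 11. As a rough sketch of that relationship, the snippet below translates the first and last lines of the gene sequence. It assumes that table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and the `translate` helper is a hypothetical name, not a function from any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 shares these codon assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence listed above.
first_line_start = "ATGTCTTTTT CTCAAGCGGT TAGCGGCCTG"
last_line = "AACCTGCGCT AA"

print(translate(first_line_start))  # MSFSQAVSGL
print(translate(last_line))         # NLR*
```

The first output matches the opening ten residues of the protein sequence, and the second matches its final three residues followed by the stop codon.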