Gene SNSL254_A1274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1274 
SymbolflgE 
ID6486308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1261594 
End bp1262805 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content55% 
IMG OID642736674 
Productflagellar hook protein FlgE 
Protein accessionYP_002040431 
Protein GI194442954 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE 
TIGRFAM ID[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.637715 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.02291 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTTTT CTCAAGCGGT TAGCGGCCTG AACGCTGCGG CCACCAACCT TGATGTTATC 
GGTAATAACA TCGCCAACTC CGCCACCTAT GGCTTTAAGT CCGGTACGGC ATCATTTGCC
GATATGTTCG CCGGTTCCAA AGTGGGGCTG GGCGTAAAAG TGGCGGGGAT TACCCAGGAT
TTTACCGACG GTACGACAAC GAACACCGGG CGCGGGCTGG ATGTCGCGAT TAGCCAGAAC
GGTTTTTTCC GCCTGGTAGA CAGCAACGGT TCCGTGTTCT ATAGCCGCAA CGGCCAGTTC
AAACTGGACG AGAACCGTAA CCTGGTCAAT ATGCAGGGGA TGCAGTTGAC CGGCTATCCG
GCCACCGGTA CGCCGCCGAC CATTCAGCAG GGGGCGAATC CTGCGCCGAT CACCATTCCG
AACACGCTGA TGGCGGCGAA ATCGACCACC ACCGCGTCAA TGCAGATCAA CCTGAACTCA
ACGGACCCTG TACCGTCTAA AACGCCCTTT AGCGTGAGTG ATGCGGATTC GTATAACAAA
AAAGGCACCG TCACCGTTTA TGACAGCCAG GGTAATGCCC ATGACATGAA CGTCTATTTT
GTGAAAACCA AAGATAATGA ATGGGCCGTG TACACCCATG ACAGCAGCGA TCCTGCAGCC
ACTGCGCCAA CAACGGCGTC CACTACGCTG AAATTCAATG AAAACGGGAT TCTGGAGTCT
GGCGGTACGG TGAACATCAC CACCGGTACG ATTAATGGCG CGACAGCGGC CACCTTCTCC
CTCAGCTTCC TTAACTCCAT GCAGCAGAAC ACCGGGGCTA ACAACATCGT CGCCACCAAT
CAAAACGGCT ATAAGCCTGG CGACTTGGTG AGCTACCAGA TTAACAACGA CGGCACCGTA
GTTGGCAACT ACTCCAACGA GCAGGAGCAG GTGCTGGGGC AGATTGTGCT GGCTAACTTC
GCCAATAACG AAGGTCTGGC ATCCCAGGGC GATAACGTCT GGGCGGCGAC GCAGGCTTCC
GGGGTAGCGC TGCTGGGGAC TGCCGGTTCC GGCAACTTCG GTAAGCTGAC GAACGGCGCG
CTGGAAGCCT CTAACGTGGA TTTGAGTAAA GAGCTGGTGA ATATGATCGT CGCGCAGCGT
AACTACCAGT CGAATGCGCA GACCATCAAA ACTCAGGACC AGATCCTCAA TACGCTGGTT
AACCTGCGCT AA
 
Protein sequence
MSFSQAVSGL NAAATNLDVI GNNIANSATY GFKSGTASFA DMFAGSKVGL GVKVAGITQD 
FTDGTTTNTG RGLDVAISQN GFFRLVDSNG SVFYSRNGQF KLDENRNLVN MQGMQLTGYP
ATGTPPTIQQ GANPAPITIP NTLMAAKSTT TASMQINLNS TDPVPSKTPF SVSDADSYNK
KGTVTVYDSQ GNAHDMNVYF VKTKDNEWAV YTHDSSDPAA TAPTTASTTL KFNENGILES
GGTVNITTGT INGATAATFS LSFLNSMQQN TGANNIVATN QNGYKPGDLV SYQINNDGTV
VGNYSNEQEQ VLGQIVLANF ANNEGLASQG DNVWAATQAS GVALLGTAGS GNFGKLTNGA
LEASNVDLSK ELVNMIVAQR NYQSNAQTIK TQDQILNTLV NLR