Gene SeD_A2195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2195 
SymbolflgE 
ID6875421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2105979 
End bp2107190 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content55% 
IMG OID642785297 
Productflagellar hook protein FlgE 
Protein accessionYP_002215960 
Protein GI198245171 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE 
TIGRFAM ID[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.000430421 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTTTTT CTCAAGCGGT TAGCGGCCTG AACGCTGCGG CCACCAACCT TGATGTTATC 
GGTAATAACA TCGCCAACTC CGCCACCTAT GGCTTTAAGT CCGGTACGGC ATCATTTGCC
GATATGTTCG CCGGTTCCAA AGTGGGGCTG GGCGTAAAAG TGGCGGGGAT TACCCAGGAT
TTTACCGACG GTACGACAAC GAACACCGGG CGCGGGCTGG ATGTCGCGAT TAGCCAGAAC
GGTTTTTTCC GCCTGGTAGA CAGCAACGGT TCCGTGTTCT ATAGCCGCAA CGGCCAGTTC
AAACTGGACG AGAACCGTAA CCTGGTCAAT ATGCAGGGGA TGCAGTTGAC CGGCTATCCG
GCCACCGGTA CGCCGCCGAC CATTCAGCAG GGGGCGAATC CTGCGCCGAT CACCATTCCG
AACACGCTGA TGGCGGCGAA ATCGACCACC ACCGCGTCAA TGCAGATCAA CCTGAACTCA
ACGGACCCTG TACCGTCTAA AACGCCCTTT AGCGTGAGTG ATGCGGATTC GTATAACAAA
AAAGGCACCG TCACCGTTTA TGACAGCCAG GGTAATGCCC ATGACATGAA CGTCTATTTT
GTGAAAACCA AAGATAATGA ATGGGCTGTG TACACCCATG ACAGCAGCGA TCCTGCAGCC
ACTGCGCCAA CAACGGCGTC CACTACGCTG AAATTCAATG AAAACGGGAT TCTGGAGTCT
GGCGGTACGG TGAACATCAC CACCGGTACG ATTAATGGCG CGACAGCGGC CACCTTCTCC
CTCAGCTTCC TTAACTCCAT GCAGCAGAAC ACCGGGGCTA ACAACATCGT CGCCACCAAT
CAAAACGGCT ATAAGCCGGG CGACCTGGTG AGCTACCAGA TTAACAACGA CGGCACCGTA
GTTGGCAACT ACTCCAACGA GCAGGAGCAG GTGCTGGGGC AGATTGTGCT GGCTAACTTC
GCCAACAACG AAGGTCTGGC ATCCCAGGGC GATAACGTCT GGGCGGCGAC GCAGGCTTCC
GGGGTTGCGC TGCTGGGGAC TGCCGGTTCC GGCAACTTCG GTAAGCTGAC GAACGGCGCG
CTGGAAGCCT CTAACGTGGA TTTGAGTAAA GAGCTGGTGA ATATGATCGT CGCGCAGCGT
AACTACCAGT CGAATGCGCA GACCATCAAA ACCCAGGACC AGATCCTCAA TACGCTGGTT
AACCTGCGCT AA
 
Protein sequence
MSFSQAVSGL NAAATNLDVI GNNIANSATY GFKSGTASFA DMFAGSKVGL GVKVAGITQD 
FTDGTTTNTG RGLDVAISQN GFFRLVDSNG SVFYSRNGQF KLDENRNLVN MQGMQLTGYP
ATGTPPTIQQ GANPAPITIP NTLMAAKSTT TASMQINLNS TDPVPSKTPF SVSDADSYNK
KGTVTVYDSQ GNAHDMNVYF VKTKDNEWAV YTHDSSDPAA TAPTTASTTL KFNENGILES
GGTVNITTGT INGATAATFS LSFLNSMQQN TGANNIVATN QNGYKPGDLV SYQINNDGTV
VGNYSNEQEQ VLGQIVLANF ANNEGLASQG DNVWAATQAS GVALLGTAGS GNFGKLTNGA
LEASNVDLSK ELVNMIVAQR NYQSNAQTIK TQDQILNTLV NLR