Gene SeD_A3184 details

Gene Information

Locus tag: SeD_A3184
Symbol: eprH
ID: 6873503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3060733
End bp: 3061797
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 50%
IMG OID: 642786204
Product: type III secretion apparatus protein PrgH/EprH
Protein accession: YP_002216845
Protein GI: 198244606
COG category:
COG ID:
TIGRFAM ID: [TIGR02554] type III secretion system protein PrgH/EprH
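
The coordinates and lengths listed for this gene are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3060733, 3061797)
print(length, protein_length(length))  # 1065 354
```

Both values agree with the Gene Length and Protein Length fields.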


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value: 0.736227
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTAGGTC AGAGTGATGC GCTCACTGCT TCAGGCCAAC TCCCTGATAT ACCTGCCGAT 
AGCTTTTTTA TCCCGCTGGA CCATGGCGGA GTAAATTTTG AAATCCAGGT GGATACGGAT
ACGACCGAAA TTATACTCCA TGAGCTGAAA GAAGGAAATT CTGAATCTCG TTCGGTGCAA
TTAAATACGC CAATACAGGT CGGTGAATTG CTTATCCTGA TTCGCCCGGA AAGCGAGCCG
TGGGTGCCCG AGCAGCCTGA GAAGTTAGAA ACGTCTGCAA AAAAGAACGA GCCGCGTTTT
AAAAACGGAA TTGTAGCCGC ACTGGCCGGG TTTTTTATAT TGGGAATTGG GACTGTGGGG
ACGTTATGGA TACTTAACTC GCCGCAGCGG CAGGCCGCAG AGCTCGATTC GTTATTGGGG
CAGGAGAAGG AGCGTTTTCA GGTGTTGCCA GGCCGGGACA AAATGCTCTA TGTCGCTGCG
CAAAATGAAA GAGATACGCT GTGGGCTCGT CAGGTTTTAG CGAGGGGCGA TTATGATAAA
AATGCGCGAG TGATTAACGA AAACGAAGAA AATAAGCGTA TCTCTATCTG GCTGGATACC
TATTATCCGC AGCTGGCTTA TTATCGGATT CATTTCGATG AGCCGCGTAA ACCCGTTTTC
TGGCTAAGCC GCCAGCGAAA CACGATGAGC AAGAAAGAGC TCGAGGTGTT AAGTCAAAAG
CTGAGAGCGC TAATGCCTTA CGCGGATTCG GTTAACATCA CGTTGATGGA CGATGTTACC
GCAGCAGGCC AGGCGGAAGC GGGGCTAAAA CAGCAGGCGT TACCTTATTC CCGCAGGAAT
CATAAGGGGG GCGTAACGTT TGTTATTCAG GGGGCGCTCG ATGATGTAGA AATACTCAGA
GCCCGTCAAT TTGTCGATAG CTATTACCGC ACATGGGGAG GACGCTATGT GCAGTTTGCG
ATCGAATTAA AAGATGACTG GCTCAAGGGG CGCTCATTTC AGTACGGGGC GGAAGGTTAT
ATCAAAATGA GCCCAGGCCA TTGGTATTTC CCAAGCCCAC TTTAA
 
Protein sequence
MVGQSDALTA SGQLPDIPAD SFFIPLDHGG VNFEIQVDTD TTEIILHELK EGNSESRSVQ 
LNTPIQVGEL LILIRPESEP WVPEQPEKLE TSAKKNEPRF KNGIVAALAG FFILGIGTVG
TLWILNSPQR QAAELDSLLG QEKERFQVLP GRDKMLYVAA QNERDTLWAR QVLARGDYDK
NARVINENEE NKRISIWLDT YYPQLAYYRI HFDEPRKPVF WLSRQRNTMS KKELEVLSQK
LRALMPYADS VNITLMDDVT AAGQAEAGLK QQALPYSRRN HKGGVTFVIQ GALDDVEILR
ARQFVDSYYR TWGGRYVQFA IELKDDWLKG RSFQYGAEGY IKMSPGHWYF PSPL
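
The protein sequence follows from the gene sequence under translation table 11. The gene begins with GTG, which normally encodes valine but is read as methionine when it serves as the initiation codon. The following is a minimal sketch of that translation, not the database's own pipeline. `translate_cds` is a hypothetical helper, and `START_CODONS` is an assumed subset of the alternative starts that table 11 allows. The two fragments are the first 30 and last 15 bases of the gene sequence in this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
# Table 11 shares these codon-to-residue assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a subset of the initiation codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An alternative start codon is read as methionine at initiation.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]  # raises KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


head = "GTGGTAGGTC AGAGTGATGC GCTCACTGCT"  # first 30 bases of the gene
tail = "CCAAGCCCAC TTTAA"                  # last 15 bases of the gene
print(translate_cds(head))  # MVGQSDALTA
print(translate_cds(tail))  # PSPL
```

The two outputs match the first ten and last four residues of the protein sequence above. Passing the full 1065 bp gene sequence to the same function works the same way, since whitespace between the 10-base groups is ignored.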