Gene SeHA_C3063 details

Gene Information

Locus tag: SeHA_C3063
Symbol: eprH
ID: 6491785
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 2990856
End bp: 2991920
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 50%
IMG OID: 642743218
Product: type III secretion apparatus protein PrgH/EprH
Protein accession: YP_002046837
Protein GI: 194448137
COG category:
COG ID:
TIGRFAM ID: [TIGR02554] type III secretion system protein PrgH/EprH
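
The coordinate and length fields in this record agree with one another, which a little arithmetic can confirm. The following is a minimal sketch in Python. It assumes the coordinates are 1-based and inclusive, and that the last codon of the CDS is an untranslated stop codon. Neither convention is stated in the record, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2990856
end_bp = 2991920

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1065 0 354
```

Under those assumptions the result matches the recorded gene length of 1065 bp and protein length of 354 aa.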


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.69246
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 64
Fosmid unclonability p-value: 0.188447
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTAGGTC AGAGTGATGC GCTCACTGCT TCAGGTCAAC TCCCTGATAT ACCTGCCGAT 
AGCTTTTTTA TCCCGCTGGA CCATGGCGGA GTAAATTTTG AAATCCAGGT GGATACGGAT
GCGACCGAAA TTATACTCCA TGAGCTGAAA GAAGGAAATT CTGAATCTCG TTCGGTGCAA
TTAAATACGC CAATACAGGT CGGTGAATTG CTTATCCTGA TTCGCCCGGA AAGCGAGCCG
TGGGTGCCCG AGCAGCCTGA GAAGTTAGAA ACGTCTGCAA AAAAGAACGA GCCGCGTTTT
AAAAACGGAA TTGTAGCCGC ACTGGCCGGG TTTTTTATAT TGGGAATTGG GACTGTGGGG
ACGTTATGGA TACTTAACTC GCCGCAGCGG CAGGCCGCAG AGCTCGATTC GTTATTGGGG
CAGGAGAAGG AGCGTTTTCA GGTGTTGCCA GGACGGGACA AAATGCTCTA TGTCGCTGCG
CAAAATGAAA GAGATACGCT GTGGGCTCGT CAGGTTTTAG CGAGGGGCGA TTATGATAAA
AATGCGCGAG TGATTAACGA AAACGAAGAA AATAAGCGTA TCTCTACCTG GCTGGATACC
TATTATCCGC AGCTTGCTTA TTATCGGCTT CATTTCGATG AGCCGCGTAA ACCCGTTTTC
TGGCTAAGCC GCCAGCGAAA CACGATGAGC AAGAAAGAGC TCGAGGTGTT AAGTCAAAAG
CTGAGAGCGC TAATGCCTTA CGCGGATTCG GTTAACATCA CGTTGATGGA CGATGTTACC
GCAGCAGGCC AGGCGGAAGC GGGGCTAAAA CAGCAGGCGT TACCTTATTC CCGCAGGAAT
CATAAGGGGG GCGTAACGTT TGTTATTCAG GGGGCGCTCG ATGATGTAGA AATACTCAGA
GCCCGTCAAT TTGTCGATAG CTATTACCGC ACATGGGGAG GACGCTATGT GCAGTTTGCG
ATCGAATTAA AAGATGACTG GCTCAAGGGG CGCTCATTTC AGTACGGGGC GGAAGGTTAT
ATCAAAATGA GCCCAGGCCA TTGGTATTTC CCAAGCCCAC TTTAA
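
The GC content field of this record could be recomputed from the gene sequence roughly as follows. This is a sketch only: `gc_percent` is a hypothetical helper, and how the database rounds the value is not stated in the record.

```python
def gc_percent(sequence_text: str) -> float:
    """Return the percentage of G and C bases in a sequence.

    Whitespace (the 10-base block spacing and line breaks used in this
    record) is ignored, and the input is treated case-insensitively.
    """
    bases = "".join(sequence_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, used only as a short example input.
first_line = "GTGGTAGGTC AGAGTGATGC GCTCACTGCT TCAGGTCAAC TCCCTGATAT ACCTGCCGAT"
print(f"{gc_percent(first_line):.1f}%")
```

Passing the full gene sequence block instead of a single line gives the value for the whole gene.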
 
Protein sequence
MVGQSDALTA SGQLPDIPAD SFFIPLDHGG VNFEIQVDTD ATEIILHELK EGNSESRSVQ 
LNTPIQVGEL LILIRPESEP WVPEQPEKLE TSAKKNEPRF KNGIVAALAG FFILGIGTVG
TLWILNSPQR QAAELDSLLG QEKERFQVLP GRDKMLYVAA QNERDTLWAR QVLARGDYDK
NARVINENEE NKRISTWLDT YYPQLAYYRL HFDEPRKPVF WLSRQRNTMS KKELEVLSQK
LRALMPYADS VNITLMDDVT AAGQAEAGLK QQALPYSRRN HKGGVTFVIQ GALDDVEILR
ARQFVDSYYR TWGGRYVQFA IELKDDWLKG RSFQYGAEGY IKMSPGHWYF PSPL
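
The protein sequence can be related to the gene sequence through translation table 11, which explains why the leading GTG codon appears as M rather than V. Below is a minimal sketch, assuming the NCBI ordering of codons over T, C, A, G and the table 11 start-codon set as published by NCBI. `translate_cds` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Amino acids in NCBI order for codons enumerated over T, C, A, G.
# Table 11 assigns the same amino acids as the standard code; it differs
# in which codons may serve as translation start sites.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for table 11 (assumption: taken from the NCBI table).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence_text: str) -> str:
    """Translate a coding sequence, ignoring block spacing and line breaks."""
    dna = "".join(sequence_text.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for index in range(0, len(dna), 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon: translation ends here
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGGTAGGTC AGAGTGATGC GCTCACTGCT"))  # prints: MVGQSDALTA
```

The printed residues match the first ten residues of the protein sequence in this record. Passing the whole gene sequence block to the same function would translate the full CDS.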