Gene SeD_A1947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1947 
Symbol 
ID6872186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1874900 
End bp1876099 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content43% 
IMG OID642785069 
Producttype III secretion apparatus protein, YscD/HrpQ family 
Protein accessionYP_002215735 
Protein GI198243835 
COG category 
COG ID 
TIGRFAM ID[TIGR02500] type III secretion apparatus protein, YscD/HrpQ family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value0.389941 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTAATC CAAAGAGTTC CTGGAAAATA CGTTTTTTAG GTCACGTTTT ACAAGGCCGG 
GAAGTATGGC TGAATGAAGG TAACCTGTCA CTGGGGGAGA AAGGATGCGA TATTTGTATT
CCGCTTACTA TAAATGAAAA AATTATTCTG AGAGAACAGG CAGATAATTT ATTTGTTGAT
GCCGGGAAAG CCAGAGTTAG AGTTAATGGC CGCAGATTTA ATCCAAATAA GCCGCTACCA
TCCAGTGGGG TTTTGCAGGT TGCGGGAGTG GCTATCGCGT TTGGTAAACA GGATTGTGAA
CTTGCTGATT ATCAAATACC CGTTTCCAGA TCAGGGTACT GGTGGTTGGC TGGCGTATTC
TTGATTTTCA TCGGTGGAAT GGGTGTCCTG TTAAGTATTA GTGGTCAGCC TGAAACGGTA
AATGACTTAC CTTTGCGGGT TAAGTTTTTA TTAGACAAAA GCAATATTCA TTATGTGCGG
GCGCAATGGA AAGAAGATGG CAGCCTGCAG TTGTCCGGTT ATTGCTCGTC AAGCGAACAG
ATGCAAAAGG TGAGAGCGAC TCTCGAATCA TGGGGGGTCA TGTATCGGGA TGGTGTAATC
TGTGATGACT TATTGGTACG AGAAGTGCAG GATGTTTTGA TAAAAATGGG TTACCCGCAT
GCTGAAGTAT CCAGCGAAGG GCCGGGGAGC GTGTTAATTC ATGATGATAT ACAAATGGAT
CAGCAATGGC GTAAGGTTCA ACCATTACTT GCAGATATTC CCGGGTTATT GCACTGGCAG
ATTAGTCACT CTCATCAGTC TCAGGGAGAT GATATTATTT CTGCGATAAT AGAGAACGGT
TTAGTGGGGC TTGTCAATGT TACGCCAATG CGGCGCTCTT TTGTTATCAG TGGTGTACTG
GATGAATCTC ATCAACGCAT TTTGCAAGAA ACGTTAGCAG CATTAAAGAA AAAGGATCCC
GCTCTTTCTT TAATTTATCA GGATATTGCG CCTTCCCATG ATGAAAGCAA GTATCTGCCT
GCGCCAGTGG CTGGCTTTGT ACAGAGTCGC CATGGTAATT ACTTATTACT GACGAATAAA
GAGCGTTTAC GTGTAGGGGC ATTGTTACCC AATGGGGGAG AAATTGTCCA TCTGAGTGCC
GATGTGGTAA CGATTAAACA TTATGATACT TTGATTAACT ATCCATTAGA TTTTAAGTGA
 
Protein sequence
MVNPKSSWKI RFLGHVLQGR EVWLNEGNLS LGEKGCDICI PLTINEKIIL REQADNLFVD 
AGKARVRVNG RRFNPNKPLP SSGVLQVAGV AIAFGKQDCE LADYQIPVSR SGYWWLAGVF
LIFIGGMGVL LSISGQPETV NDLPLRVKFL LDKSNIHYVR AQWKEDGSLQ LSGYCSSSEQ
MQKVRATLES WGVMYRDGVI CDDLLVREVQ DVLIKMGYPH AEVSSEGPGS VLIHDDIQMD
QQWRKVQPLL ADIPGLLHWQ ISHSHQSQGD DIISAIIENG LVGLVNVTPM RRSFVISGVL
DESHQRILQE TLAALKKKDP ALSLIYQDIA PSHDESKYLP APVAGFVQSR HGNYLLLTNK
ERLRVGALLP NGGEIVHLSA DVVTIKHYDT LINYPLDFK