Gene SeD_A0668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0668 
Symbol 
ID6872798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp667700 
End bp668875 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content54% 
IMG OID642783882 
Productheat shock protein DnaJ domain-containing protein 
Protein accessionYP_002214568 
Protein GI198243015 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1076] DnaJ-domain-containing proteins 1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAGA TCATCAAACG TTTAGAGATC ATCAAAAGCG CCATCGAACT TGAGGACGAA 
GAGATTATCC GTCAGCAGCT CATCTACCTG AAAAATGAGC CGCAGGATGC CGTAATCAGC
GCGATTGCCC AGGCGATTGA AGCTCGTCGA TTCAGTGACG CCATGCAAGA GATCGCCGCC
TGGCTACAGG CTCAACGGGC GCTCTCGACG TGGCAAGATC CCTCTATTGC CGCCAGTAAA
CTGGAGCTGA AAGCGCTTGA AGCCCAACTT CGCGATCTGA TTGACAAACG AAATGCGCGG
GTACAAATCC TCGACGATTT CAACGACCTC TATCATCTGC GTCTCGGGCC GTTGATGAGC
CGTATTCTGG AGTTGCGTAA ACAGCTCGCG GTGAGTATGC AGCGTAAGCA AGAAGCCGAA
ATAAAACGCC GGGAAAAGGA TTATCAATCC TGCCTGCAAT TTATTTCCCA GGCCGTGGAT
CAACTGGCAA CGCTAAAACA GCAGTGGACA GGATTGAATG CCGCCTCGCG CGAAGCGGTG
GGCATCCGTC AGCGAATCCA GCAGCAGACG GAATTAATTA CCGCGTTGCT GGCGGAAATT
CGCGAGCTGG AAGCGGATTT TTCCCATCAG GACGACAGCG CATTCCGCCA GGCGCAGGAA
AATGCGGAAC AGGACTATCA CCAGTACCGG GAGCAGCAGC AGGAAGCGCA ATTCCGCTAC
GCTCGCGATC AACGTTTGTC GGCTGACGAA CGCAATGAAT TAAAACGTTT GTGGCGTCAG
GCCAGCCGCC TGTGTCACCC GGATGTGGTC GCCGATGAAT TAAAAGAAAA AGCGCACCAG
ATGATGGTAC AACTTAATCA GGCGCGGCAG AATGCCGATC TGGCGGCAAT TCGCGCGTTA
TTGACGCAGC TGCAAAGCGG TCTGGAACCG ATGATGGCAA GCGACAGGCT CAATAACCTG
GAACATCTGC GCCATAAAAT ACGCCAGCTC CGCACGCAAA TCGACGCGCT GTTGAAAGAG
ATAACGCAGC TGGAAACGGA AAACGCCTGG CGGCTCGCCT CTTCCGTTGC GGATAAGGAA
GCCTATTTCT CCGAGCAGGA ACGGGCGCTA ACCGAAATTC GCAATACGCT GGAGGCGCAG
GTTCAACAGG TGGAACAGGA ACTTCTGTCA GGGTAG
 
Protein sequence
MNKIIKRLEI IKSAIELEDE EIIRQQLIYL KNEPQDAVIS AIAQAIEARR FSDAMQEIAA 
WLQAQRALST WQDPSIAASK LELKALEAQL RDLIDKRNAR VQILDDFNDL YHLRLGPLMS
RILELRKQLA VSMQRKQEAE IKRREKDYQS CLQFISQAVD QLATLKQQWT GLNAASREAV
GIRQRIQQQT ELITALLAEI RELEADFSHQ DDSAFRQAQE NAEQDYHQYR EQQQEAQFRY
ARDQRLSADE RNELKRLWRQ ASRLCHPDVV ADELKEKAHQ MMVQLNQARQ NADLAAIRAL
LTQLQSGLEP MMASDRLNNL EHLRHKIRQL RTQIDALLKE ITQLETENAW RLASSVADKE
AYFSEQERAL TEIRNTLEAQ VQQVEQELLS G