Gene SeD_A4837 details


Gene Information

Locus tag: SeD_A4837
Symbol: treR
ID: 6875487
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 4690803
End bp: 4691750
Gene Length: 948 bp
Protein Length: 315 aa
Translation table: 11
GC content: 56%
IMG OID: 642787725
Product: trehalose repressor
Protein accession: YP_002218319
Protein GI: 198242111
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID: [TIGR02405] trehalose operon repressor, proteobacterial
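
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4690803
END_BP = 4691750


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.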


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 74
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAACC GGCTCACTAT CAAAGACATC GCCCGCCTGA GCGGCGTAGG GAAATCAACC 
GTTTCCCGCG TGCTTAACAA TGAAAGCGGC GTAAGCGAAC GTACCCGCGA GCGTGTCGAA
GCGGTGATGA ATCAACACGG TTTCTCCCCG TCCCGCTCTG CCCGCGCGAT GCGGGGACAA
AGCGATAAAG TGGTCGCTAT TATCGTCACT CGCCTTGATT CGTTGTCTGA AAACCTCGCG
GTTCAGACCA TGCTGCCTGC GTTTTATGAA CAGGGCTACG ACCCTATTAT GATGGAAAGT
CAGTTCTCGC CGACGCTGGT AATGGAACAT CTGGGCATGC TCAGACGACG TAACATTGAT
GGCGTGGTGC TGTTTGGCTT TACCGGCATC ACAGAAGAGT TGATCGCCCC CTGGAAAGCC
TCGCTGGTGC TGCTGGCAAG AGATGCGCAA GGTTTTGCCT CCGTCTGTTA CGACGACGAG
GGCGCGATTC ATATCCTTAT GCAGCGGCTG TATGAGCAGG GACACCGCAA CATTAGCTTT
CTGGGCGTTC CCCATAGCGA TATTACCACC GGCAAACGTC GGCATGACGC ATACCTGGCG
TTTTGCAAAA AACATAAACT TCATCCCGTC GCCGCCCTGC CCGGTCTTGC CATGAAGCAG
GGCTATGAGC ATACGGCAAG CGTCATCATG CCGGATACCA CCGCGTTAGT CTGCGCCACC
GATACGCTGG CGTTGGGCGC CAGTAAGTAT TTACAGGAGC AACGTATTGA GACGCTGCAA
CTGGCAAGCG TCGGGAACAC GCCGCTGATA AAATTCCTGC ACCCGGAGAT CGTCACTGTC
GATCCTGGCT ATGCTGAAGC CGGACGACAG GCGGCTTCGC AGCTGATCGA ACAGATCAAT
GGCCGCTGCG ATCCGCGCCG GATCGTCATT CCTTCTACCC TCGCCTGA
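
A GC content figure such as the one in the Gene Information section can be computed directly from a nucleotide string. The sketch below is a minimal illustration, not the database's own method: `gc_fraction` is a hypothetical helper, and it is applied here only to the first 60 bases of the gene sequence, so its result is for that fragment and will not necessarily match the whole-gene value.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq.

    Whitespace (such as the 10-base block spacing used above) and any
    other non-ACGT characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# First line of the gene sequence, copied with its block spacing.
FIRST_LINE = "ATGCAAAACC GGCTCACTAT CAAAGACATC GCCCGCCTGA GCGGCGTAGG GAAATCAACC"

print(f"{gc_fraction(FIRST_LINE):.1%}")
```

Passing the full 948-base sequence instead of `FIRST_LINE` gives the whole-gene figure.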
 
Protein sequence
MQNRLTIKDI ARLSGVGKST VSRVLNNESG VSERTRERVE AVMNQHGFSP SRSARAMRGQ 
SDKVVAIIVT RLDSLSENLA VQTMLPAFYE QGYDPIMMES QFSPTLVMEH LGMLRRRNID
GVVLFGFTGI TEELIAPWKA SLVLLARDAQ GFASVCYDDE GAIHILMQRL YEQGHRNISF
LGVPHSDITT GKRRHDAYLA FCKKHKLHPV AALPGLAMKQ GYEHTASVIM PDTTALVCAT
DTLALGASKY LQEQRIETLQ LASVGNTPLI KFLHPEIVTV DPGYAEAGRQ AASQLIEQIN
GRCDPRRIVI PSTLA
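
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example below is a hedged illustration rather than the tool used to produce this record: it builds the standard codon assignments, which translation table 11 shares for elongation (table 11 differs in the start codons it permits), and translates only the first 60 and last 48 bases of the gene. `CODON_TABLE` and `translate` are names introduced here for illustration.

```python
from itertools import product

# Codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last lines of the gene sequence, copied from the record.
FIRST_60 = "ATGCAAAACC GGCTCACTAT CAAAGACATC GCCCGCCTGA GCGGCGTAGG GAAATCAACC"
LAST_48 = "GGCCGCTGCG ATCCGCGCCG GATCGTCATT CCTTCTACCC TCGCCTGA"

print(translate(FIRST_60))
print(translate(LAST_48))
```

The two fragments translate to the opening and closing residues of the protein sequence, with the final `*` standing for the TGA stop codon, which is not part of the 315 aa product.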