Gene SeD_A4249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4249 
Symbol 
ID6874101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4095652 
End bp4097442 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content56% 
IMG OID642787179 
ProductHTH-type transcriptional regulator SgrR 
Protein accessionYP_002217805 
Protein GI198244077 
COG category[R] General function prediction only 
COG ID[COG4533] ABC-type uncharacterized transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones83 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTCATTC CCGTTCGGGA AATTTCACCT TTTATGACAA CCCGGCACAC TGAACAAAAA 
TACTTAAAGC TACTCCAGCA CTATGGCGAC AAGCCTGTTA GCGTGACATT ACAGGAGCTG
GCGGATGTGT TGTTCTGCAC CCGACGGCAT ATGCGTAATC TGTTGCTTCA GATGCAGGAG
GCAAAGTGGC TCATCTGGCA ATCACAGGCC GGACGTGGGC ATCGCGCCCG GCTTCACCTG
CGCTATAAAC CAGAACAGCT GTTAAGCGAA AAGGCGGAGC AGTTACTGGA GTCTGGTCAT
ATTGATCAGG CCATTCAGCT GCTGGGTAAA AATAAGCACC AGGTGGCGCA ACTGCTGCGT
TCAAAGCTGG GCTATAGCGT GCGGGCAGAC TATCAGCGGC TGTGCATCCC CTATTATCGG
ACAATGCCGT CGCTGTGCCC CGGCATACCG TTGCGTCGCT CTGAGCAGCA TCTGGTCAGG
CAGATTTTTA GCGGCCTGAC GCGCATAAAT GAGGAAAAAG GTGAAGTCGA AGCCGATCTT
GCCCACCACT GGCGGCAGAT TGATCCACTG CGCTGGCGTT TTTATCTGCG CCCCGCCGTC
CTCTGGCATG ATGGTCAGGA GCTGACGATC GACGCGGTTA TCGCTTCACT GACCCGCAGC
GCTAAGCTGC CGTTGTTCTC GCACTTGCAG ACCATTCAGG CCACCGGGCC GCTGAGCCTT
GAAATTACGC TGGCGCACCC GGATAACCGA CTGCCGCTGC TGCTCAGTCG TATTGATGCC
ATGATCCTAC CGCCTGACCA TACACAACGC GCTGATTTCC CGGCACATCC CGTGGGAACT
GGCCCTTATG AGGTGGTGGA AAACAATGGC TTTCATCTGC AAATGAAGGC CTTTGACCAC
TATTTCGGTC TGCGCGGGCT GCTGGATGAA GTGGAGGTCT TTATCTGGCC GAATTTAACG
GAGACAGATA ACCTGGCGGA ATCGCTGTCG GATAACGACA CGGCAGCCTG GCTCAGTTCC
AGCCTGAGCG ATGAGGATTA CGTTTCCGGA CGGCTTAGCC AGGTATCAGG CAAACCTTCT
GACAACCTGC GCGAGATGTT TCTTGAGCGT GGAGGCTATT TTTTATTATG CGACAGTCGC
TCTCCGCACT GGCATACCGC CGAACATCGC CGCTGGCTGC GGGAAACACT CAGCCCTTAC
GCCTTACTCC AGCATCTGAG TGAGGCAATT CGCCCCTTCT GGGTACCGGG CGGCAGCCTA
CTGTCCTCCT GGTTTCATAC TATTGAGGCG GGCCCAGCCT GTTCACCTTT TATCTCGTCG
TCGCCCTACG CAAAACTGCG TCTGGCCTAT CACGATCAGC ACCCTGAATT TCCGATGCTC
CTGGATATCA TGCAAGAGAT CATGCGCCAG CAGGGCATTT TACTTGAGGG CGTGGAGCTG
AATTATGATG ACTGGGCGAA TGGCAAAGCC AATGTGGATC TCTGGCTGGG GACGGTCAAT
TTCCCCATTC CCGAAGAGTG GAACGTCGGT ACATGGCTGC TGGGCTCCCC TTTACTGCGC
CACGCCATCA GAGGTGGGGA TGATGCGCTG CTGGCCCAAT GGGAAACCCA GTGGCATGCC
GAAACCATCA GCGCGGAACA ACTGGTCAGG GAAACCACCC GTTCAGGCTG GCTACAGCCG
CTTTTTCACC ACTGGATGCG ACTCAAAGGC CCCGACCGGG CCAGGGGGAT CCACCTGAAT
AACCTGGGAT GGTTTGATTT CCGATCCACC TGGATTGAGC CAGGGCCTTA A
 
Protein sequence
MLIPVREISP FMTTRHTEQK YLKLLQHYGD KPVSVTLQEL ADVLFCTRRH MRNLLLQMQE 
AKWLIWQSQA GRGHRARLHL RYKPEQLLSE KAEQLLESGH IDQAIQLLGK NKHQVAQLLR
SKLGYSVRAD YQRLCIPYYR TMPSLCPGIP LRRSEQHLVR QIFSGLTRIN EEKGEVEADL
AHHWRQIDPL RWRFYLRPAV LWHDGQELTI DAVIASLTRS AKLPLFSHLQ TIQATGPLSL
EITLAHPDNR LPLLLSRIDA MILPPDHTQR ADFPAHPVGT GPYEVVENNG FHLQMKAFDH
YFGLRGLLDE VEVFIWPNLT ETDNLAESLS DNDTAAWLSS SLSDEDYVSG RLSQVSGKPS
DNLREMFLER GGYFLLCDSR SPHWHTAEHR RWLRETLSPY ALLQHLSEAI RPFWVPGGSL
LSSWFHTIEA GPACSPFISS SPYAKLRLAY HDQHPEFPML LDIMQEIMRQ QGILLEGVEL
NYDDWANGKA NVDLWLGTVN FPIPEEWNVG TWLLGSPLLR HAIRGGDDAL LAQWETQWHA
ETISAEQLVR ETTRSGWLQP LFHHWMRLKG PDRARGIHLN NLGWFDFRST WIEPGP