Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A4141 |
Symbol | |
ID | 6482439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 4033359 |
End bp | 4035149 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642739397 |
Product | HTH-type transcriptional regulator SgrR |
Protein accession | YP_002043106 |
Protein GI | 194446062 |
COG category | [R] General function prediction only |
COG ID | [COG4533] ABC-type uncharacterized transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.898899 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 90 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTCATTC CCGTTCGGGA AATTTCACCT TTTATGACAA CCCGGCACAC TGAACAAAAA TACTTAAAGC TACTCCAGCA CTATGGCGAT AAGCCCGTTA GCGTGACGCT ACAGGAGCTG GCGGATGTGT TGTTCTGCAC CCGGCGGCAT ATGCGTAATC TGTTGCTTCA GATGCAGGAG GCAAAGTGGC TCATCTGGCA ATCACAGGCC GGACGTGGGC ATCGCGCCCG GCTTCACCTG CGCTATAAAC CAGAACAGCT GTTAAGCGAA AAGGCGGAGC AGTTGCTGGA GTCTGGTCAT GTTGATCAGG CCATTCAGCT GCTGGGTAAA AATAAGCACC AGGTGGCGCA ACTGCTGCGC TCAAAACTGG GCTATAGCGT GCGGGCAGAC TATCAGCGGC TGTGCATCCC CTATTACCGG ACAATGCCGT CGCTGTGCCC CGGCATACCG TTGCGTCGCT CTGAGCAGCA TCTGGTCAGG CAGATTTTTA GCGGCCTGAC GCGCATAAAT GAGGAAAAAG GTGAAGTCGA AGCCGATCTT GCCCACCACT GGCGGCAGAT TGATCCACTG CGCTGGCGTT TTTATCTGCG CCCCGCCGTC CTCTGGCATG ATGGTCAGGA GCTGACGATC GACGCGGTTA TCGCTTCACT GACCTGCAGC GCTAAGCTGC CGTTGTTCTC GCACTTGCAG ACCATTCAGG CCACCGGGCC GCTGAGCCTT GAAATTACGC TGGCGCACCC GGATAACCGA CTGCCGCTGC TGCTCAGCCA TATTGATGCC ATGATCCTAC CGCCTGACCA TACACAACGC GCTGATTTCC CGGCACATCC TGTGGGGACT GGCCCTTATG AGGTGGTGGA AAACAATGGC TTTCATCTGC AAATGAAGGC CTTTGACCAC TATTTCGGCC TGCGCGGGCT GCTGGATGAA GTGGAGGTCT TTATCTGGCC GAATTTAACG GAGACAGACA ACCTGGCGGA ATCGCTGTCG GATAACGACA CGGCAGCCTG GCTCAGCTCC AGCCTGAGCG ATGAGGATTA CGTTTCCGGA CGGCTTAGCC AGGTATCAGG CAAACCTTCT GACAACCTGC GCGAGATGTT TCTTGAGCGT GGAGGATATT TTTTATTATG CGACAGCCGC TCCCCGCACT GGCATACCGC CGAACATCGC CGCTGGCTGC GGGAAACTCT CAGCCCTTAC GCCTTACTCC AGCATCTGAG TGAGGCAATT CGCCCCTTCT GGGTACCGGG CGGCAGCCTG CTGTCCTCCT GGTTTCATAC TATTGAGGCG GGCCCGGCCT GTTCACCTTT TATCTCGTCG TCGCCCTACG CAAAACTGCG TCTGGCCTAT CACGATCAGC ACCCTGAATT TCCAATGCTC CTGGATATTA TGCAAGAGAT CATGCGCCAG CAGGGCATTT TACTTGAGGG CGTTGAGCTG AATTATGATG ACTGGGCGAA TGGCAAAACC AATGTGGATC TCTGGCTGGG GACGGTCAAT TTCCCCATTC CCGAAGAGTG GAACGTCGGT ACATGGCTAC TGGGCTCCCC TTTACTGCGC CACGCCATCA GCGGTGGGGA TGATGCGCTG CTGGCCCAAT GGGAAACCCA GTGGCATGCC GAAACCATCA GCGCGGAACA ACTGGTCAGG GAAACCACCC GTTCAGGCTG GCTACAGCCG CTTTTTCACC ACTGGATGCG ACTCAAAGGC CCCGACCGGG CCAGGGGGAT CCACCTGAAT AACCTGGGAT GGTTTGATTT CCGATCCACC TGGATTGAGC CAGGGCCTTA A
|
Protein sequence | MLIPVREISP FMTTRHTEQK YLKLLQHYGD KPVSVTLQEL ADVLFCTRRH MRNLLLQMQE AKWLIWQSQA GRGHRARLHL RYKPEQLLSE KAEQLLESGH VDQAIQLLGK NKHQVAQLLR SKLGYSVRAD YQRLCIPYYR TMPSLCPGIP LRRSEQHLVR QIFSGLTRIN EEKGEVEADL AHHWRQIDPL RWRFYLRPAV LWHDGQELTI DAVIASLTCS AKLPLFSHLQ TIQATGPLSL EITLAHPDNR LPLLLSHIDA MILPPDHTQR ADFPAHPVGT GPYEVVENNG FHLQMKAFDH YFGLRGLLDE VEVFIWPNLT ETDNLAESLS DNDTAAWLSS SLSDEDYVSG RLSQVSGKPS DNLREMFLER GGYFLLCDSR SPHWHTAEHR RWLRETLSPY ALLQHLSEAI RPFWVPGGSL LSSWFHTIEA GPACSPFISS SPYAKLRLAY HDQHPEFPML LDIMQEIMRQ QGILLEGVEL NYDDWANGKT NVDLWLGTVN FPIPEEWNVG TWLLGSPLLR HAISGGDDAL LAQWETQWHA ETISAEQLVR ETTRSGWLQP LFHHWMRLKG PDRARGIHLN NLGWFDFRST WIEPGP
|
| |