Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3910 |
Symbol | |
ID | 6484381 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 3793795 |
End bp | 3794802 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642739172 |
Product | regulatory protein LacI:Periplasmic binding protein/LacI transcriptional regulator |
Protein accession | YP_002042883 |
Protein GI | 194442857 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.83437 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGTTC AAAATAAAAA ACGCGCAAAG TTGATTGATG TTGCCCGCTA TGCAGGTGTG TCGCCAGGGA CAGTATCCAA TGCATTGCAC AACACTCGCT TTGTCGAGCC GCAGACGCGA CGGCGTATTG AAGAGGCCAT TGCTGCGCTC AACTACACGC CGAATATTCG CGCCCGCCAG TTGCGAACCG GCAAAACCAA TACCATTGCT TTGCTCTCTT CGGTGCCGCT GGCGATTGCC TCCGGCGCGT CACGACTGGG ATTTATGATG GAGGTGGCAT TAACGTCCGC GATGATGGCG CTGGAAAAGC AGCATGCGCT GATTCTGGTG CCGCCGGGGG CAAATCCACT GGATGCCGTC AGCTTTGACG CGGCGATCCT GATTGAGCCG GCGGAGAACG ATCCGCAGCT CCAGGCGCTG GCGCAAGCGG GCATTCCCTG CGTCACCATT GGCCGCACGC CGGGGACCGA CACGCCTGTG CCGTGGGTTG AGCTGCACTC GGCGGCAACA GCACAGCTTC TGCTAACGCA TCTGGAGGCC TCCGGCGCCA GCAAATGTGC GTTATTTGTC GGTAACACAC GGCGAACATC AGTTCTGGAA AGCATAGCGG CTTACCAGCG CTGGTGCGCG GGGCGCCAGG CCCCCGTCGT CTACTCTCTC AATGAAAGCG AGGGTGAAAA TGCCGGCTAC CAGGCCGCGC AGCAGCTATT ACAGGCGCAT CCCGACGTTG ACGGCGTGCT GGTGCTGATC GATACCTTTG CCAGCGGCGC GGTACGCGCT TTCCAGGAAC AAGACATCGC CATACCTGAA CAAATGCGGG TGGTTACCCG CTATGACGGT ATCCGCGCGC GCGAATCGCT GCCGCCGCTG ACGGCAGTGA ATATGCATCT TGATGAGGTG GCGCGACAGG CAATCACGCT CCTGTTTGCC GTTCTGTCGG GTGAGAAGGT CAGCTACAGC GACGGGATCA TGCCTGAACT GGTGGTGCGA GCGTCAACCT GCCGGTGA
|
Protein sequence | MAVQNKKRAK LIDVARYAGV SPGTVSNALH NTRFVEPQTR RRIEEAIAAL NYTPNIRARQ LRTGKTNTIA LLSSVPLAIA SGASRLGFMM EVALTSAMMA LEKQHALILV PPGANPLDAV SFDAAILIEP AENDPQLQAL AQAGIPCVTI GRTPGTDTPV PWVELHSAAT AQLLLTHLEA SGASKCALFV GNTRRTSVLE SIAAYQRWCA GRQAPVVYSL NESEGENAGY QAAQQLLQAH PDVDGVLVLI DTFASGAVRA FQEQDIAIPE QMRVVTRYDG IRARESLPPL TAVNMHLDEV ARQAITLLFA VLSGEKVSYS DGIMPELVVR ASTCR
|
| |