Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3976 |
Symbol | |
ID | 6482901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 3862569 |
End bp | 3863624 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642739235 |
Product | LacI family transcriptional regulator |
Protein accession | YP_002042945 |
Protein GI | 194443984 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 0.637637 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAATA ATGCGCGGAT AAATCAGCCA ATCTTTGCAG GGGCATCAGA CGTGAAGAGA ACCAAATCAC CTCGTGCGCC GACGCTGGAA GATGTAGCAC GTAGCGCCGG GCTGTCTCCG ATGACAGTCA GCCGGGCATT GAATTCGCCA CAACTGGTTC GCCCCAAAAC GGTTGAGAAA GTCATGCAGG CGGTTCGCGT CACAGGCTAC ATACCCAACG CGTTAGCCGG TGGTCTGGCG TCGCGACGGA GTAAATTAAT CGCTGTCGTC GTGCCGCAAA TTAACAACAA CATGTTTGTC GATACCATCC AGTCACTGAG CGATGAACTG GCCCGACGCG GATATCACAT ATTGCTGTGC GTGGCGGGAT ATACCGAACA AACGGAAGCG GAGTTGGTGG CGACACTGCT TTCCCGCCGC CCCGATGGCG TGGTGCTTAC CGGGATCCAT CACACGATAG AACTGAAAAA GGTCATCCTG AACGCGGCTA TTCCGGTGGT GGAAATTTGG GACTTAACGC CCACGCCGCT TGATATGCTG GTCGGTTTTT CCCATGAAAA AGTCGGGCAG GCGACGGGGG AGTATCTTCT GAGTAAGGGC TATCGTCGTC CCGGCTTGTT GTGGACCGCC GATCGCCGAG CCGCGCAACG TAAGCAGGGG TTATGTAGTG TTCTTCAACG CCACGCTATT CATGCCGTAC CGCAGGTAGA TGTCCCCCTT CCGGCATCGC TTTCGCTGGG GCGCAGCGGT TTAAGTCAGC TTTTTGACGA AGGGACGTTT GATGTCATTG TTTGCAGTTC TGATACCCTG GCACAGGGGG CGATGATGGA GGCGGAAAGC CGTGGTTTGC GCATCCCGCA TGATTTAGCG GTTATTGGTT TTGGCGACCT TGATTTTGCC GCCAGCAATC GACCATCAAT TACTACCGTA AGCGTTGACA GACGCGCCAT TGGCCAGCGC GCCGCTACGC TGTTGGCCGA TCGTATTGAA CAGAAACCGT GCGCAGAAGC TATTGTGGAT ATTGGCTTTC ATTTGATTGA GCGAGAGTCC GCATAA
|
Protein sequence | MMNNARINQP IFAGASDVKR TKSPRAPTLE DVARSAGLSP MTVSRALNSP QLVRPKTVEK VMQAVRVTGY IPNALAGGLA SRRSKLIAVV VPQINNNMFV DTIQSLSDEL ARRGYHILLC VAGYTEQTEA ELVATLLSRR PDGVVLTGIH HTIELKKVIL NAAIPVVEIW DLTPTPLDML VGFSHEKVGQ ATGEYLLSKG YRRPGLLWTA DRRAAQRKQG LCSVLQRHAI HAVPQVDVPL PASLSLGRSG LSQLFDEGTF DVIVCSSDTL AQGAMMEAES RGLRIPHDLA VIGFGDLDFA ASNRPSITTV SVDRRAIGQR AATLLADRIE QKPCAEAIVD IGFHLIERES A
|
| |