Gene Information |
Locus tag | SNSL254_A4866 |
Symbol | |
ID | 6482401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 4734466 |
End bp | 4735374 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642740077 |
Product | putative DNA-binding transcriptional regulator |
Protein accession | YP_002043754 |
Protein GI | 194445555 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | |
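The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are inclusive coordinates and that the unspliced CDS ends with a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(4734466, 4735374)
print(length, protein_length_aa(length))  # 909 302
```

Under these assumptions the result agrees with the Gene Length (909 bp) and Protein Length (302 aa) fields.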
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.424188 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 0.514626 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGATGTAA CTGGAGCAGG TTTGCACAAT ATTGAGACAA AATGGCTATA TGATTTTCTG ACACTGGAAA AGTGCCGCAA TTTCTCTCAG GCCGCCATTA TCCGCAACGT ATCGCAACCC GCTTTTAGCC GGCGGATTCG CGCCCTGGAA CATGCCGTGG GCGTTGAACT GTTTAACCGA CAGGTCTCGC CGCTACAGCT TTCCGAACAG GGTAAAATCT TTCACTCCCA GGTTCGCCAC CTGTTACAGC AGCTGGAAAG TAATCTGACC GAACTGCGCG GCGGCAGCGA TTATACGCTG CGTAAAATCA AGATTGCCGC CGCCCACTCG CTCTCCCTCG GCCTGTTGCC GACCATCGTT AAGCAGATGC CGACGCAGTT TACCTACGCC GTTGAGGCGA TAGATGTCGA CCAGGCGGTG GATATGTTAC GTGAGGGGCA AAGCGATTTT ATCTTTTCCT ATCACGATGA AAACCTGCAA CAAGCGCCGT TTGATAATAT CCGCCTGTTT GAGTCGAGAC TGTTTCCGGT TTGCGCCAAC AATGGCCGGG GCGAGCCACG CTATACGCTT GAGCAGCCGC ACTTTCCCCT GCTTAATTAC AGTCAGAACT CCTATATGGG CCGGCTGATA AATCGTACTC TGACTCGCCA TGCTGAACTG AGTTTCAGTA CATTTTTCGT CTCTTCGATG AGTGAATTGT TAAAACAGGT TGCGATGGAC GGCTGCGGAA TCGCCTGGTT GCCCGAGTAT GCTATCCGTC AGGAGATTAC CGACGGACGC CTGATAGTGC TTGATGCCGA CGAACTGGTT ATTCCGATCC AGGCTTATGC TTATCGCATG AATACCCGCA TGAGTCAGGT AGCCGAAACA TTTTGGCGCG ACCTGCGTGG GCTTCAGGCC GCGCTGTAA
|
Protein sequence | MDVTGAGLHN IETKWLYDFL TLEKCRNFSQ AAIIRNVSQP AFSRRIRALE HAVGVELFNR QVSPLQLSEQ GKIFHSQVRH LLQQLESNLT ELRGGSDYTL RKIKIAAAHS LSLGLLPTIV KQMPTQFTYA VEAIDVDQAV DMLREGQSDF IFSYHDENLQ QAPFDNIRLF ESRLFPVCAN NGRGEPRYTL EQPHFPLLNY SQNSYMGRLI NRTLTRHAEL SFSTFFVSSM SELLKQVAMD GCGIAWLPEY AIRQEITDGR LIVLDADELV IPIQAYAYRM NTRMSQVAET FWRDLRGLQA AL
|
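The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any length or composition check. The following is a minimal sketch, not the method used by the source database: `clean_sequence` and `gc_percent` are hypothetical helper names, and the short `prefix` string is just the first two blocks of the gene sequence, standing in for the full 909 bp string.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that splits the sequence into 10-character blocks."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence shown above, used as a short stand-in.
prefix = "ATGGATGTAA CTGGAGCAGG"
print(len(clean_sequence(prefix)), gc_percent(prefix))
```

Applied to the full gene sequence, `len(clean_sequence(...))` can be compared with the Gene Length field and `gc_percent(...)` with the GC content field. The prefix alone is not expected to match the whole-gene figure.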