Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3072 |
Symbol | |
ID | 6484804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 2988611 |
End bp | 2989498 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642738386 |
Product | AraC family transcriptional regulator |
Protein accession | YP_002042110 |
Protein GI | 194446606 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.879002 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.00256157 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTATTGC CTTCAATGAA TAAATCAGTT GAGGCCATTA GCAATAATCA CCTTCAACAG CCGAACAAAT TTCCATTAAT AAATGGATTA GCTGACGTAA GAGACTATTA TGTCGCAAAC TGCTTATTGT TTAAACTTAA TAAAGGCAGT TTGCGAATTG AAAACGAATT TGGGGAGTTC ATCGAACAAT CTGCGCCGTG TTTATTTTTA TTGGAAAAAG ATCAAACAAT CACGCTTAGT ATGAGCGAAA TAGAAGGGCA TATTGATTTT TCTTCACTGG AAGTTTCCTA TGACTTAATG CAAAAATTCT ACAAAGTTTT TTACAGTACA AGAAACTATA ATGATCGAGA GCTATCATTA AAAACGAAAC CAAAGTATTT TTTTCATGCG GACTTGTTGC CAGGGATGAG TGATACTTTT GACTCTATTT TGCATGGTGT GGCATGTCCA CGGGTTTGTA GTAATGTGAG TATTGATGAT CATGATTATT CATATTTCTC ATTGATGTAT CTTATATCGG CATTTGTACG TAAGCCCGGT GGGTTTGATT TCCTTGAGCG AGCAATAAAA ATTACGACAA AAGAGAAAGT TTATAACATT ATTATCAGCG ATCTCACCCG CAAATGGTCA CAGGCTGAGG TGGCAGGAAA GCTGTTTATG AGCGTATCAA GTCTGAAGCG AAAACTGGCC GCTGAAGAGG TGAGTTTTAG CAAAATATAC CTGGATGCTC GTATGAATCA GGCTATAAAA TTATTACGTA TGGGGGCTGG AAATATTTCA CAAGTCGCGA CGATGTGTGG CTATGATACG CCTTCTTATT TTATCGCTAT TTTTAAACGG CATTTTAAGA TTACACCGCT TAGCTTTATG CGTACAATGA ACCATTGA
|
Protein sequence | MVLPSMNKSV EAISNNHLQQ PNKFPLINGL ADVRDYYVAN CLLFKLNKGS LRIENEFGEF IEQSAPCLFL LEKDQTITLS MSEIEGHIDF SSLEVSYDLM QKFYKVFYST RNYNDRELSL KTKPKYFFHA DLLPGMSDTF DSILHGVACP RVCSNVSIDD HDYSYFSLMY LISAFVRKPG GFDFLERAIK ITTKEKVYNI IISDLTRKWS QAEVAGKLFM SVSSLKRKLA AEEVSFSKIY LDARMNQAIK LLRMGAGNIS QVATMCGYDT PSYFIAIFKR HFKITPLSFM RTMNH
|
| |