Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A0403 |
Symbol | |
ID | 6486010 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 415972 |
End bp | 416976 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642735827 |
Product | AraC family transcriptional regulator |
Protein accession | YP_002039601 |
Protein GI | 194445438 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.00000000000043058 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCTCCGA TCTCCTGTCA CTCTTCCGCC GCCCCGGCGA TGAAAAAGAT CTTTTCCGTC AGCGACTTCA TCGCGTTTGG CGAGCGTTAT GGCATTGATT ACCGCTTCCC TGCGTTACCG CAGTATACGC AGAGTAGTCC CGTACTTCAT GGCGATATCG AAGAGATAGC GCTTCCCGGC GGGATTTGCA TTACACGCTC GGATGTTCAC GTGTTACAAC CTTATGAAAC CACCTCTCGC CATAGCAGTC CGCTGTATAT GCTGGTGGTG CTGGAAGGTA ACGTCGCGCT GGCTGTCAAT GAGCAGACCT TTTTGTTGAG CGCGGGGATG GCGTTTTGCT CGCAACTGAG TGAGCAGCAG ACGATACGCG CCCATCACGG CGCAGACAGT AAATTGCGCA CCTTGTCGCT GGGAATGTAC CCGGACGGCG GATGGCGGGA GCGTTTGCCT GTCTCGCTGG CAGACGAGTG GGAACATCGC GCGGCCTCGG CGAGGGTCTG GCAGGTGCCG GAGTTTCTGC TTTCGGGGCT ACGTTATGCG CAGCAGCCCG GACCTCATGC GGCGTCACGC CAGTTAATGC TGGAAGGCAT CATGCTGCAA TTGCTGGGCT ATGCGCTAAA TCTATGTCAG CCCGCAACGC AAAAACGCGG GCTTCCCGTC ACCGGTGAAT ACCAGCGGCT GGAGCTCATT CGGCGTTTAC TGGAGCAGAC GCCGGAAAAA GCCTACACGC TGAACGAACT GGCGCGTCGG GCGGCAATGA GTCCAAGTAG CCTGCGGTGC AAGTTTCGCC ATGCCTATGG GTGTACCGTG TTTGATTATC TGCGCGATTG CCGCCTGGCG CGCGCGCGTC GTTATCTGAT GGAGGGATAC AGCGTGCAGC AGGCCGCCTG GATGTCAGGC TATCAACATG CCACTAACTT TGCGACGGCA TTTCGTCGGC GTTATGGCTG CTCGCCCGGC GAGCTGCGTG ACGCGTCTCT GACGGCGTCC CGCCACTGTG CGTAA
|
Protein sequence | MSPISCHSSA APAMKKIFSV SDFIAFGERY GIDYRFPALP QYTQSSPVLH GDIEEIALPG GICITRSDVH VLQPYETTSR HSSPLYMLVV LEGNVALAVN EQTFLLSAGM AFCSQLSEQQ TIRAHHGADS KLRTLSLGMY PDGGWRERLP VSLADEWEHR AASARVWQVP EFLLSGLRYA QQPGPHAASR QLMLEGIMLQ LLGYALNLCQ PATQKRGLPV TGEYQRLELI RRLLEQTPEK AYTLNELARR AAMSPSSLRC KFRHAYGCTV FDYLRDCRLA RARRYLMEGY SVQQAAWMSG YQHATNFATA FRRRYGCSPG ELRDASLTAS RHCA
|
| |