Gene SNSL254_A0033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0033 
Symbol 
ID6483241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp33357 
End bp34361 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content39% 
IMG OID642735478 
Productputative transcriptional regulator 
Protein accessionYP_002039260 
Protein GI194446304 
COG category[K] Transcription 
COG ID[COG0583] Transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.792057 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value0.102163 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCCAA TAAAAAATGC TAAAAAAATT GACTACAATC TGATCAAAGT GTTCGATACG 
GTTATTACTG AAGGAAATGC AACCAGGGCG GCGAGGAAAC TGGATGTCAC GCCTGCGGCG
ATCTCTCAGG CTCTTCTTCG TTTACAAAAT CTTTATGGCG AAGAGTTATT TATCAGAACC
CGCAAAGGAT TAGTTCCGTC CAGCAAAGGT AAATCGCTTC ACCAGGTATT TCGCCAGGCA
ATTGAATCTA TAGAAAGCAC ACTGTGCGAT AAAACAGATG CTCAGGAGAG TAATGAACTC
ATTGTTCTGG GAAGTGATAT CACTGAAAAT TACTATTTTC CAGCATTGCT GGATACTGTG
TTGATGAATC GATATATTAT TAAACACTAT GCGATTAAAA AAACAGGGGA ATACTCACCA
GCCTCCATGC TGACGCATGG CTATGCGGAT GTTATCATGG GAATTCTGGA AATTAAGAAT
GAGATGATCG AAAGTTATCT TATTGATAAT TTATCTGATT TTGTTTGTGT TTGTGGTGAA
AAAAGTCCAT TGGTTGGGCT TGAAAAAATT TCTTTATATA ATTTTTATGC TGCCAGACAT
GCTGTCTATC ATTCAGATAT GTTCTCTTCT TTCACCGCTG ATAGCATTGA TTTATTCAAG
AGCAGTACGC CTTATGCGGG GCGCAGGGAA ATAGGTTATT ATAGTGATTC ACTATTTGGA
GTTATCGGTG TTGTTGAAAA AAGCGATATG GTTGCGATTT TGCCAGGAAA GATTGTTACT
TATTTTAGAG ATGTGCGGCG TTATAATATA AAAATACTAC GTATGCCTGA TGAAATGATT
TTTCGTACGT TACCCGTTTA TGCTTATCTG GCTACAAACA GCACCCATTA TAAAAATGTC
AAAAAACTGA TATCAACATT TCAGTCGACC TTTCTTTTTA GCCAGGAAAA GCAGCCTGAC
GCTTTGGTTG AAGGAAGCAC ATCCTTATGC GATTTGTCGG CTTAA
 
Protein sequence
MRPIKNAKKI DYNLIKVFDT VITEGNATRA ARKLDVTPAA ISQALLRLQN LYGEELFIRT 
RKGLVPSSKG KSLHQVFRQA IESIESTLCD KTDAQESNEL IVLGSDITEN YYFPALLDTV
LMNRYIIKHY AIKKTGEYSP ASMLTHGYAD VIMGILEIKN EMIESYLIDN LSDFVCVCGE
KSPLVGLEKI SLYNFYAARH AVYHSDMFSS FTADSIDLFK SSTPYAGRRE IGYYSDSLFG
VIGVVEKSDM VAILPGKIVT YFRDVRRYNI KILRMPDEMI FRTLPVYAYL ATNSTHYKNV
KKLISTFQST FLFSQEKQPD ALVEGSTSLC DLSA