Gene SNSL254_A2647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2647 
Symbol 
ID6483633 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2562112 
End bp2563164 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content55% 
IMG OID642737980 
Producttranscriptional regulator EutR 
Protein accessionYP_002041714 
Protein GI194445351 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones82 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA CCCGTACAGC GAATTTGCAC CATCTTTATC ATGAAGCGTT ACCCGAAGAC 
GTTAAACTTA CGCCCAGAGT CGAGGTGGAC AATGTTCATC AGCGACGGAC GACGGATGTC
TATGAACATG CTCTGACTAT CACCGCCTGG CAGCAGATCT ACGATCAGTT ACACCCGGGT
AAATTTCATG GCGAGTTCAC GGAAATCCTG CTTGATGAGA TTCAGGTGTT CCGTGAATAC
ACCGGCCTGG CGCTGCGTCA GTCGTGTCTG GTTTGGCCAA ACTCTTTCTG GTTTGGTATT
CCGGCGACGC GCGGCGAACA GGGATTTATC GGCGCGCAGG GGCTCGGCAG CGCCGAGATT
GCCACCCGAC CGGGAGGCAC CGAGTTTGAA CTGAGCACGC CGGATGATTA CACCATTCTG
GGCGTCGTTA TCTCGGAAGA TGTTATTTCC CGTCAGGCCA CGTTTTTGCA TAACCCGGAA
AGGGTGCTGC ATATGCTGCG TAACCAGCTA GCGCTGGAGG TAAAAGAGCA GCATAAAGCA
GCGCTGTGGG GCTTTGTGCA GCAGGCGCTG GCCACCTTCA GCGAGAGTCC TGAAACCCTG
CATCAACCTG CGGTGCGTAA GGTCTTGAGT GATAATTTGT TGCTGGCGAT GGGCACGATG
CTGGAAGAGG CGAAGCCGAT TCATAGCGCC GAGAGCATCA GCCATCAGGG ATATCGTAGG
CTACTGTCGC GGGCGCGGGA ATATGTGCTG GAAAATATGT CGGAGCCGCT GACGGTGCTC
GACTTATGTA ACCAGCTACA CGTAAGCCGT CGCACGCTGC AAAATGCGTT TCACGCCATT
TTGGGCATTG GCCCCAATGC GTGGCTGAAA CGTATTCGCT TAAACGCAGT ACGCCGGGAA
CTGATTAGCC CGTGGTCGCA GAGCACCACG GTCAAAGATG CCGCGATGCA GTGGGGATTC
TGGCATTTGG GGCAATTTGC CACCGATTAT CAGCAATTAT TTGCCGAAAA ACCGTCGTTG
ACGCTGCATC AGCGGATGCG GCAATGGGCT TGA
 
Protein sequence
MKKTRTANLH HLYHEALPED VKLTPRVEVD NVHQRRTTDV YEHALTITAW QQIYDQLHPG 
KFHGEFTEIL LDEIQVFREY TGLALRQSCL VWPNSFWFGI PATRGEQGFI GAQGLGSAEI
ATRPGGTEFE LSTPDDYTIL GVVISEDVIS RQATFLHNPE RVLHMLRNQL ALEVKEQHKA
ALWGFVQQAL ATFSESPETL HQPAVRKVLS DNLLLAMGTM LEEAKPIHSA ESISHQGYRR
LLSRAREYVL ENMSEPLTVL DLCNQLHVSR RTLQNAFHAI LGIGPNAWLK RIRLNAVRRE
LISPWSQSTT VKDAAMQWGF WHLGQFATDY QQLFAEKPSL TLHQRMRQWA