Gene SNSL254_A3243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3243 
Symbol 
ID6482280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3156001 
End bp3157023 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content58% 
IMG OID642738541 
ProductHTH-type transcriptional regulator AscG 
Protein accessionYP_002042263 
Protein GI194445314 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.0000054765 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATGGCGA CAATGCTGGA TGTTTCCCGC CATGCGGGCG TATCAAAGGC CACCGTCTCA 
CGAGTGCTGA ATGGGACGGG GCAGGTAAAA GAAAGTACGC GCCAGAAAGT GTTTACGGCG
ATGCAGGCTC TGGGCTATCG CCCCAACCTG CTGGCACGCT CGCTGGCGAA TCGCACCAGC
AACAGCATCG GTCTGGTCGT CTCTACGTTT GACGGCTTCT ATTTTGGCAG TTTGTTGCGC
CGGGCGTCGC GCCAGGCGGA GTCTCATAAC AAGCAGTTGA TCGTCACCGA TGGTCACGAT
ACGCCGGAAC GAGAGCAGAA AGCCGTACAA ATGTTGGCCG ACAGACAGTG CGACGCTATT
ATTCTTTACA CTCGCTATAT GGATGAGCCG GCGATTTTGT CGTTGATTGA CGCCACGGAA
ATGCCGCTGG TGATTATTAA TCGCAACGTC ACTCAGGCCC GCGATCGCGC TATTTTCTTC
GAGCAGGAGA CGGCGGCATT CCAGGCGGTG GAATACCTGA TTACGCAGGG CCATCGCGAT
ATCGCCTGTA TTACGCTGCC TGTTCATACT CCCACCGGCA CATCACGCGT AGCGGGTTAT
CGAAAGGCGC TGGAAAAATA TGGCATTCCC TGGCAACCGG CAAAAGTGAA ATACGGCGAT
TACACGCTGA CGCGCGGCTA TGACGCCTGC CGGGAATTAC TGGAGGAAGG CGTCACGTTT
AGCGCGCTAT TCGCCTGTAA TGATGACACG GCGCTGGGCG CGGCAAAAGC GCTGCGCCAG
GCCGGATTAC GCATCCCGCA GGATGTGTCG CTGTTTGGTT TTGACGATGC GCCGGGCGCA
ACCTGGCTTG AACCGGGGCT TTCAACAGTC TATTTACCCA TCGAGGATAT GATAGCCACC
GCGATCGATC AGGCCGTTCG TCTGGCGAAC AGCGAGCCGG TCGCCCCGAT CCCGCCCTTT
ACCGGCACGC TGATTCTGCG CGAGTCCGTC GCCGCGGGCC CGTTTTTTCA ACGTCCGGCC
TAA
 
Protein sequence
MMATMLDVSR HAGVSKATVS RVLNGTGQVK ESTRQKVFTA MQALGYRPNL LARSLANRTS 
NSIGLVVSTF DGFYFGSLLR RASRQAESHN KQLIVTDGHD TPEREQKAVQ MLADRQCDAI
ILYTRYMDEP AILSLIDATE MPLVIINRNV TQARDRAIFF EQETAAFQAV EYLITQGHRD
IACITLPVHT PTGTSRVAGY RKALEKYGIP WQPAKVKYGD YTLTRGYDAC RELLEEGVTF
SALFACNDDT ALGAAKALRQ AGLRIPQDVS LFGFDDAPGA TWLEPGLSTV YLPIEDMIAT
AIDQAVRLAN SEPVAPIPPF TGTLILRESV AAGPFFQRPA