Gene SNSL254_A0403 details

Gene Information

Locus tag: SNSL254_A0403
Symbol:
ID: 6486010
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 415972
End bp: 416976
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 59%
IMG OID: 642735827
Product: AraC family transcriptional regulator
Protein accession: YP_002039601
Protein GI: 194445438
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
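
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(415972, 416976)
print(length, protein_length(length))  # 1005 334
```

With the values listed in this record, the result agrees with the reported gene length of 1005 bp and protein length of 334 aa.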


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.00000000000043058
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGTCTCCGA TCTCCTGTCA CTCTTCCGCC GCCCCGGCGA TGAAAAAGAT CTTTTCCGTC 
AGCGACTTCA TCGCGTTTGG CGAGCGTTAT GGCATTGATT ACCGCTTCCC TGCGTTACCG
CAGTATACGC AGAGTAGTCC CGTACTTCAT GGCGATATCG AAGAGATAGC GCTTCCCGGC
GGGATTTGCA TTACACGCTC GGATGTTCAC GTGTTACAAC CTTATGAAAC CACCTCTCGC
CATAGCAGTC CGCTGTATAT GCTGGTGGTG CTGGAAGGTA ACGTCGCGCT GGCTGTCAAT
GAGCAGACCT TTTTGTTGAG CGCGGGGATG GCGTTTTGCT CGCAACTGAG TGAGCAGCAG
ACGATACGCG CCCATCACGG CGCAGACAGT AAATTGCGCA CCTTGTCGCT GGGAATGTAC
CCGGACGGCG GATGGCGGGA GCGTTTGCCT GTCTCGCTGG CAGACGAGTG GGAACATCGC
GCGGCCTCGG CGAGGGTCTG GCAGGTGCCG GAGTTTCTGC TTTCGGGGCT ACGTTATGCG
CAGCAGCCCG GACCTCATGC GGCGTCACGC CAGTTAATGC TGGAAGGCAT CATGCTGCAA
TTGCTGGGCT ATGCGCTAAA TCTATGTCAG CCCGCAACGC AAAAACGCGG GCTTCCCGTC
ACCGGTGAAT ACCAGCGGCT GGAGCTCATT CGGCGTTTAC TGGAGCAGAC GCCGGAAAAA
GCCTACACGC TGAACGAACT GGCGCGTCGG GCGGCAATGA GTCCAAGTAG CCTGCGGTGC
AAGTTTCGCC ATGCCTATGG GTGTACCGTG TTTGATTATC TGCGCGATTG CCGCCTGGCG
CGCGCGCGTC GTTATCTGAT GGAGGGATAC AGCGTGCAGC AGGCCGCCTG GATGTCAGGC
TATCAACATG CCACTAACTT TGCGACGGCA TTTCGTCGGC GTTATGGCTG CTCGCCCGGC
GAGCTGCGTG ACGCGTCTCT GACGGCGTCC CGCCACTGTG CGTAA
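
The GC content reported in the gene information can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C among the A, C, G and T bases, and that the spaces and line breaks in the listing are layout only. The function name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters (e.g. the blocks-of-ten spacing
    used in the listing) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First block of ten bases from the gene sequence: 5 of 10 are G or C.
print(gc_content("ATGTCTCCGA"))  # 50.0
```

Passing the full gene sequence and rounding to a whole percent should give a value comparable to the GC content field, provided the database uses the same definition.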
 
Protein sequence
MSPISCHSSA APAMKKIFSV SDFIAFGERY GIDYRFPALP QYTQSSPVLH GDIEEIALPG 
GICITRSDVH VLQPYETTSR HSSPLYMLVV LEGNVALAVN EQTFLLSAGM AFCSQLSEQQ
TIRAHHGADS KLRTLSLGMY PDGGWRERLP VSLADEWEHR AASARVWQVP EFLLSGLRYA
QQPGPHAASR QLMLEGIMLQ LLGYALNLCQ PATQKRGLPV TGEYQRLELI RRLLEQTPEK
AYTLNELARR AAMSPSSLRC KFRHAYGCTV FDYLRDCRLA RARRYLMEGY SVQQAAWMSG
YQHATNFATA FRRRYGCSPG ELRDASLTAS RHCA
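
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is one way to check that correspondence without external libraries. It assumes the amino-acid assignments of table 11 (the bacterial, archaeal and plant plastid code), which are the same as those of the standard code, and it translates the first codon literally rather than applying any start-codon rule. The name `translate` is illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 shares these
# amino-acid assignments with the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the block spacing and newlines
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last stretches of the gene sequence listed above.
print(translate("ATGTCTCCGA TCTCCTGTCA CTCTTCCGCC"))  # MSPISCHSSA
print(translate("CGCCACTGTG CGTAA"))                  # RHCA
```

The two fragments reproduce the first ten and last four residues of the protein sequence. Translating the whole gene sequence the same way and comparing it with the protein listing (after removing spaces) extends the check to the full record.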