Gene SNSL254_A3081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3081 
Symbol 
ID6482526 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2996360 
End bp2998021 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content43% 
IMG OID642738395 
Productinvasion protein regulator 
Protein accessionYP_002042119 
Protein GI194443739 
COG category[K] Transcription 
COG ID[COG3710] DNA-binding winged-HTH domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.0000362191 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCACATT TTAATCCTGT TCCTGTATCG AATAAAAAAT TCGTCTTTGA TGATTTCATA 
CTCAACATGG ACGGCTCCCT GCTACGCTCA GAAAAGAAAG TCAATATTCC GCCAAAAGAA
TATGCCGTTC TGGTCATCCT GCTCGAAGCC GCCGGCGAGA TTGTGAGTAA AAACACCTTA
CTGGACCAGG TATGGGGCGA CGCGGAAGTT AACGAAGAAT CTCTTACCCG CTGTATTTAT
GCCTTACGAC GTATTCTGTC GGAAGATAAA GAGCATCGTT ACATTGAAAC ACTGTACGGA
CAGGGCTATC GGTTTAATCG TCCGGTCGTA GTGGTGTCTC CGCCAGCGCC GCAACCTACG
ACTCATACAT TGGCGATACT TCCTTTTCAG ATGCAGGATC AGGTTCAATC CGAGAGTCTG
CATTACTCTA TCGTGAAGGG ATTATCGCAG TATGCGCCCT TTGGCCTGAG CGTGCTGCCG
GTGACCATTA CGAAGAACTG CCGCAGTGTT AAGGATATTC TTGAGCTCAT GGATCAATTA
CGCCCCGATT ATTATATCTC CGGGCAGATG ATACCCGATG GCAATGATAA TATTGTACAG
ATCGAGATAG TTCGGGTTAA AGGTTATCAC CTGCTGCACC AGGAAAGCAT TAAGTTGATA
GAACACCAAC CCGCTTCTCT CTTGCAAAAC AAAATTGCGA ATCTTTTGCT CAGATGTATT
CCCGGACTTC GCTGGGACAC AAAGCAGATT AGCGAGCTAA ATTCGATTGA CAGTACTATG
GTTTACTTAC GCGGTAAGCA TGAGTTAAAT CAATATACCC CCTATAGCTT ACAGCAAGCG
CTTAAATTGC TGACTCAATG CGTTAACATG TCGCCAAACA GCATTGCGCC TTACTGTGCG
CTGGCAGAAT GCTACCTCAG CATGGCGCAA ATGGGGATTT TTGATAAACA AAACGCCATG
ATCAAAGCTA AAGAACATGC GATTAAGGCG ACAGAGCTGG ACCACAATAA TCCACAAGCT
TTAGGATTAC TGGGACTAAT TAATACGATT CATTCAGAAT ACATCGTCGG GAGTTTGCTA
TTCAAACAAG CTAACTTACT TTCGCCCATT TCTGCAGATA TTAAATATTA TTATGGCTGG
AATCTCTTCA TGGCTGGTCA GTTGGAGGAG GCCTTACAAA CGATTAACGA GTGTTTAAAA
TTGGACCCAA CGCGCGCAGC CGCAGGGATC ACTAAGCTGT GGATTACCTA TTATCATACC
GGTATTGATG ATGCTATACG TTTAGGCGAT GAATTACGCT CACAACACCT GCAGGATAAT
CCAATATTAT TAAGTATGCA GGTTATGTTT CTTTCGCTTA AAGGTAAACA TGAACTGGCA
CGAAAATTAA CTAAAGAAAT ATCCACGCAG GAAATAACAG GACTTATTGC TGTTAATCTT
CTTTACGCTG AATATTGTCA GAATAGTGAG CGTGCCTTAC CGACGATAAG AGAATTTCTG
GAAAGTGAAC AGCGTATAGA TAATAATCCG GGATTATTAC CGTTAGTGCT GGTTGCCCAC
GGCGAAGCTA TTGCCGAGAA AATGTGGAAT AAATTTAAAA ACGAAGACAA TATTTGGTTC
AAAAGATGGA AACAGGATCC CCGCTTGATT AAATTACGGT AA
 
Protein sequence
MPHFNPVPVS NKKFVFDDFI LNMDGSLLRS EKKVNIPPKE YAVLVILLEA AGEIVSKNTL 
LDQVWGDAEV NEESLTRCIY ALRRILSEDK EHRYIETLYG QGYRFNRPVV VVSPPAPQPT
THTLAILPFQ MQDQVQSESL HYSIVKGLSQ YAPFGLSVLP VTITKNCRSV KDILELMDQL
RPDYYISGQM IPDGNDNIVQ IEIVRVKGYH LLHQESIKLI EHQPASLLQN KIANLLLRCI
PGLRWDTKQI SELNSIDSTM VYLRGKHELN QYTPYSLQQA LKLLTQCVNM SPNSIAPYCA
LAECYLSMAQ MGIFDKQNAM IKAKEHAIKA TELDHNNPQA LGLLGLINTI HSEYIVGSLL
FKQANLLSPI SADIKYYYGW NLFMAGQLEE ALQTINECLK LDPTRAAAGI TKLWITYYHT
GIDDAIRLGD ELRSQHLQDN PILLSMQVMF LSLKGKHELA RKLTKEISTQ EITGLIAVNL
LYAEYCQNSE RALPTIREFL ESEQRIDNNP GLLPLVLVAH GEAIAEKMWN KFKNEDNIWF
KRWKQDPRLI KLR