Gene SeSA_A3029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A3029 
Symbol 
ID6517430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp2927061 
End bp2928722 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content43% 
IMG OID642748052 
Productinvasion protein regulator 
Protein accessionYP_002115829 
Protein GI194736966 
COG category[K] Transcription 
COG ID[COG3710] DNA-binding winged-HTH domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0229765 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACATT TTAATCCTGT TCCTGTATCG AATAAAAAAT TCGTCTTTGA TGATTTCATA 
CTCAACATGG ACGGCTCCCT GCTACGCTCA GAAAAGAAAG TCAATATTCC GCCAAAAGAA
TATGCCGTTC TGGTCATCCT GCTCGAAGCC GCCGGCGAGA TTGTGAGTAA AAACACCTTA
CTGGACCAGG TATGGGGCGA CGCGGAAGTT AACGAAGAAT CTCTTACCCG CTGTATTTAT
GCCTTACGAC GTATTCTGTC GGAAGATAAA GAGCATCGTT ACATTGAAAC ACTGTACGGA
CAGGGCTATC GGTTTAATCG TCCGGTCGTA GTGGTGTCTC CGCCAGCGCC GCAACCTACG
ACTCATACAT TGGCGATACT TCCTTTTCAG ATGCAGGATC AGGTTCAATC CGAGAGTCTG
CATTACTCTA TCGTGAAGGG ATTATCGCAG TATGCGCCCT TTGGCCTGAG CGTGCTACCG
GTGACCATTA CGAAGAACTG TCGCAGTGTT AAAGATATTC TTGAGCTCAT GGATCAATTA
CGCCCCGATT ATTATATCTC CGGACAGATG ATACCCGATG GCAATGATAA TATTGTACAG
ATCGAGATAG TTCGGGTTAA AGGTTATCAC CTGCTGCACC AGGAAAGCAT TAAGTTGATA
GAACACCAAC CCGCTTCTCT CTTGCAAAAC AAAATTGCTA ATCTTTTGCT CAGATGTATT
CCCGGGCTTC GCTGGGACAC AAAGCAGGTT AGCGAGCTAA ATTCGATTGA CAGTACTATG
GTTTACTTAC GCGGTAAGCA TGAGTTAAAT CAATATACCC CCTATAGCTT ACAGCAAGCG
CTTAAATTGC TGACTCAATG CGTTAACATG TCGCCAAACA GCATTGCGCC TTACTGTGCG
CTGGCAGAAT GCTACCTCAG CATGGCGCAA ATGGGGATTT TTGATAAACA AAACGCCATG
ATCAAAGCTA AAGAACATGC GATTAAGGCG ACAGAGCTGG ACCACAATAA TCCACAAGCT
TTAGGATTAC TGGGGCTAAT TAATACGATT CATTCAGAAT ACATCGTCGG GAGTTTGCTA
TTCAAACAAG CTAACTTACT TTCGCCCATC TCTGCAGATA TTAAATATTA TTATGGCTGG
AATCTCTTCA TGGCTGGTCA GTTGGAGGAG GCCTTACAAA CGATTAACGA GTGTTTAAAA
TTGGATCCAA CGCGCGCAGC CGCAGGGATC ACTAAGCTGT GGATTACCTA TTATCATACC
GGTATTGATG ATGCTATACG TTTAGGCGAT GAATTACGCT CACAACACCT GCAGGATAAT
CCAATATTAT TAAGTATGCA GGTTATGTTT CTTTCTCTTA AAGGTAAACA TGAACTGGCA
CGAAAATTAA CTAAAGAAAT ATCCACGCAG GAAATAACAG GACTTATTGC TGTTAATCTT
CTTTACGCTG AATATTGTCA GAATAGTGAG CGTGCCTTAC CGACGATAAG AGAATTTCTG
GAAAGTGAAC AGCGTATAGA TAATAATCCG GGATTATTAC CGTTAGTGCT GGTTGCCCAC
GGCGAAGCTA TTGCCGAGAA AATGTGGAAT AAATTTAAAA ACGAAGACAA TATTTGGTTC
AAAAGATGGA AACAGGATCC CCGCTTGATT AAATTACGGT AA
 
Protein sequence
MPHFNPVPVS NKKFVFDDFI LNMDGSLLRS EKKVNIPPKE YAVLVILLEA AGEIVSKNTL 
LDQVWGDAEV NEESLTRCIY ALRRILSEDK EHRYIETLYG QGYRFNRPVV VVSPPAPQPT
THTLAILPFQ MQDQVQSESL HYSIVKGLSQ YAPFGLSVLP VTITKNCRSV KDILELMDQL
RPDYYISGQM IPDGNDNIVQ IEIVRVKGYH LLHQESIKLI EHQPASLLQN KIANLLLRCI
PGLRWDTKQV SELNSIDSTM VYLRGKHELN QYTPYSLQQA LKLLTQCVNM SPNSIAPYCA
LAECYLSMAQ MGIFDKQNAM IKAKEHAIKA TELDHNNPQA LGLLGLINTI HSEYIVGSLL
FKQANLLSPI SADIKYYYGW NLFMAGQLEE ALQTINECLK LDPTRAAAGI TKLWITYYHT
GIDDAIRLGD ELRSQHLQDN PILLSMQVMF LSLKGKHELA RKLTKEISTQ EITGLIAVNL
LYAEYCQNSE RALPTIREFL ESEQRIDNNP GLLPLVLVAH GEAIAEKMWN KFKNEDNIWF
KRWKQDPRLI KLR