Gene SeAg_B2998 details


Gene Information

Locus tag: SeAg_B2998
Symbol:
ID: 6796813
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 2928382
End bp: 2930043
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 43%
IMG OID: 642777165
Product: invasion protein regulator
Protein accession: YP_002147774
Protein GI: 197250952
COG category: [K] Transcription
COG ID: [COG3710] DNA-binding winged-HTH domains
TIGRFAM ID:
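
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the page does not state either convention. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2928382
end_bp = 2930043

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1662 553
```

Both results agree with the listed Gene Length (1662 bp) and Protein Length (553 aa).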


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000362876
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCCACATT TTAATCCTGT TCCTGTATCG AATAAAAAAT TCGTCTTTGA TGATTTCATA 
CTCAACATGG ACGGCTCCCT GCTACGCTCA GAAAAGAAAG TCAATATTCC GCCAAAAGAA
TATGCTGTTC TGGTCATCCT GCTCGAAGCC GCCGGCGAGA TTGTGAGTAA AAACACCTTA
CTGGACCAGG TATGGGGCGA CGCGGAAGTT AACGAAGAAT CTCTTACCCG CTGTATTTAT
GCCTTACGAC GTATTCTGTC GGAAGATAAA GAGCATCGTT ACATTGAAAC ACTGTACGGA
CAGGGCTATC GGTTTAATCG TCCGGTCGTA GTGGTGTCTC CGCCAGCGCC GCAACCTACG
ACTCATACAT TGGCGATACT TCCTTTTCAG ATGCAGGATC AGGTTCAATC CGAGAGTTTG
CATTACTCTA TCGTGAAGGG ATTATCGCAG TATGCGCCCT TTGGCCTGAG CGTGCTGCCG
GTGACCATTA CGAAGAACTG CCGCAGTGTT AAGGATATAC TTGAGCTCAT GGATCAATTA
CGCCCCGATT ATTATATCTC CGGGCAGATG ATACCTGATG GTAATGATAA TATTGTACAG
ATCGAGATAG TTCGGGTTAA AGGTTATCAC CTGCTGCACC AGGAAAGCAT TAAGTTGATA
GAACACCAAC CCGCTTCTCT CTTGCAAAAC AAAATTGCGA ATCTTTTGCT CAGATGTATT
CCCGGGCTTC GCTGGGACAC AAAGCAGATT AGCGAGCTAA ATTCGATTGA CAGCACTATG
GTTTACTTAC GCGGTAAGCA TGAGTTAAAT CAATACACCC CCTATAGCTT ACAGCAAGCG
CTTAAATTGC TGACTCAATG CGTCAACATG TCGCCAAACA GCATTGCGCC TTACTGTGCG
CTGGCAGAAT GCTACCTCAG CATGGCGCAA ATGGGGATTT TTGATAAACA AAACGCTATG
ATCAAAGCTA AAGAACATGC GATTAAGGCG ACAGAGCTGG ACCACAATAA TCCACAAGCT
TTAGGATTAC TGGGGCTAAT TAATACGATT CATTCAGAAT ACATCGTCGG GAGTTTGCTA
TTCAAACAAG CTAACTTACT TTCGCCCATT TCTGCAGATA TTAAATATTA TTATGGCTGG
AATCTCTTCA TGGCTGGTCA GTTGGAGGAG GCCTTACAAA CGATTAACGA GTGTTTAAAA
TTGGACCCAA CGCGCGCAGC CGCAGGGATC ACTAAGCTGT GGATTACCTA TTATCATACC
GGTATTGATG ATGCTATACG TTTAGGCGAT GAATTACGCT CACAACACTT GCAGGATAAT
CCAATATTAT TAAGTATGCA GGTTATGTTC CTTTCGCTTA AAGGTAAACA TGAACTGGCA
CGAAAATTAA CTAAAGAAAT ATCCACGCAG GAAATAACAG GGCTTATTGC TGTTAATCTT
CTTTATGCTG AATACTGTCA GAATAGTGAG CGTGCCTTAC CGACGATAAG AGAGTTTCTG
GAAAGCGAAC AGCGTATTGA TAATAATCCG GGATTATTAC CGTTAGTGCT GGTTGCCCAC
GGCGAAGCTA TTGCCGAGAA AATGTGGAAT AAATTTAAAA ACGAAGACAA TATTTGGTTC
AAAAGATGGA AACAGGATCC CCGCTTGATT AAATTACGGT AA
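
A GC percentage like the one reported in the Gene Information section can be recomputed from the sequence text. The sketch below is one way to do that in Python, assuming the sequence is pasted in the 10-base blocks shown above. `gc_percent` is a hypothetical helper name, not part of any tool referenced by this page.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the 10-base block spacing and line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example input: the first row of the gene sequence above.
first_row = "ATGCCACATT TTAATCCTGT TCCTGTATCG AATAAAAAAT TCGTCTTTGA TGATTTCATA"
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence instead of the first row gives a value to compare against the reported GC content.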
 
Protein sequence
MPHFNPVPVS NKKFVFDDFI LNMDGSLLRS EKKVNIPPKE YAVLVILLEA AGEIVSKNTL 
LDQVWGDAEV NEESLTRCIY ALRRILSEDK EHRYIETLYG QGYRFNRPVV VVSPPAPQPT
THTLAILPFQ MQDQVQSESL HYSIVKGLSQ YAPFGLSVLP VTITKNCRSV KDILELMDQL
RPDYYISGQM IPDGNDNIVQ IEIVRVKGYH LLHQESIKLI EHQPASLLQN KIANLLLRCI
PGLRWDTKQI SELNSIDSTM VYLRGKHELN QYTPYSLQQA LKLLTQCVNM SPNSIAPYCA
LAECYLSMAQ MGIFDKQNAM IKAKEHAIKA TELDHNNPQA LGLLGLINTI HSEYIVGSLL
FKQANLLSPI SADIKYYYGW NLFMAGQLEE ALQTINECLK LDPTRAAAGI TKLWITYYHT
GIDDAIRLGD ELRSQHLQDN PILLSMQVMF LSLKGKHELA RKLTKEISTQ EITGLIAVNL
LYAEYCQNSE RALPTIREFL ESEQRIDNNP GLLPLVLVAH GEAIAEKMWN KFKNEDNIWF
KRWKQDPRLI KLR
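
The protein sequence can be checked against the gene sequence by translating it codon by codon. The Python sketch below builds a codon table from the amino-acid string NCBI publishes for the standard code; translation table 11 uses the same amino-acid assignments and differs in which codons may act as start codons. Since this gene begins with ATG, that difference is assumed not to matter here. `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

# Amino acids in NCBI order: codons enumerated with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): AMINO_ACIDS[index]
    for index, codon in enumerate(product(BASES, repeat=3))
}


def translate(dna: str) -> str:
    """Translate DNA (whitespace ignored) until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGCCACATT TTAATCCTGT TCCTGTATCG"))  # MPHFNPVPVS
print(translate("AAATTACGGT AA"))                     # KLR
```

The two outputs match the first ten and last three residues of the protein sequence listed above.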