Gene SeD_A3186 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3186 
Symbol 
ID6874825 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3064247 
End bp3065914 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content43% 
IMG OID642786206 
Productinvasion protein regulator 
Protein accessionYP_002216847 
Protein GI198245842 
COG category[K] Transcription 
COG ID[COG3710] DNA-binding winged-HTH domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value0.735415 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACATT TTAATCCTGT TCCTGTTCCT GTATCGAATA AAAAATTCGT CTTTGATGAT 
TTCATACTCA ACATGGACGG CTCCCTGCTA CGCTCAGAAA AGAAAGTCAA TATTCCGCCA
AAAGAATATG CCGTTCTGGT CATCCTGCTC GAAGCCGCCG GCGAGATTGT GAGTAAAAAC
ACCTTACTGG ACCAGGTATG GGGCGACGCG GAAGTTAACG AAGAATCTCT TACCCGCTGT
ATTTATGCCT TACGACGTAT TCTGTCGGAA GATAAAGAGC ATCGTTACAT TGAAACACTG
TACGGACAGG GCTATCGGTT TAATCGTCCG GTCGTAGTGG TGTCTCCGCC AGCGCCGCAA
CCTACGACTC ATACATTGGC GATACTTCCT TTTCAGATGC AGGATCAGGT TCAATCCGAG
AGTCTGCATT ACTCTATCGT GAAGGGATTA TCGCAGTATG CGCCCTTTGG CCTGAGCGTG
CTGCCGGTGA CCATTACGAA GAACTGCCGC AGTGTTAAGG ATATTCTTGA GCTCATGGAT
CAATTACGCC CCGATTATTA TATCTCCGGG CAGATGATAC CCGATGGTAA TGATAATATT
GTACAGATTG AGATAGTTCG GGTTAAAGGT TATCACCTGC TGCACCAGGA AAGCATTAAG
TTGATAGAAC ACCAACCCGC TTCTCTCTTG CAAAACAAAA TTGCGAATCT TTTGCTCAGA
TGTATTCCCG GACTTCGCTG GGACACAAAG CAGATTAGCG AGCTAAATTC GATTGACAGT
ACTATGGTTT ACTTACGCGG TAAGCATGAG TTAAATCAAT ACACCCCCTA TAGCTTACAG
CAAGCGCTTA AATTGCTGAC TCAATGCGTT AACATGTCGC CAAACAGCAT TGCGCCTTAC
TGTGCGCTGG CAGAATGCTA CCTCAGCATG GCGCAAATGG GGATTTTTGA TAAACAAAAC
GCTATGATCA AAGCTAAAGA ACATGCGATT AAGGCGACAG AGCTGGACCA CAATAATCCA
CAAGCTTTAG GATTACTGGG GCTAATTAAT ACGATTCACT CAGAATACAT CGTCGGGAGT
TTGCTATTCA AACAAGCTAA CTTACTTTCG CCCATTTCTG CAGATATTAA ATATTATTAT
GGCTGGAATC TTTTCATGGC TGGTCAGTTG GAGGAGGCCT TACAAACGAT TAACGAGTGT
TTAAAATTGG ACCCAACGCG CGCAGCCGCA GGGATCACTA AGCTGTGGAT TACCTATTAT
CATACCGGTA TTGATGATGC TATACGTTTA GGCGATGAAT TACGCTCACA ACACCTGCAG
GATAATCCAA TATTATTAAG TATGCAGGTT ATGTTTCTTT CGCTTAAAGG TAAACATGAA
CTGGCACGAA AATTAACTAA AGAAATATCC ACGCAGGAAA TAACAGGACT TATTGCTGTT
AATCTTCTTT ACGCTGAATA TTGTCAGAAT AGTGAGCGTG CCTTACCGAC GATAAGAGAA
TTTCTGGAAA GTGAACAGCG TATAGATAAT AATCCGGGAT TATTACCGTT AGTGCTGGTT
GCCCACGGCG AAGCTATTGC CGAGAAAATG TGGAATAAAT TTAAAAACGA AGACAATATT
TGGTTCAAAA GATGGAAACA GGATCCCCGC TTGATTAAAT TACGGTAA
 
Protein sequence
MPHFNPVPVP VSNKKFVFDD FILNMDGSLL RSEKKVNIPP KEYAVLVILL EAAGEIVSKN 
TLLDQVWGDA EVNEESLTRC IYALRRILSE DKEHRYIETL YGQGYRFNRP VVVVSPPAPQ
PTTHTLAILP FQMQDQVQSE SLHYSIVKGL SQYAPFGLSV LPVTITKNCR SVKDILELMD
QLRPDYYISG QMIPDGNDNI VQIEIVRVKG YHLLHQESIK LIEHQPASLL QNKIANLLLR
CIPGLRWDTK QISELNSIDS TMVYLRGKHE LNQYTPYSLQ QALKLLTQCV NMSPNSIAPY
CALAECYLSM AQMGIFDKQN AMIKAKEHAI KATELDHNNP QALGLLGLIN TIHSEYIVGS
LLFKQANLLS PISADIKYYY GWNLFMAGQL EEALQTINEC LKLDPTRAAA GITKLWITYY
HTGIDDAIRL GDELRSQHLQ DNPILLSMQV MFLSLKGKHE LARKLTKEIS TQEITGLIAV
NLLYAEYCQN SERALPTIRE FLESEQRIDN NPGLLPLVLV AHGEAIAEKM WNKFKNEDNI
WFKRWKQDPR LIKLR