Gene SeD_A1914 details


Gene Information

Locus tag: SeD_A1914
Symbol:
ID: 6873766
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 1850125
End bp: 1851057
Gene Length: 933 bp
Protein Length: 310 aa
Translation table: 11
GC content: 57%
IMG OID: 642785037
Product: putative DNA-binding transcriptional regulator
Protein accession: YP_002215705
Protein GI: 198243042
COG category: [K] Transcription
COG ID: [COG0583] Transcriptional regulator
TIGRFAM ID:
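
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration, and it rests on two assumptions that the record does not state: that the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1850125
end_bp = 1851057
gene_length_bp = 933
protein_length_aa = 310

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span = end_bp - start_bp + 1

# Assumption: an unspliced CDS ends in one stop codon, which is
# not translated into an amino acid.
codons = span // 3
expected_protein_length = codons - 1

print("span matches Gene Length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("codons minus stop matches Protein Length:",
      expected_protein_length == protein_length_aa)
```

Under those assumptions the three fields agree with each other: the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.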


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000429246
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 92
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGTCGG AATACTCGCT TGAAGTGGTT GACGCCGTAG CGCGCAATGG CAGTTTTAGC 
GCGGCGGCGC AGGAACTGCA CCGTGTGCCT TCAGCAGTCA GTTACACCGT TCGTCAGTTA
GAGGAGTGGC TGGCCGTACC GCTTTTTGTA CGACGTCACC GTGATGTTGA ACTGACGCCT
GCTGGCGTCT GGTTTTTAAA AGAAGGGCGT TCTGTTATCA AAAAAATGCA GATCACTCGC
CAACAGTGTC AGCAAATTGC TAATGGCTGG CGCGGGCAGT TAGCTATTGC GGTGGACAAT
ATCGTCAGAC CAGAACGTAC CCGGCAGATG ATCGTCGATT TCTATCGCCA TTTTGACGAT
GTGGAACTGC TGGTTTTTCA GGAGGTGTTT AACGGCGTCT GGGACGCGCT CTCCGACGGG
CGTGTCGAAC TGGCGATTGG GGCGACGCAG GCGATTCCGG TAGGGGGGCG TTACGCTTTT
CGGGATATGG GGACCCTGAG CTGGAGCTGT GTAGTGGCAA GCGATCATCC GCTGGCGTCA
ATGCCTGGGC CGTTAAGCGA TGATACCCTG CGCAACTGGC CGTCGCTGGT CAGGGAAGAC
ACTTCGCGAA CCTTACCGAA ACGGATTACC TGGCTGCTGG ATAACCAAAA AAGGGTCGTC
GTACCGGACT GGGAGTCATC GGCAACCTGC CTGTCGGCAG GATTATGCGT GGGAATGGTG
CCGACGCATT TTGCCCGACA GTGGATAGAC AGCGGAAAAT GGGTGGCGCT GACGTTAGAG
AATCCGTTTC CCGATGCGGC CTGTTGCGTG ACGTGGCAGC AAAACGAGGC CTCGCCCGCG
CTGGCATGGC TGCTGGACTA TTTGGGCGAT AGCGAAACGT TGAATCGGGA GTGGCTGCGG
GAGCCAGAAG AGGCTCCCGA CAGCGGGGAT TAA
 
Protein sequence
MWSEYSLEVV DAVARNGSFS AAAQELHRVP SAVSYTVRQL EEWLAVPLFV RRHRDVELTP 
AGVWFLKEGR SVIKKMQITR QQCQQIANGW RGQLAIAVDN IVRPERTRQM IVDFYRHFDD
VELLVFQEVF NGVWDALSDG RVELAIGATQ AIPVGGRYAF RDMGTLSWSC VVASDHPLAS
MPGPLSDDTL RNWPSLVRED TSRTLPKRIT WLLDNQKRVV VPDWESSATC LSAGLCVGMV
PTHFARQWID SGKWVALTLE NPFPDAACCV TWQQNEASPA LAWLLDYLGD SETLNREWLR
EPEEAPDSGD
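
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of that step, with a simple GC-fraction helper of the kind that could be used to compare against the GC content field. The function names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTGGTCGG AATACTCGCT TGAAGTGGTT GACGCCGTAG CGCGCAATGG CAGTTTTAGC"
last_line = "GAGCCAGAAG AGGCTCCCGA CAGCGGGGAT TAA"

first_clean = clean_sequence(first_line)
last_clean = clean_sequence(last_line)

print("starts with ATG:", first_clean.startswith("ATG"))
print("ends with TAA stop codon:", last_clean.endswith("TAA"))
print("GC fraction of last line:", round(gc_fraction(last_clean), 3))
```

To compare against the GC content field, the whole gene sequence would have to be passed through `clean_sequence` first; the fraction for a single line says little about the full gene.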