Gene SeD_A3344 details

Gene Information

Locus tag: SeD_A3344
Symbol:
ID: 6871406
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3219413
End bp: 3220432
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 58%
IMG OID: 642786348
Product: HTH-type transcriptional regulator AscG
Protein accession: YP_002216987
Protein GI: 198243955
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
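
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below is only a consistency check on the listed values. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3219413
end_bp = 3220432

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a
# stop codon, which is not translated into an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1020 bp) and Protein Length (339 aa).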


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.718161
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value: 0.0271899
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACAA TGCTGGATGT TTCCCGCCAT GCGGGCGTAT CAAAGGCCAC CGTCTCACGA 
GTGCTGAATG GGACGGGGCA GGTAAAAGAA AGCACGCGCC AGAAAGTGTT TACGGCGATG
CAGGCTCTGG GCTATCGCCC CAACCTGCTG GCACGCTCGC TGGCGAATCG CACCAGCAAC
AGCATCGGTC TGGTCGTCTC TACGTTTGAC GGCTTCTATT TCGGCAGTTT GTTGCGCCGG
GCGTCGCGCC AGGCGGAGTT TCATAACAAG CAGTTGATCG TCACCGATGG TCACGATACG
CCGGAACGAG AGCAGAAAGC CGTACAAATG TTGGCCGACA GACAGTGCGA CGCTATTATT
CTTTACACTC GCTATATGGA TGAGCCGGCG ATTTTGTCGT TGATTGACGC CACGGAAATG
CCGCTTGTGA TTATTAATCG CAACGTCACT CAGGCCCGCG ATCGCGCTAT TTTCTTCGAG
CAGGAGACGG CGGCATTCCA GGCGGTGGAA TACCTGATTA CGCAGGGCCA TCGCGATATC
GCCTGTATTA CGCTGCCTGT TCATACTCCC ACTGGCACAT CACGCGTAGC GGGTTATCGC
AAGGCGCTGG AAAAGTATGG TATTCCCTGG CAACCGGCAA AAGTGAAATA CGGCGATTAC
ACGCTGACGC GCGGCTATGA CGCCTGCCGG GAATTACTGG AGGAAGGCGT CACGTTTAGC
GCGCTATTCG CCTGTAATGA TGACACGGCG CTGGGCGCGG CAAAAGCGCT GCGCCAGGCC
GGATTACGCA TCCCGCAGGA TGTGTCGCTG TTTGGTTTTG ACGATGCGCC GGGCGCAACC
TGGCTTGAAC CGGGGCTTTC AACAGTCTAT TTACCCATCG AGGATATGAT AGCCACCGCG
ATCGATCAGG CCGTTCGCCT GGCGAACAGC GAGCCGGTCG CCCCGATCCC GCCCTTTACC
GGCACGCTGA TTCTGCGCGA GTCCGTCGCC GCAGGCCCGT TTTTTCAACG TCCGGCCTAA
 
Protein sequence
MATMLDVSRH AGVSKATVSR VLNGTGQVKE STRQKVFTAM QALGYRPNLL ARSLANRTSN 
SIGLVVSTFD GFYFGSLLRR ASRQAEFHNK QLIVTDGHDT PEREQKAVQM LADRQCDAII
LYTRYMDEPA ILSLIDATEM PLVIINRNVT QARDRAIFFE QETAAFQAVE YLITQGHRDI
ACITLPVHTP TGTSRVAGYR KALEKYGIPW QPAKVKYGDY TLTRGYDACR ELLEEGVTFS
ALFACNDDTA LGAAKALRQA GLRIPQDVSL FGFDDAPGAT WLEPGLSTVY LPIEDMIATA
IDQAVRLANS EPVAPIPPFT GTLILRESVA AGPFFQRPA
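
The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping before analysis and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last display lines of the gene sequence are used here to check for a start codon and a stop codon. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display grouping; uppercase the result."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (display grouping is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last display lines of the gene sequence, copied from above.
first_line = "ATGGCGACAA TGCTGGATGT TTCCCGCCAT GCGGGCGTAT CAAAGGCCAC CGTCTCACGA"
last_line = "GGCACGCTGA TTCTGCGCGA GTCCGTCGCC GCAGGCCCGT TTTTTCAACG TCCGGCCTAA"

# Stop codons shared by the standard code and translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}

starts_with_atg = clean_sequence(first_line).startswith("ATG")
ends_with_stop = clean_sequence(last_line)[-3:] in STOP_CODONS

print("Starts with ATG:", starts_with_atg)
print("Ends with a stop codon:", ends_with_stop)
```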