Gene SeD_A1851 details


Gene Information

Locus tag: SeD_A1851
Symbol: mic
ID: 6873679
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 1791778
End bp: 1792998
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 52%
IMG OID: 642784981
Product: transcriptional regulator Mic
Protein accession: YP_002215649
Protein GI: 198245602
COG category: [G] Carbohydrate transport and metabolism; [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID:
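
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1791778
end_bp = 1792998
reported_gene_length = 1221      # bp
reported_protein_length = 406    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# One codon (3 bp) per residue, minus the terminal stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, gene_length == reported_gene_length)
print(protein_length, protein_length == reported_protein_length)
```

Under these assumptions both computed values agree with the reported Gene Length and Protein Length.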


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 71
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTTGCTG ATAGTCAGCC TGGGCATATC GATCAAATTA AGCAGACCAA TGCTGGCGCA 
GTGTATCGCC TGATTGATCA GCTCGGACCG GTATCGCGAA TTGACCTGTC TCGTCTGGCG
CAGTTGGCGC CTGCCAGTAT TACGAAAATT GTTCGCGAAA TGCTGGAAGC GCATTTGGTT
CAGGAACTTG AAATTAAAGA GGCGGGCAGT CGCGGACGTC CCGCCGTCGG GCTGATGGTG
GAAACGGAAG CCTGGCACTA TTTATCTATT CGTATTAGCC GTGGCGAAAT TTTCCTTGCA
CTGCGCGATC TTAGCAGCAA ACTGGTGGTA GAAGAGTGTC TGGCGCTGCC GTTAAACGAA
GCTACGCCGT TGCTTGAGCG AATTATTACG CACGTTGATC GGTTTTTTAC CCGCCATCAG
CAGAAACTGG AGCGTCTGAC CTCCATTGCC ATTACGTTAC CGGGCATTAT CGATACCGAA
AACGGCGTTG TGCACCGGAT GCCGTATTAC GAAGATGTCA AAGAGATGCC TTTGGGAGAT
GCGCTGGAGC GGCACACCGG CGTACCGGTT TACATTCAGC ATGATATTAG CGCCTGGACG
ATGGCAGAGG CGCTTTTTGG CGCCTCACGC GGCGCGCGCG ACGTTATCCA GGTGGTGATT
GATCATAATG TGGGGGCGGG CGTTATCACC GACGGTCATT TGCTTCATGC GGGTAGTAGC
AGTCTGGTAG AGATTGGGCA TACCCAGGTC GATCCTTATG GTAAGCGCTG TTATTGCGGT
AATCATGGCT GTCTGGAGAC CATCGCCAGC GTCGATAGCG TGCTGGAACT TACGCAGCTT
CGGCTTAATC AGTCGATGAG TTCAATGTTG CACGGCCAGC CGTTAACGGT AGATTCACTG
TGTCAGGCGG CGATGCAGGG AGATCTATTA GCAAAAGATA TTATTAGCGG CGTTGGCGCG
CATGTCGGAC GCATTCTGGC TATCATGGTG AATTTATTTA ATCCGCAAAA AATTCTTATT
GGTTCGCCGC TAAGTAAAGC GGCTGATATC CTTTTTCCAG CCATTGCTGA CAGTATCCGT
CAACAGGCGC TGCCCGCCTA CAGCAGGAAT ACGGTTGTGG AAAGCACGCA GTTTACCAAC
CAGGGTACGA TGGCCGGGGC GGCGTTGGTA AAAGACGCGA TGTATAACGG CTCTTTGTTG
ATTCGTCTAT TACAGGGTTA A
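
A GC content figure like the one reported for this gene can be recomputed from a sequence listed in 10-base blocks. This is a minimal sketch, not the database's own method; the helper name `gc_fraction` is hypothetical, and the example input is only the first two blocks of the gene sequence, so its result is not the whole-gene value.

```python
def gc_fraction(sequence_text: str) -> float:
    """Return the fraction of G and C bases in a whitespace-grouped sequence."""
    # Remove the spaces and line breaks that separate the 10-base blocks.
    bases = "".join(sequence_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence, used here only as a demo input.
first_two_blocks = "GTGGTTGCTG ATAGTCAGCC"
print(round(gc_fraction(first_two_blocks), 2))
```

Passing the full gene sequence text to the same function would give the fraction to compare against the reported GC content.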
 
Protein sequence
MVADSQPGHI DQIKQTNAGA VYRLIDQLGP VSRIDLSRLA QLAPASITKI VREMLEAHLV 
QELEIKEAGS RGRPAVGLMV ETEAWHYLSI RISRGEIFLA LRDLSSKLVV EECLALPLNE
ATPLLERIIT HVDRFFTRHQ QKLERLTSIA ITLPGIIDTE NGVVHRMPYY EDVKEMPLGD
ALERHTGVPV YIQHDISAWT MAEALFGASR GARDVIQVVI DHNVGAGVIT DGHLLHAGSS
SLVEIGHTQV DPYGKRCYCG NHGCLETIAS VDSVLELTQL RLNQSMSSML HGQPLTVDSL
CQAAMQGDLL AKDIISGVGA HVGRILAIMV NLFNPQKILI GSPLSKAADI LFPAIADSIR
QQALPAYSRN TVVESTQFTN QGTMAGAALV KDAMYNGSLL IRLLQG
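
The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can serve as an initiation codon. The sketch below illustrates this on the first line of the gene sequence. It is an illustration only: the codon table is built from the standard amino acid assignments, the `ALT_STARTS` set lists just two of the alternative initiators (an assumption made for brevity), and the function name `translate` is hypothetical.

```python
from itertools import product

# Amino acid assignments in TCAG codon order (shared by the standard code and table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only GTG and TTG are handled as alternative start codons here.
ALT_STARTS = {"GTG", "TTG"}


def translate(dna_text: str) -> str:
    """Translate a whitespace-grouped CDS, stopping at the first stop codon."""
    dna = "".join(dna_text.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in ALT_STARTS:
            # An alternative initiation codon is read as methionine.
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence as listed above.
first_line = "GTGGTTGCTG ATAGTCAGCC TGGGCATATC GATCAAATTA AGCAGACCAA TGCTGGCGCA"
peptide = translate(first_line)
print(peptide)
```

The printed peptide can be compared with the first 20 residues of the protein sequence listed above.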