Gene SeD_A4981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4981 
Symbol 
ID6872004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4809149 
End bp4810699 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content54% 
IMG OID642787851 
Producthypothetical protein 
Protein accessionYP_002218441 
Protein GI198243246 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.986087 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones98 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGCCT CATGCGAAAC TGCGCTCCAG CAGCGTTGCC AGCAAATTGT GACCAGCCCG 
GTGCTCACGC CTGAACAAAA ACGCCATTTT CTGGCGCTGG AAGCTGAAAA CGCCCTGCCT
TATCCCCCCC TGCCGGAAGA TGCCCGCCAG GCGCTGGATG AAGGTGTCAT TTGCGATATG
TTTGAGGGGC ACGCCCCCTT CAAACCGCGC TACGTGTTGC CCGATTACGC CCGATTTCTG
GCTAACGGTT CACAGTGGCT GGAACTGGAA GGCGCGAAAG ATCTGGATGA TGCGTTATCC
CTACTCACCA TTCTGTATCA TCACGTTCCT TCCGTGACGT CCATGCCGGT TTATCTCGGC
CAGCTTGATG CGTTGCTGCA ACCATATGTT AGAATTATAA CACAAGATGC GATCGATATT
CGAATAAAAC GTTTCTGGCG TTATCTCGAC AGAACGCTGC CAGACGCCTT TATGCATGCC
AATATTGGCC CTGCCGATAC GCCTGTCACA CGAGCGATTT TGCGCGCTGA TGCCGAGCTA
AAGCAGGTGA CGCCTAACCT GACGTTTATC TACGATGCGG AAATTACGCC GGACGATCTG
CTGCTGGAGG TCGCCAAAAA CATTTGCGAA TGCAGTAAGC CACACATTTC CAACGGCCCT
GTAAATGATA AAATTTTCAC AAAAGGGCAT TATGGCATCG TCAGTTGTTA TAACTCGCTA
CCGCTTGGCG GCGGCGGCAG TACGCTGGTA CGTCTCAACC TGAAAGCCGT GGCAGAACGC
AGTACGTCTG TCGATGACTT CTTTTCACGC ACGCTACCGC ACTACTGCCG ACAGCAGATC
GCCATCATTA ATTCACGATG TGAATTCCTC TATGAAAAGT CACATTTCTT TGAGAATAGC
TTTCTTGTAC AGGAAGGTTT GATCGATCCC GAACGTTTTG CGCCGATGTT CGGTATGTAC
GGGCTGGCGG AAGCCGTGAA TCTGCTGTGC GAAAACGCGG GCCTGAACGC CCGTTACGGC
AAGAATGAAA CGGCGAACGA GCTGGGCTAC CGTATCAGCG CCCAACTGGC GGATTTCGTC
GAAAATACGC CAGTGAAGTA TGGCTGGAAG CAACGGGCGC TGCTCCATGC CCAGTCTGGC
ATCAGTTCCG ATATCGGCAC TACGCCGGGC GCGCGTCTGC CGTATGGCGA TGAACCGGAC
CCTATCACCC ATTTGCAAAC CGTCGCGCCG CACCATGCCT TTTACCATGC CGGGATCAGC
GACATTCTGA CGCTGGACGA AACCATCAAG CGTAATCCGC AGGCGCTGGT TCAGCTTTGT
CTTGGCGCGT TCAAAGCCGG GATGCGGGAA TTTACTGCCA ATGTCAGCGG CAACGATCTG
GTGCGCGTCA CCGGTTATAT GGTGCGCCTG TCGGATCTGG CGAAATTTCG CGCCGAAGGC
TCGCGCACGA ATACCACCTG GCCGGGAGAA GAAGCCGCAC GTAATACCCG CATCCTGGAA
CGACAGCCAC GCGTAGTCAG CCATGAACAA CAGATGCGCT TTAGTCAGTA A
 
Protein sequence
MPASCETALQ QRCQQIVTSP VLTPEQKRHF LALEAENALP YPPLPEDARQ ALDEGVICDM 
FEGHAPFKPR YVLPDYARFL ANGSQWLELE GAKDLDDALS LLTILYHHVP SVTSMPVYLG
QLDALLQPYV RIITQDAIDI RIKRFWRYLD RTLPDAFMHA NIGPADTPVT RAILRADAEL
KQVTPNLTFI YDAEITPDDL LLEVAKNICE CSKPHISNGP VNDKIFTKGH YGIVSCYNSL
PLGGGGSTLV RLNLKAVAER STSVDDFFSR TLPHYCRQQI AIINSRCEFL YEKSHFFENS
FLVQEGLIDP ERFAPMFGMY GLAEAVNLLC ENAGLNARYG KNETANELGY RISAQLADFV
ENTPVKYGWK QRALLHAQSG ISSDIGTTPG ARLPYGDEPD PITHLQTVAP HHAFYHAGIS
DILTLDETIK RNPQALVQLC LGAFKAGMRE FTANVSGNDL VRVTGYMVRL SDLAKFRAEG
SRTNTTWPGE EAARNTRILE RQPRVVSHEQ QMRFSQ