Gene SeD_A1103 details


Gene Information

Locus tag: SeD_A1103
Symbol:
ID: 6873365
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 1100304
End bp: 1101878
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 56%
IMG OID: 642784288
Product: nudix hydrolase
Protein accession: YP_002214962
Protein GI: 198241782
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1051] ADP-ribose pyrophosphatase
TIGRFAM ID:
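
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are inclusive, 1-based positions and that the CDS ends in a stop codon, which is what the listed numbers imply. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1100304, 1101878)
print(length, protein_length(length))
```

With the values in this record, the computed lengths agree with the listed Gene Length (1575 bp) and Protein Length (524 aa).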


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value: 0.426954
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACACCT ACGCTGCCGG GATCCTGTTT AAGTCTGGCG GGAAAATATT TCTGGTTAAG 
CGTGGGGATG ATGGTTCGTG GACGGTACCG GGCGGAAAAC TCGAAGAGGG GGAGACGCCT
GAAGCCGCGG CAAAGCGTGA AGTGCTGGAA GAATGCGGGT TTGATTATTC CGCACCGCTG
ACGCCTCATA CCCTGATTGA TGGCTATGTT ACCTACCTCG CAGATGATGC TGAGCAATTC
GACGCGGTAC TGAACGATGA AAATCAGGCC TGTGGCTGGT TTTCTCCGGA TGAACTGCCG
GAACCGTTGC ATCCCGGTAT GGTGGCAATG CTTGATGCCG AACCACTCAA TGAAAAGGAC
GTTGCCGGGC TTATTGCCGA CGGGCAACTC ACATCCCCGC AGTTTTTCAG AAATATGTAC
CTGTGGGCGC TGCGTATCAC CGGAACGGGT GTTACCTGGC GTTCTAAGTT CAGGCAATAC
GCTTACCGTT CTCCCGAGAA TTACCTCACT GATGATTTCC TCGCCCGGTG CTCTGGCCTG
CCGGTGATCT GGCTGCACCC GGAGAAAAAC ACGCTGAACA GCGAGGAGTA CGCCGCGAGG
ACTATCGGTG CGATTGCATT TGCCTGGATC CAGGGTGATG AGGTGTGGGG AATGGCCCGC
ATCTACGACA CTGACGCCGC CACGATTCTT TCAACGCGGC AACTGAGTAC ATCCCCCACG
GTGACGGGCG GCGATGACGT TCTGATCAAC GTCGACGGCG AGCCGCTGCT GCTGGAGGGG
AACCCTGTTT TACTGGACCA CCTGGCTATT TGTGAGCAGG GCGTCTGGGA CAAGCTGGGG
GAACCGACGG GAGTTAAATC CGACACACTT TTGAACGAGG TCCAGAAAAT GGATGAAGAA
AAAGTATTAG CACTCATTAA CCAGGCGCTG GACGCTCGCG AAGCCCGCGC AAAGGCCGAC
GCCGAGGAAA AAGCAAAAGC AGATGCTGAA GCAGCAGAAA AGGCGAAAGC TGATGAAGAT
GCCGCCCGTC TCAAGGAAGA GGAAGAAAAG GCGAAGGCTG ACGCTGAAGC AAAGGCCAAA
GCGGACGCGG AGGCAGAAGA AAAAGCCAAA GCGGATGCCG AACTGGAAAA AATCCGCGCA
GACATGGAAG AAATGAAAAG TCGTGTACCG CAGGAACTCA GCGATGAAGA GCGCAATGAA
ATCGCTGATA CCCAGTGTAA GGCCGACAGC GTGTTTGCTT CATTTGGTGA GCGCGCGCCG
CAGCCGATGG CGGGAGAACG CGCTATGCCA TACCGCCGCC GCATCATGAC TCGCCTGCAA
AAATATTCTT CAGACTATAA AGAAGTGGAT CTGCATGCCA TCGCAGACAG CCAGCTCCTG
AGTATTGCGG AGAAAAAAAT CTATGCCGAT GCGCAGGCAT CAGCGGCATC CAGTCTGGAG
CCCGGCGCCG GGTTACGTGA AGTCATCCGC ACCGACGCCA CCGGACGCCG TATCAGTACC
TTTATCGGCG ATCCGTCCGC AACATGGGCA CCGTTCCAGG CCGTCAGCCG CAAAGTCGCT
GGCATCAAAC AGTAA
 
Protein sequence
MNTYAAGILF KSGGKIFLVK RGDDGSWTVP GGKLEEGETP EAAAKREVLE ECGFDYSAPL 
TPHTLIDGYV TYLADDAEQF DAVLNDENQA CGWFSPDELP EPLHPGMVAM LDAEPLNEKD
VAGLIADGQL TSPQFFRNMY LWALRITGTG VTWRSKFRQY AYRSPENYLT DDFLARCSGL
PVIWLHPEKN TLNSEEYAAR TIGAIAFAWI QGDEVWGMAR IYDTDAATIL STRQLSTSPT
VTGGDDVLIN VDGEPLLLEG NPVLLDHLAI CEQGVWDKLG EPTGVKSDTL LNEVQKMDEE
KVLALINQAL DAREARAKAD AEEKAKADAE AAEKAKADED AARLKEEEEK AKADAEAKAK
ADAEAEEKAK ADAELEKIRA DMEEMKSRVP QELSDEERNE IADTQCKADS VFASFGERAP
QPMAGERAMP YRRRIMTRLQ KYSSDYKEVD LHAIADSQLL SIAEKKIYAD AQASAASSLE
PGAGLREVIR TDATGRRIST FIGDPSATWA PFQAVSRKVA GIKQ
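
The protein sequence corresponds to the gene sequence read under translation table 11, which uses the standard codon assignments but accepts alternative start codons such as GTG. That is why the gene begins with GTG while the protein begins with M. The following is a minimal sketch of that translation, not the pipeline used to produce this record; `translate` and the constants are illustrative names, and the start-codon set is the one commonly listed for table 11.

```python
BASES = "TCAG"
# Standard amino-acid assignments in TCAG codon order (shared by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)
# Start codons assumed for translation table 11; all are read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # an initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("GTGAACACCT ACGCTGCCGG GATCCTGTTT"))
```

Applied to the first 30 bases of the gene sequence, this yields the first ten residues of the protein sequence, MNTYAAGILF.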