Gene SeD_A4465 details


Gene Information

Locus tag: SeD_A4465
Symbol:
ID: 6871751
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 4304871
End bp: 4305911
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 57%
IMG OID: 642787382
Product: ADP-ribosylglycohydrolase family protein
Protein accession: YP_002217993
Protein GI: 198242276
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1397] ADP-ribosylglycohydrolase
TIGRFAM ID:
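
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows the arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (both are assumptions, though they are consistent with the values in this record). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues in the product, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4304871, 4305911)
print(length)                     # 1041
print(protein_length_aa(length))  # 346
```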


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.761974
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 83
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGTTG ATCAAAACAA AATACTGGGA TGCCTGGTGG GCGCCGCCGC CGCGGACGCG 
ATGGGAGCCG CGACGGAAGT ACGCACCCAG CAGCAAATAA AAGACTATTT TGGCGGCTGG
GTGACGACCT TTCAAAAACC GCCAGCGGAC ACGTTTGGCC GCTGCAACGA AGCGGGGATG
TGCACGGATG ATTTTATTCA GGCGAAGTAC ATCATGGATG CGCTGCTGCG CCATCAACGC
CAGGTCAGCG ACGAGGCGAT GCGCGAGGCT TTTCAGCAGT GGCTGGATTA CCCGTACTAC
GCCAACTTTA CCGGCCCGAC GACGCGTGCG GCAATGAAGG CAATATTCAA TGATAACCGC
GCCTCTTTAC AGGGTGAGCT GGAAGGCGAG AAACAGTCGG TACAGATTAT TAATAAGGGT
AACGCGGAGG CAACGAACGG CGCCGCCATG AAGATTTGGC CAGCGGCGGT GCTGCACCCG
GGTGATATTG ACGCGGCGAT TGACTGCGCG CTGCAGATTT GCCGTTTTAC GCATAATAAC
GTGCTGGCGA TGTCCGGCGC AGCGGCGATG GCGGCGGCAA CCAGCGAGGC GTTAAGAGCG
CAGACCAATG CAGACAGCAT TATTGCCGCC GGTATTTACG GTGCGCAAAG GGGCTATCTG
CTGGCGCAGG AGCAAGGGGC GATGATGGTG GCAGGTCCTT CCGTTGCCCG ACGCATTGAA
CTGGCCGTAG ATATCGGTAA ACGTCATCGC CATTGGGAAA CGGCGGTGGT GGAACTTGCT
GATATTATTG GCTCCGGGCT GCACGTGAGT GAAGCGGTGC CGGCGGCCTT TGGCCTGTTC
GCGTGTTGTC CGAATTCTGC CGTAGATGCT ATTATCTCCG GCGTTAATAT CGGCAATGAT
ACTGATACTG TCGCCACCAT GGTCGGGGCG ATTTCCGGCG CATTCCATGG CGTGGAGGCT
TTTCCCGCCG ATTATTTAAC GACTTTGGAT CGTATGAATC ATTTCGATTT GGCAGAACTG
GCCAGGCAAA TCGCAGGGTA G
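
The gene sequence above is printed in blocks of 10 bases, so it needs to be joined before its length or GC content can be checked against the Gene Length and GC content fields. A minimal sketch follows; the helper names are hypothetical, and only the first line of the sequence is pasted in for illustration.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence by dropping all whitespace; return uppercase."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGCACGTTG ATCAAAACAA AATACTGGGA TGCCTGGTGG GCGCCGCCGC CGCGGACGCG"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this first line only
```

Applying the same two functions to the full gene sequence gives values that can be compared with the 1041 bp length and 57% GC content reported in the record.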
 
Protein sequence
MHVDQNKILG CLVGAAAADA MGAATEVRTQ QQIKDYFGGW VTTFQKPPAD TFGRCNEAGM 
CTDDFIQAKY IMDALLRHQR QVSDEAMREA FQQWLDYPYY ANFTGPTTRA AMKAIFNDNR
ASLQGELEGE KQSVQIINKG NAEATNGAAM KIWPAAVLHP GDIDAAIDCA LQICRFTHNN
VLAMSGAAAM AAATSEALRA QTNADSIIAA GIYGAQRGYL LAQEQGAMMV AGPSVARRIE
LAVDIGKRHR HWETAVVELA DIIGSGLHVS EAVPAAFGLF ACCPNSAVDA IISGVNIGND
TDTVATMVGA ISGAFHGVEA FPADYLTTLD RMNHFDLAEL ARQIAG
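
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact NCBI-style amino acid string (bases ordered T, C, A, G). It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits; the `translate` helper is hypothetical and is not part of any library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG in T, C, A, G order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a cleaned CDS, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record, with spaces removed.
print(translate("ATGCACGTTGATCAAAACAAAATACTGGGA"))  # MHVDQNKILG
```

The output matches the first block of the protein sequence above. Running `translate` on the full, whitespace-stripped gene sequence allows the same comparison against all 346 residues.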