Gene SeHA_C4398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4398 
Symbol 
ID6488922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4282300 
End bp4283340 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content57% 
IMG OID642744481 
ProductADP-ribosylglycohydrolase superfamily 
Protein accessionYP_002048070 
Protein GI194451860 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1397] ADP-ribosylglycohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones101 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGTTG ATCGAAACAA AATACTGGGA TGCCTGGTGG GCGCCGCCGC CGCGGACGCG 
ATGGGAGCCG CGACGGAAGT ACGCACCCAG CAGCAAATAA AAGACTATTT TGGCGGCTGG
GTGACGACCT TTCAAAAACC GCCAGCGGAC ACGTTTGGCC GCTGCAACGA AGCGGGGATG
TGCACGGATG ATTTTATTCA GGCGAAGTAC ATCATGGATG CGCTGCTACG CCATCAACGC
CAGGTCAGCG ACGAGGCGAT GCGCGAGGCT TTTCAGCAGT GGCTGGATTA CCCGTACTAC
GCCAACTTTA CCGGCCCGAC GACGCGTGCG GCAATGAAGG CAATATTCAA TGATAACCGC
GCCTCTTTAC AGGGTGAGCT GGAAGGCGAG AAACAGTCGG TACAGATCAT TAATAAGGGT
AATGCGGAGG CAACGAACGG CGCCGCCATG AAGATTTGGC CAGCGGCGGT GCTGCACCCG
GGTGATATTG ACGCGGCGAT TGACTGCGCG CTGCAGATTT GCCGTTTTAC GCATAATAAC
GTGCTGGCGA TGTCCGGCGC AGCGGCGATG GCGGCGGCAA CCAGCGAGGC GTTAAGAGCG
CAGACCAACG CAGACAGCAT TATTGCCGCC GGTATTTACG GTGCGCAAAG GGGCTATCTG
CTGGCGCAGG AGCAAGGGGC GATGATGGTC GCCGGGCCTT CCGTTGCCCG ACGCATTGAG
CTGGCCGTAG ATATCGGTAA ACGTCATCGC CATTGGGAAA CGGCGGTGGT GGAACTTGCT
GATATTATTG GCTCCGGGCT GCACGTGAGT GAAGCGGTGC CGGCGGCCTT TGGCCTGTTC
GCGTGTTGTC CGAATTCTGC CGTAGATGCT ATTATCTCCG GCGTTAATAT CGGCAATGAT
ACTGATACTG TCGCCACCAT GGTCGGGGCG ATTTCCGGCG CATTCCATGG CGTGGAGGCT
TTTCCCGCCG ATTATTTAAC GACTTTGGAT CGTATGAATC ATTTCGATTT GGCAGAACTG
GCCAGGCAAA TCGCAGGGTA G
 
Protein sequence
MHVDRNKILG CLVGAAAADA MGAATEVRTQ QQIKDYFGGW VTTFQKPPAD TFGRCNEAGM 
CTDDFIQAKY IMDALLRHQR QVSDEAMREA FQQWLDYPYY ANFTGPTTRA AMKAIFNDNR
ASLQGELEGE KQSVQIINKG NAEATNGAAM KIWPAAVLHP GDIDAAIDCA LQICRFTHNN
VLAMSGAAAM AAATSEALRA QTNADSIIAA GIYGAQRGYL LAQEQGAMMV AGPSVARRIE
LAVDIGKRHR HWETAVVELA DIIGSGLHVS EAVPAAFGLF ACCPNSAVDA IISGVNIGND
TDTVATMVGA ISGAFHGVEA FPADYLTTLD RMNHFDLAEL ARQIAG