Gene SeD_A4607 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4607 
Symbol 
ID6872655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4447472 
End bp4448335 
Gene Length864 bp 
Protein Length287 aa 
Translation table11 
GC content48% 
IMG OID642787510 
Producthypothetical protein 
Protein accessionYP_002218108 
Protein GI198246134 
COG category[R] General function prediction only 
COG ID[COG0561] Predicted hydrolases of the HAD superfamily 
TIGRFAM ID[TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily
[TIGR01484] HAD-superfamily hydrolase, subfamily IIB 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.356417 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones76 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGTTAA TCACCGATCT GGATGGAACA TTACTCACCT CGCAGAAAAC AATCAGCCCG 
CGCACGCGTC AGGCGCTGAT TGCGTTCCGC CAGGACGGTG GTCTACTGGC GGCATGTTCC
GCCAGACCGG TCTCCTCAAT GGTACGCCTG CTACGCCAAC AACAGGTTGA TAGGTTGTTT
AGCTGGTGCG CCGGTTTTAA CTGCGGACAC CTTCTGGAGA TGGCGGGACA GCGCATTATT
CATGCTGCCC CTCTGACCGC CACAGACCTG TGGAATATTG ACCAGCATAT TTCTCTTTCC
CGCTATCACC ACCATTTTTT TAGTGCCGAA GCAATTCACC ATCGTGACGA TAGACTGATT
GCGCACTGGA CAACATATGA GGCTCGCTTA TTTGGATTAC CGCTTATAAC TGAAACTGCA
GAAAATATCT TTAATCGTCG CAACATATAT AAAATTACAC TTGTTGCCGC ATCTCCGGAG
ATAGATAATC TGTGTACAGA AGTGAATAAT CACCTGCCTT GTGGATATTA TGCGGTTGTC
ACGGGAGAGA ATTATATTGA TATTCAAAGA TCCGATATAA ATAAAGGGTG CATAATAGAA
CAATTAATTC ATTATTTAAA TATATCTTCT GACAAGGTGG TCGCGATTGG CGATCAGCAG
AATGATGTCA GCATGTTTGC CGCCGCCGGA ATCAGCATCG CAATGGGCAA CGCGCCGGAT
GCAGTAAAGC GGCAGGCCGG CTATGTGACT GCCACGAATG ATGAGGAGGG TATCGTCCAT
GCGTTGGAGT GGTTGCGTTG CCTTACGCAT CCAGTTACCA TGCGCCAAAG GTTGACGGCG
GCGAAAGATA ATGAATCCAA TTAA
 
Protein sequence
MLLITDLDGT LLTSQKTISP RTRQALIAFR QDGGLLAACS ARPVSSMVRL LRQQQVDRLF 
SWCAGFNCGH LLEMAGQRII HAAPLTATDL WNIDQHISLS RYHHHFFSAE AIHHRDDRLI
AHWTTYEARL FGLPLITETA ENIFNRRNIY KITLVAASPE IDNLCTEVNN HLPCGYYAVV
TGENYIDIQR SDINKGCIIE QLIHYLNISS DKVVAIGDQQ NDVSMFAAAG ISIAMGNAPD
AVKRQAGYVT ATNDEEGIVH ALEWLRCLTH PVTMRQRLTA AKDNESN