Gene SeD_A0879 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0879 
Symbol 
ID6872088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp872380 
End bp873240 
Gene Length861 bp 
Protein Length286 aa 
Translation table11 
GC content52% 
IMG OID642784074 
Productphosphotransferase 
Protein accessionYP_002214749 
Protein GI198244030 
COG category[R] General function prediction only 
COG ID[COG0561] Predicted hydrolases of the HAD superfamily 
TIGRFAM ID[TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily
[TIGR01484] HAD-superfamily hydrolase, subfamily IIB 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.531066 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones79 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGACAG ACGGAATTAT TACTCTTAAT CTGGAAAAAA TTATGACTGC ACGCGTGATT 
GCCCTCGATT TAGACGGAAC ATTATTAACC CCGCATAAAA CCTTACTCCC CTCCTCGCTT
GAAGCGCTAT CACGCGCCAA AGAGGCGGGC TTTCAACTTA TCATTGTCAC GGGTCGCCAT
CACGTTGCTA TTCATCCTTT TTATCAGGCG CTGGCGCTGG AAACACCTGC TATTTGCTGC
AACGGCACCT ATTTGTATGA TTATCAAGCT AAAACTGTCC TGGATGCCGA TCCTATGCCC
GTGGATAAGG CGTTGCAGTT GATTGATTTA CTGGATGAGC ATCAGATTCA CGGCCTGATG
TATGTTGATG ACGCTATGCT TTACGAACAC CCAACCGGTC ACGTCGTGCG TACCTCCCGG
TGGGCGCAGA CCTTGCCGCC GGAGCAACGT CCGACCTTTA CACAGGTCTC TTCGTTGGCG
CAGGCGGCGC GCGACGTGAA TGCCGTGTGG AAGTTTGCGC TTACCGATGA AGATATTCCC
AGGCTACAGC GGTTCGGTCA GCATGTTGAA CAGGCGCTTG GCCTGGAGTG CGAATGGTCA
TGGCACGATC AGGTGGATAT CGCGCGCAAA GGCAACAGTA AAGGCAAGCG CCTTACCCAG
TGGATAGAAG CGCAGGGAGG GTCAATGAAA AATGTGATCG CTTTCGGCGA TAACTACAAC
GACATCAGTA TGCTGGAGGC GGCAGGCACC GGCGTTGCGA TGGGCAACGC CGATGAGGCG
GTGAAAGCGC GCGCTGACGT TGTGATCGGC GATAACACTA CCGATAGCAT CGCCAAATTT
ATTTACACCC ACCTACTATA G
 
Protein sequence
MPTDGIITLN LEKIMTARVI ALDLDGTLLT PHKTLLPSSL EALSRAKEAG FQLIIVTGRH 
HVAIHPFYQA LALETPAICC NGTYLYDYQA KTVLDADPMP VDKALQLIDL LDEHQIHGLM
YVDDAMLYEH PTGHVVRTSR WAQTLPPEQR PTFTQVSSLA QAARDVNAVW KFALTDEDIP
RLQRFGQHVE QALGLECEWS WHDQVDIARK GNSKGKRLTQ WIEAQGGSMK NVIAFGDNYN
DISMLEAAGT GVAMGNADEA VKARADVVIG DNTTDSIAKF IYTHLL