Gene SNSL254_A0848 details


Gene Information

Locus tag: SNSL254_A0848
Symbol:
ID: 6485117
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 852550
End bp: 853410
Gene Length: 861 bp
Protein Length: 286 aa
Translation table: 11
GC content: 53%
IMG OID: 642736260
Product: phosphotransferase
Protein accession: YP_002040020
Protein GI: 194446604
COG category: [R] General function prediction only
COG ID: [COG0561] Predicted hydrolases of the HAD superfamily
TIGRFAM ID: [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily
            [TIGR01484] HAD-superfamily hydrolase, subfamily IIB
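
The coordinate and length fields in this record are consistent with one another, and the relationship can be checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a gene length that includes the stop codon; both are assumptions about how the record is reported, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(852550, 853410)
print(length_bp)                  # 861
print(protein_length(length_bp))  # 286
```

Under these assumptions the computed values match the Gene Length (861 bp) and Protein Length (286 aa) fields.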


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.457335
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 90
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGACAG ACGGAATTAT TACTCTTAAT CTGGAAAAAA TTATGACTGC ACGCGTGATT 
GCCCTCGATT TAGACGGAAC ATTATTAACC CCGCATAAAA CCTTACTCCC CTCCTCGCTT
GAAGCGCTAT CACGCGCCAA AGAGGCGGGC TTTCAACTTA TCATTGTCAC GGGTCGCCAT
CACGTTGCTA TTCATCCTTT TTATCAGGCG CTGGCGCTGG AAACACCTGC TATTTGCTGC
AACGGCACCT ATTTGTATGA TTATCAAGCT AAAACTGTCC TGGATGCCGA TCCTATGCCC
GTGGATAAGG CGTTGCAGTT GATTGATTTA CTGGATGAGC ATCAGATTCA CGGCCTGATG
TATGTTGATG ACGCTATGCT TTACGAACAC CCAACCGGTC ACGTCGTGCG TACCTCCCGG
TGGGCGCAGA CCTTGCCTCC GGAGCAACGT CCGACCTTTG CACAGGTCTC TTCGTTGGCG
CAGGCGGCGC GCGACGTGAA TGCCGTGTGG AAGTTTGCGC TTACCGATGA AGATATTCCC
AGGCTACAGC GGTTCGGTCA GCATATTGAA CAGGCGCTTG GCCTGGAGTG CGAATGGTCA
TGGCACGATC AGGTGGATAT CGCGCGCAAA GGCAACAGTA AAGGCAAGCG CCTTACCCAG
TGGATAGAAG CGCAGGGAGG GTCAATGAAA AATGTGATCG CTTTCGGCGA TAACTACAAC
GACATCAGTA TGCTGGAGGC GGCAGGCACC GGCGTTGCGA TGGGCAACGC CGATGAGGCG
GTGAAAGCGC GCGCTGACGT CGTGATCGGC GATAACACTA CCGATAGCAT CGCCAAATTT
ATTTACACCC ACCTGCTATA G
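
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before the GC content field can be recomputed from it. The following is a minimal sketch; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGCCGACAG ACGGAATTAT TACTCTTAAT CTGGAAAAAA TTATGACTGC ACGCGTGATT "
seq = clean_sequence(first_line)
print(len(seq))                    # 60 bases on the first line
print(f"{gc_fraction(seq):.0%}")   # GC content of this line only
```

Passing the full 861 bp sequence through the same two functions gives a value that can be compared with the GC content field of the record.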
 
Protein sequence
MPTDGIITLN LEKIMTARVI ALDLDGTLLT PHKTLLPSSL EALSRAKEAG FQLIIVTGRH 
HVAIHPFYQA LALETPAICC NGTYLYDYQA KTVLDADPMP VDKALQLIDL LDEHQIHGLM
YVDDAMLYEH PTGHVVRTSR WAQTLPPEQR PTFAQVSSLA QAARDVNAVW KFALTDEDIP
RLQRFGQHIE QALGLECEWS WHDQVDIARK GNSKGKRLTQ WIEAQGGSMK NVIAFGDNYN
DISMLEAAGT GVAMGNADEA VKARADVVIG DNTTDSIAKF IYTHLL
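
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand and relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons, which does not matter here because this gene starts with ATG). The function name `translate` is hypothetical, and only the first and last codons of the gene are used as examples.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons of the gene sequence.
print(translate("ATGCCGACAG ACGGAATTAT T"))  # MPTDGII
# Last five codons, including the TAG stop.
print(translate("ACCCACCTGCTATAG"))          # THLL
```

Both fragments agree with the start (MPTDGII...) and end (...THLL) of the protein sequence listed in the record.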