Gene Information |
Locus tag | SNSL254_A0848 |
Symbol | |
ID | 6485117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 852550 |
End bp | 853410 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642736260 |
Product | phosphotransferase |
Protein accession | YP_002040020 |
Protein GI | 194446604 |
COG category | [R] General function prediction only |
COG ID | [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily; [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
|
|
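The coordinate and length fields in the table above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the reported gene length includes the stop codon; both assumptions fit the numbers in this record but are not stated by the source. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Gene Information table.
length_bp = gene_length(852550, 853410)
print(length_bp)                  # 861
print(protein_length(length_bp))  # 286
```

Both results match the Gene Length (861 bp) and Protein Length (286 aa) rows.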
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.457335 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 90 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGACAG ACGGAATTAT TACTCTTAAT CTGGAAAAAA TTATGACTGC ACGCGTGATT GCCCTCGATT TAGACGGAAC ATTATTAACC CCGCATAAAA CCTTACTCCC CTCCTCGCTT GAAGCGCTAT CACGCGCCAA AGAGGCGGGC TTTCAACTTA TCATTGTCAC GGGTCGCCAT CACGTTGCTA TTCATCCTTT TTATCAGGCG CTGGCGCTGG AAACACCTGC TATTTGCTGC AACGGCACCT ATTTGTATGA TTATCAAGCT AAAACTGTCC TGGATGCCGA TCCTATGCCC GTGGATAAGG CGTTGCAGTT GATTGATTTA CTGGATGAGC ATCAGATTCA CGGCCTGATG TATGTTGATG ACGCTATGCT TTACGAACAC CCAACCGGTC ACGTCGTGCG TACCTCCCGG TGGGCGCAGA CCTTGCCTCC GGAGCAACGT CCGACCTTTG CACAGGTCTC TTCGTTGGCG CAGGCGGCGC GCGACGTGAA TGCCGTGTGG AAGTTTGCGC TTACCGATGA AGATATTCCC AGGCTACAGC GGTTCGGTCA GCATATTGAA CAGGCGCTTG GCCTGGAGTG CGAATGGTCA TGGCACGATC AGGTGGATAT CGCGCGCAAA GGCAACAGTA AAGGCAAGCG CCTTACCCAG TGGATAGAAG CGCAGGGAGG GTCAATGAAA AATGTGATCG CTTTCGGCGA TAACTACAAC GACATCAGTA TGCTGGAGGC GGCAGGCACC GGCGTTGCGA TGGGCAACGC CGATGAGGCG GTGAAAGCGC GCGCTGACGT CGTGATCGGC GATAACACTA CCGATAGCAT CGCCAAATTT ATTTACACCC ACCTGCTATA G
|
Protein sequence | MPTDGIITLN LEKIMTARVI ALDLDGTLLT PHKTLLPSSL EALSRAKEAG FQLIIVTGRH HVAIHPFYQA LALETPAICC NGTYLYDYQA KTVLDADPMP VDKALQLIDL LDEHQIHGLM YVDDAMLYEH PTGHVVRTSR WAQTLPPEQR PTFAQVSSLA QAARDVNAVW KFALTDEDIP RLQRFGQHIE QALGLECEWS WHDQVDIARK GNSKGKRLTQ WIEAQGGSMK NVIAFGDNYN DISMLEAAGT GVAMGNADEA VKARADVVIG DNTTDSIAKF IYTHLL
|
| |
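The relationship between the gene sequence and the protein sequence above can be reproduced with a minimal sketch. It builds the codon table by hand rather than relying on a library, and it uses only the first 30 bases of the gene sequence to stay short. Translation table 11 assigns the same amino acids to codons as the standard table (the two differ in permitted start codons), so a single mapping is used here. The helper names `clean`, `translate`, and `gc_percent` are hypothetical and not part of any database tool.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the Gene sequence row.
prefix = "ATGCCGACAG ACGGAATTAT TACTCTTAAT"
print(translate(prefix))  # MPTDGIITLN
```

The output matches the first block of the Protein sequence row. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.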