Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A3425 |
Symbol | |
ID | 6872005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 3290440 |
End bp | 3291456 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642786420 |
Product | hypothetical protein |
Protein accession | YP_002217058 |
Protein GI | 198243664 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.955308 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.00600667 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAACGC TTATCTGTAA TAATCCGGGA AATATCGAAT ACATTGAAAG AGATATTCCT CATTTAAAAG ACGATGAAGT ACTGTTAAAA ATTAAAGCGG TTGGTATTTG CGGCACCGAT ATCCATGCGT TTGCCGGTCG CCAGCCCTTT TTTGCCTATC CGCGCGTGCT GGGTCATGAA ATTTGCGGCG TGGCGGAAAT ATTGGGTAAA TCGTGCAGCA CGGCGAAGGT CGGCCAACGC TATAGCGTTA TTCCCTGTAT TCCTTGCGGC GCTTGCGCAG CCTGCCGGGA AGGAAAAACC AACTGCTGTG AGAACGTCTC GCTGTATGGC GTTCACCAGG ACGGCGGCTT TAGCGAATAT CTGGCCGTTC GTGAACAAAA TCTGGTTGAA CTGTCGGATA ACCTCACCGA CAGCGCGGGC GCGCTGGTTG AGTGTTTCGC CATCAGCGCC CACGCAGTAC GCCGCGCTGA CGTGAAGCCG CAGCAAAACA TTGTGGTGGT CGGCGCCGGG CCGATTGGGC TGGCCGCCGC CGCGATAGCA AAAGCCAAAG GCGCTCGCGT CGCGGTAGCT GATATTGACG CCGAACGCCG TCGTCTGGTC GCCGAAAAGG TAGGCGTTGC AACGCTCGAT CCGTCGTCAG ACGACTATAT TGATGTGCTG AAAGCCTGTT TCTCCGGTGA ATTAGCCGGC ATTGTGCTTG ATGCCACCGG CAATAAATCC TCCATGAGCC GCGCTGTTGA TCTGATTCTT CACGGCGGAA AAATCGTCTT CATCGGGCTA TATATCGGCG AGCTCGTCAT TGACGATCCG ACCTTCCATA AAAAAGAGAC CACGCTATTG AGCAGTCGTA ACGCGACGCG GGAAGACTTC GAATGTGTCA TTGAGCTAAT GGCTCAGGGC GCTATTAGCG AAACAATGAT GAAAAACCAG GAATTCGATT TCTATACGTT CGGCAACCAG TACCAGAAGA ACGTGGTAGA AAACAAAAAA TTGGTTAAAG GCGTTATCAA ATTTTAA
|
Protein sequence | MKTLICNNPG NIEYIERDIP HLKDDEVLLK IKAVGICGTD IHAFAGRQPF FAYPRVLGHE ICGVAEILGK SCSTAKVGQR YSVIPCIPCG ACAACREGKT NCCENVSLYG VHQDGGFSEY LAVREQNLVE LSDNLTDSAG ALVECFAISA HAVRRADVKP QQNIVVVGAG PIGLAAAAIA KAKGARVAVA DIDAERRRLV AEKVGVATLD PSSDDYIDVL KACFSGELAG IVLDATGNKS SMSRAVDLIL HGGKIVFIGL YIGELVIDDP TFHKKETTLL SSRNATREDF ECVIELMAQG AISETMMKNQ EFDFYTFGNQ YQKNVVENKK LVKGVIKF
|
| |