Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_2071 |
Symbol | |
ID | 7088364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | + |
Start bp | 2452159 |
End bp | 2452980 |
Gene Length | 822 bp |
Protein Length | 273 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643460974 |
Product | short chain dehydrogenase |
Protein accession | YP_002357998 |
Protein GI | 217973247 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00111484 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.00000000000219735 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGTTTAATG CTACGATGTT TAACTATCAA GGTAAGAATG TTGTCGTTGT TGGTGGTACC AGTGGGATAA ATCTCGCCAT TGCAGTGGCG TTCGCACAGG CTGGCGCCAA TGTCGCAGTC GCGAGTCGCA GCCAAGATAA AGTTGATGCT GCGGTATTGC AGCTGCAGCA GGCCAATCCT GATGGTATTC ATTTAGGTGT GAGTTTTGAT GTGCGCGATC TTAGCGCCCT TGAAGTCGGT TTTGATAAAA TCGCTTCTGA GTTTGGTTTT ATCGATGTGC TGATTAGCGG CGCGGCGGGG AATTTCCCTG CCAGCGCCGC AAAGCTTTCG GCTAATGGTT TTAAGTCGGT GATGGACATT GATTTGCTGG GCAGTTTTCA AGTGCTAAAG CAGGCTTATC CTTTGCTGCG TCGACCCAAT GGCAATATTA TCCAGATTTC GGCGCCGCAG GCTTCAATTG CTATGCCCAT GCAAGTACAT GTGTGCGCGG CAAAAGCGGG CGTGGATATG CTGACGCGAA CCTTAGCGTT AGAGTGGGGC TGTGAGGGGC TGAGAATTAA TTCGATTATG CCAGGCCCCA TAGCTAACAC TGAAGGTTTT AATCGCCTTG CACCCACCGC CGAATTACAG CAGAAGGTTG CCCAATCCGT TCCGCTAAAA CGCAACGGCG CAGGTCAGGA TATTGCTAAT GCAGCGTTAT TCTTAGGCTC CGAATTGGCG TCCTATATTA CGGGAGTTGT GCTGCCGGTT GATGGCGGTT GGTCACTTGG CGGCGCCAGT ATTGCGATGA CAGAGCTTGG CGAACTGGCC GCAAAAATGT AG
|
Protein sequence | MFNATMFNYQ GKNVVVVGGT SGINLAIAVA FAQAGANVAV ASRSQDKVDA AVLQLQQANP DGIHLGVSFD VRDLSALEVG FDKIASEFGF IDVLISGAAG NFPASAAKLS ANGFKSVMDI DLLGSFQVLK QAYPLLRRPN GNIIQISAPQ ASIAMPMQVH VCAAKAGVDM LTRTLALEWG CEGLRINSIM PGPIANTEGF NRLAPTAELQ QKVAQSVPLK RNGAGQDIAN AALFLGSELA SYITGVVLPV DGGWSLGGAS IAMTELGELA AKM
|
| |