Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Spro_4858 |
Symbol | |
ID | 5604032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | - |
Start bp | 5379676 |
End bp | 5380566 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640940431 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001481079 |
Protein GI | 157373090 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.610122 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00520466 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACCCG ACCTGCAGGA TCGTGTTGCG ATTATTACCG GAGCCGGAGG CGGGCTTGGC CGAGCACACG CACATCTGCT AGCCCGTTAT GGTGCGAAAG TTGTGATCAA TGATATCGGG CTGGGTGCCG ACAAGGTTGC CGAAGAAATC AACTCATCAG GAGGGGATGC CATTAGTTTC CAAGCCTCTG TGACTGATGA AGAGAAAATA AAGGCCATGG TCACGGAAAC TATAGAACGG TGGGGAAGGG TCGATATTCT GGTCAACAAT GCAGGCAACC TGCGTGACAA GACTTTTGCC AAAATGACGT TAGATGATTT TCGCTCTGTG TTGGACGTAC ATCTAATGGG GGCAGTGATT TGCACCAAGG CCGTATGGGA GCATATGCAT CGCCAAAAGT ATGGGCGCAT CGTGTTCACG ACGTCCTCTT CTGGTCTTTA TGGCAATTTC GGGCAGGCCA ATTATGCAGC GGCGAAAATG GGGCTGGTGG GGCTGATGCA AACACTGGCA CTCGAAGGAG CCAAGAGTAA CATTTTGGTG AATTGTTTGG CGCCTACTGC AGCGACCGCT ATGAGCGATG GGCTTTTGAT GCCGCAAGCA AAGCGTGTAT TACAACCTGA AGCAGTGAGC CCGGCACTGT TGCCGCTGGT TTCGCAGTGT GCTCCTACCA GGGCTATTGT TTGCGCTGGT GCGGGTAATT TTGCGCGCGC CTACATAACG TTGACCGGTG GACGCTTTTT GGGCAACGAC GCACATACCA TCAATCGTAT TTTCGCCGAC TGGCAAGTGA TCGGCGATCG CGAATCAGAG CAAGTTCCTG AGTCTGCCAC TGCGCAGGTT CATCAAGAAC TCATCGCTGC TTTCCGTTCA GAATCCGAAA ATAACCAGTG A
|
Protein sequence | MKPDLQDRVA IITGAGGGLG RAHAHLLARY GAKVVINDIG LGADKVAEEI NSSGGDAISF QASVTDEEKI KAMVTETIER WGRVDILVNN AGNLRDKTFA KMTLDDFRSV LDVHLMGAVI CTKAVWEHMH RQKYGRIVFT TSSSGLYGNF GQANYAAAKM GLVGLMQTLA LEGAKSNILV NCLAPTAATA MSDGLLMPQA KRVLQPEAVS PALLPLVSQC APTRAIVCAG AGNFARAYIT LTGGRFLGND AHTINRIFAD WQVIGDRESE QVPESATAQV HQELIAAFRS ESENNQ
|
| |