Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1038 |
Symbol | |
ID | 5706537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 1161889 |
End bp | 1162866 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641270554 |
Product | saccharopine dehydrogenase |
Protein accession | YP_001535938 |
Protein GI | 159036685 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1748] Saccharopine dehydrogenase and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.264702 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGAG TCCTCGTCCT CGGCGGATAC GGCGCCGTTG GCCTGCACGC CGTGACCGCG CTGGTCGCGC ACCTCCCCGC GACGAACGTG GTGGTGGCCG GCCGCAACCC ACACCGCGCG CCCCGCGTGC CGGGCTCCAC CGCGGTCCGC CTGGACGCCG CCGACCCCGG CGACCTGTCC ACCGCACTCA ACGGGGTCGA TGCGGTACTC ATGTGTGCCG AACTCGACAA CGCACGCGTC GCCCACGCCT GCCTGGAGCG AGGAATCCAC TACGTGGACG TCTCGGCGTC CCACCACCTG CACGTCGAGA TCGAACAGTT GGACGAGCTC GCCTCCCAAC GGCAGGCCAC GGCGGCCCTC AGCGTCGGGC TGGTCCCCGG GGTCAGCAAC CTGCTCGCCC GACACTGCGT CGAACAGTCA ACAACCCGGC AGGTGCACAT CGGCGTGCTG CTCGGTTCCG GCGAACGGCA CGGACCGGCG GCACTCGCCT GGACTCTCGA CGGGCTGGGC CGCCTGGAGG GCACGTGGAC GATGCGATTT CCGGCTCCGT ACGGCGAACG AACCGTCCAT CGGTTCCCGT TCTCCGACCA GTACACGCTG TCCAGCACCC TGGATGTCGC CGCGAGCACC GGCCTGTGCC TGGACTCCCG GCTCGCCACC GCGCTGTTGG CAGCCGCCGG GCGGCCCGGC ATCGCCCGCT CGTTGCGTCG CCCCCGGATC CGCCGCATCG TGCTGGACGC GCTGGCCCGG ACCCATCTCG GCGGCGACGG ATTCGCCGTC ACCGTCGACT CCGGTACCAG CCAGGCGAGC TTCAGCGGTC ACCAACAAAG CCGCGCCACC GGGCTCGCCG CGGCGCTACT CGTCCGAGAC CTGCCCGCTC TGCCATCCGG CGTCCGGCAC ATCGAGCACC TCGTGGAGCC GGAAGCCTTC CTCACCGAAC TCGCCGCCAG CGGATTCCTG CTCGACCTTC GGAACTGA
|
Protein sequence | MNRVLVLGGY GAVGLHAVTA LVAHLPATNV VVAGRNPHRA PRVPGSTAVR LDAADPGDLS TALNGVDAVL MCAELDNARV AHACLERGIH YVDVSASHHL HVEIEQLDEL ASQRQATAAL SVGLVPGVSN LLARHCVEQS TTRQVHIGVL LGSGERHGPA ALAWTLDGLG RLEGTWTMRF PAPYGERTVH RFPFSDQYTL SSTLDVAAST GLCLDSRLAT ALLAAAGRPG IARSLRRPRI RRIVLDALAR THLGGDGFAV TVDSGTSQAS FSGHQQSRAT GLAAALLVRD LPALPSGVRH IEHLVEPEAF LTELAASGFL LDLRN
|
| |