Gene Information |
Locus tag | Sare_3900 |
Symbol | |
ID | 5705838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4440446 |
End bp | 4441501 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641273325 |
Product | 4-hydroxy-2-ketovalerate aldolase |
Protein accession | YP_001538682 |
Protein GI | 159039429 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR03217] 4-hydroxy-2-oxovalerate aldolase |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.577323 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACCCGGC TCTACCTACA GGATGTGACG CTGCGGGACG GGATGCACGC CATCGCCCAC CGCTACACCG CTGACCAGGT ACGCACGATC GCCGCCGCGC TCGACGCCGC CGGGATAGCC GCGATCGAGG TGGCGCACGG TGACGGGCTT GCCGGATCGA GTGTCAACTA CGGGCACGGC GCGGCCAGTG ACGCGGACTG GATCGCGGCG GCGGCCGAGG TGCTGACCAC CGCGCGGCTG ACCACGCTGC TGGTGCCCGG CATCGGCACC ATCGCCGATC TGAGGGCAGC GCGGGAACTT GGCGTGACCA GCGTGCGGAT CGCCACCCAC TGCACCGAGG CCGACATCTC CGCCCAGCAC ATCAGTTGGG CGCGGGAGAA CGGGATGGAC GTCTCCGGGT TTCTGATGAT GTCGCACATG AACGACCCGG CGGGACTGGC GGCGCAGGCC AAGCTGATGG AGTCGTACGG GGCGCACTGC GTCTACGTCA CCGACTCCGG CGGTCGGCTG CTGATGTCCG ACGTGGCCGA GCGGGTCGAC GCGTACCGTC AGGTGCTCGA ACCAGAGACG CAGATCGGCA TTCACGCCCA CCACAACCTG TCCCTCGGCG TGGCGAACAG CGTGATCGCC GTCGAACACG GCCGGATTCT CGGGGACGGG CCGTTGGGCG CTCCGGCCGG CCGAACCGTC CGGGTGGACG CCTCGCTCGC CGGGCAGGGC GCGGGCGCGG GTAATGCACC GCTCGAGGTC TTCGTCGCGG TCGCCGAGCT GCACGGCTGG GAGCACGGCT GCGACGTGTT CGCGCTGATG GATGCGGCCG AGGATCTGGT CCGGCCGTTG CAGGACCGAC CGGTGCGGGT TGATCGGGAG ACGCTCTCCC TGGGATACGC GGGCGTCTAC TCCAGCTTCC TGCGGCACGC CGAGCGGGCC GCCGAACGCT ACGGCGTGGA CGTCCGCTCG ATCCTGATCG AGCTGGGCCG GCGCCGGATG GTCGGTGGCC AGGAGGACAT GATCGTGGAC GTGGCGCTCG ACCTGGCCGG CAGGGAGAAG ACGTGA
Protein sequence | MTRLYLQDVT LRDGMHAIAH RYTADQVRTI AAALDAAGIA AIEVAHGDGL AGSSVNYGHG AASDADWIAA AAEVLTTARL TTLLVPGIGT IADLRAAREL GVTSVRIATH CTEADISAQH ISWARENGMD VSGFLMMSHM NDPAGLAAQA KLMESYGAHC VYVTDSGGRL LMSDVAERVD AYRQVLEPET QIGIHAHHNL SLGVANSVIA VEHGRILGDG PLGAPAGRTV RVDASLAGQG AGAGNAPLEV FVAVAELHGW EHGCDVFALM DAAEDLVRPL QDRPVRVDRE TLSLGYAGVY SSFLRHAERA AERYGVDVRS ILIELGRRRM VGGQEDMIVD VALDLAGREK T
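The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function and variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table; names are illustrative.
start_bp = 4440446
end_bp = 4441501
reported_gene_length = 1056     # bp
reported_protein_length = 351   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under these assumptions both comparisons agree with the reported Gene Length and Protein Length fields.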