Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3003 |
Symbol | |
ID | 5707613 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3410838 |
End bp | 3412097 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641272450 |
Product | mandelate racemase/muconate lactonizing protein |
Protein accession | YP_001537818 |
Protein GI | 159038565 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000677382 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCGCATCG CCGATCTCCG GGCGACCACG GTCACCGTGC CGCTCGAAGC CTCCCTGCGC CACAGCAACG GCGCTCACTG GGGACGCTTT GTCCGCACCA TCATCGAGAT CGAAAGCGAC AACGGGCTCG TCGGCATCGG AGAAATGGGC GGCGGCGGCC AGAGCGCCGA GGCTGCCGTG GTGTCACTGC GAGATTATCT CGTCGGGCAC GACCCGGCCC GCACCGAGGT CCTGCGATTC ATGCTCGCCA ATCCAACTGC AAGCCTCTAC AACAACCGCA CCCAGCTGCT GGCCGCCGTG GAGTTCGCCT GTCTCGACCT GCAGGGCCAG CACCTCGGCG TCCCCGTGCA CGTCCTGCTG GGCGGCAAGA TCCGCGAACG GATCCCGTTC GCAAGCTACC TGTTCTTCCG CTACCCCGAC GCATCCGGCA AGGGCGAGGT TCGCACAGCT GAGCAGCTCG TCGCGCACGC GCGAGCCCTC AAGGACAAAC ACGGTTTCAC CATCCACAAG CTCAAGGGTG GCGTCTTCCC ACCCGACTAC GAACTCGAGT GCTACCGGGC GCTGGCTGAG GCGTTCCCGA AGGACCGCCT TCGGTACGAC CCGAACGGAG CGTTGAGCGT CGAGGAGGGC ATCCGCTTCG CCAAGGCCAT CGAGTCCGTG AACAACGACT ACCTTGAGGA CCCGGCGTTC GGGCTCAACG GGCTTCGCCG GATCCGCGAG AAGACGTCGA TCCCGATCGC GACCAACACC GTCGTGGTCA ACTTCGAGCA ACTGGCGACG ACAGTGCGCG ACCCCGCGGT CGACGTTGTG CTTCTCGACA CGACCTTCTG GGGCGGCATT CGCGCGTGCA TCCGGGCTGC GGCGGTCTGC GAGACGTTCC AGATCGGCGT TGCCGTGCAC TCCTCGGGAG AGCTGGGCAT CCAGCTCGCG TCGATGCTGC ACCTTGGTGC GGTGGTCCCC AACCTCACCT TCGCGGCGGA CGCGCACTAC CACCACCTGC TCGACGACGT GATCGAGGGC GGCAGGATGA CCTACCACGA CGGTGCCATC ACCGCGCCGG ACACGCCAGG GCTCGGCGTC CGCCTCGACG AGGCCAAGGT CCGCAAGTAC GCCGACCTCT ACCAGGAACT CGGCGGATAC CCGTACGACC GCGACCCCGG CCGCCCGGGC TGGTTCCCGC TGCTTCCCAA CTCCGACTTC GCCGACCCGA CAGTGACGCA GCTTCCGCTC ACCTCACCCG GAAGGCATGA GCCGCGATGA
|
Protein sequence | MRIADLRATT VTVPLEASLR HSNGAHWGRF VRTIIEIESD NGLVGIGEMG GGGQSAEAAV VSLRDYLVGH DPARTEVLRF MLANPTASLY NNRTQLLAAV EFACLDLQGQ HLGVPVHVLL GGKIRERIPF ASYLFFRYPD ASGKGEVRTA EQLVAHARAL KDKHGFTIHK LKGGVFPPDY ELECYRALAE AFPKDRLRYD PNGALSVEEG IRFAKAIESV NNDYLEDPAF GLNGLRRIRE KTSIPIATNT VVVNFEQLAT TVRDPAVDVV LLDTTFWGGI RACIRAAAVC ETFQIGVAVH SSGELGIQLA SMLHLGAVVP NLTFAADAHY HHLLDDVIEG GRMTYHDGAI TAPDTPGLGV RLDEAKVRKY ADLYQELGGY PYDRDPGRPG WFPLLPNSDF ADPTVTQLPL TSPGRHEPR
|
| |