Gene Information |
Locus tag | Sare_4405 |
Symbol | |
ID | 5703454 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4977949 |
End bp | 4979157 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641273824 |
Product | hypothetical protein |
Protein accession | YP_001539173 |
Protein GI | 159039920 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3616] Predicted amino acid aldolase or racemase |
TIGRFAM ID | |
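The coordinates and lengths in this table can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(4977949, 4979157)
print(length_bp, protein_length(length_bp))  # prints: 1209 402
```

Under these assumptions the result agrees with the Gene Length (1209 bp) and Protein Length (402 aa) rows.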
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.443829 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0267897 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCACCG ACAGTTCGGA TCTGCGCGCT CGGCTCGACC GGGCGACCGC TCACCTCGAC CCGCCGTACG CGGTGGTCGA CCTCAGGGCG TTCGATGCGA ACGCCGCCGC CCTCGCCAGT CGCGCCGCCG GTAAGCCGGT CCGCGTCGCC AGCAAGTCGA TACGCTGCCG GACGCTGATC TCCCGGGCCC TGACCGCCCC CGGCTGGGCG GGTGTGCTGA CGTTCACCCT GCCCGAGGCG CTCTGGCTGG TCCGCTGCGG GGTAACCGAC GACGCGGTGG TGGCGTACCC CACGGCCGAT CGAGCCGCGC TCGCCGAGCT GGCCGGTGAT CCGACGCTCG CCGCCGCGGT GACCCTGATG GTCGACGACA CCGCCCAGCT CGACCTGGTG GACGCCGTCA GTGCCCCGGG GCAGCGGCCC GAGCTGCGGG TCTGCCTCGA CCTGGACGCC TCCTGGCGAC CGCTGGGCGG CCGGCTGCAC GTCGGGGTCC GCCGCTCGCC GGTGCACGAT CCGCGGGCGG CCGGCGCGCT CGCCGCCGCC GTCGCCGCCC GGCCCGGGTT CCGGCTGGTC GGGCTCATGG CGTACGAGGC TCAGATCGCC GGGCTGGGCG ACGCGCCACC GAAACGGGCA GTGCTCGGCG CGGCGATCCG ACTGGCCCAG CGCGGGTCGT ACCGGGAGTT GCTGGCCCGC CGGAGTGCGG CGGTCGCGGC GGTACGCGAG CACGCCGAGC TGGAGTTCGT CAACGGTGGC GGCACCGGCA GCGTGGCCGC CACCAGCGCC GATCCCGCGG TCACCGAGGT GACCGCGGGG TCCGGGCTGT ACGGGCCGAC GCTGTTCGAT GCCTACCGGG CCTGGCGCCC GACCCCCGCC GCGTACTTCG CCTGCTCGGT GGTCCGCCGG CCAGCACCCG GCTACGCCAC TGTGCTCGGC GCCGGCTGGA TCGCCTCCGG ACCGGCCCAA CGGAGTCGGC TTCCCCGCCC CGTCCTACCG GCCGGCCTCC AGTTGGTCGA CGCCGAGGGC GCCGGCGAGG TGCAAACCCC GCTGACCGGC CGGGCAGCCG GCTCGCTACG GGTCGGCGAC CGGGTCTGGT TCCGGCACGC CAAGGCCGGT GAACTCGCCG AGCACGTCAA CGAGCTGCAT CTGGTGGAGG CCGACACCGC CGGGGCGGCC GCCGCCACGT ACCGGGGCGA GGGACGGGCG TTCCTCTGA
|
Protein sequence | MATDSSDLRA RLDRATAHLD PPYAVVDLRA FDANAAALAS RAAGKPVRVA SKSIRCRTLI SRALTAPGWA GVLTFTLPEA LWLVRCGVTD DAVVAYPTAD RAALAELAGD PTLAAAVTLM VDDTAQLDLV DAVSAPGQRP ELRVCLDLDA SWRPLGGRLH VGVRRSPVHD PRAAGALAAA VAARPGFRLV GLMAYEAQIA GLGDAPPKRA VLGAAIRLAQ RGSYRELLAR RSAAVAAVRE HAELEFVNGG GTGSVAATSA DPAVTEVTAG SGLYGPTLFD AYRAWRPTPA AYFACSVVRR PAPGYATVLG AGWIASGPAQ RSRLPRPVLP AGLQLVDAEG AGEVQTPLTG RAAGSLRVGD RVWFRHAKAG ELAEHVNELH LVEADTAGAA AATYRGEGRA FL
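The sequences above are printed in space-separated blocks of ten residues, and the gene begins with GTG although the protein begins with M, which is expected for an alternative start codon under translation table 11. The sketch below, in Python, shows one way to strip the block spacing, compute a GC percentage, and check the start codon. The helper names are hypothetical, and the start-codon set lists only the most common bacterial starts (table 11 permits a few others).

```python
# Most common bacterial start codons (assumption: a subset of those
# permitted by translation table 11; all are read as Met at initiation).
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    gc = sum(1 for base in s if base in "GC")
    return 100.0 * gc / len(s)


def has_common_start(seq: str) -> bool:
    """True if the first codon is one of the common bacterial start codons."""
    return clean(seq)[:3] in COMMON_START_CODONS


# First two blocks of the gene sequence from this record, as an example input.
first_blocks = "GTGGCCACCG ACAGTTCGGA"
print(len(clean(first_blocks)), has_common_start(first_blocks))
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the GC content row.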