Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0991 |
Symbol | |
ID | 5707531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1114181 |
End bp | 1115194 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641270506 |
Product | cysteine synthase |
Protein accession | YP_001535893 |
Protein GI | 159036640 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.145876 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.133112 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCTACG CGCCACGTCG CTACCCGCCC GCTGAGGACG ACTCCGCCGT GGCGCGGTAC GACACTCTGC TGGACGCCTG TGGCGGCACG CCCCTGGTCG GGCTGCCCCG GCTCGCGCCG GCGGTGCCCG AGGGGGCGCC GCCGGTGCGG CTGTGGGCGA AATTGGAGGA CCGAAACCCG ACCGGCAGCA TCAAGGACCG GGCCGCGTTG TTCATGGTCC GGGCGGCCGA GGAGGCGGGC CGGCTCCGGC CGGGTGCCAC GATCTGCGAG CCCACCAGCG GGAACACCGG CATCGCGTTG GCCATGGTGG CGAAGCTGCG CGGATACCGC CTGGTGTGTG TGCTGCCGGA GAACGTCTCC ACCGAGCGGG TGCAGCTGCT GCGCATGTAC GGCGCGGAAA TCATCTTCTC GCCGGCGGCG GGCGGCTCGA ACCAGGCCGT CGCCACTGCC AAGCAGGTCG CCGCCGAGCA CCCCGACTGG GTGCTGCTGT ACCAGTACGG CAACGAGGCC AATGCCCAGG CCCACTACCT GACCACCGGC CCCGAACTGC TGCAGGACCT GCCCACGATC ACCCACTTCG TGGCCGGGCT CGGCACCACG GGAACCTTGA TGGGTACCGG CCGGTACCTG CGCGAGAAGG TGGAGGGCGT TCAGGTCGTG GCCGCCGAGC CGCGCTACGG CGAGCTGGTG TACGGATTGC GCAACATCGA CGAGGGGTAC GTTCCGGAGC TGTACGACGC CTCGGTGCTC AGTCGGCGCT TCTCGGTCGG CACTCGGGAC GCCGTGCTGC GCAGCCGTCA GTTGGTCGAG GTGGAAGGGC TCTTCGCCGG GCTCTCCACC GGCGCGATCC TGCACGCGGC GTTGGCGATG GCGCACGAGG CGGCCCGGGA GGGGCGCCGC GCCGACGTGG CCTTCGTGGT TGCCGACGGC GGTTGGAAGT ACCTGTCCAC CGGAGCGTAC GGCGGCACCC TCGCCGAGGC CGAGGAGGCT CTGGAAGGGC AGCTCTGGGC CTGA
|
Protein sequence | MAYAPRRYPP AEDDSAVARY DTLLDACGGT PLVGLPRLAP AVPEGAPPVR LWAKLEDRNP TGSIKDRAAL FMVRAAEEAG RLRPGATICE PTSGNTGIAL AMVAKLRGYR LVCVLPENVS TERVQLLRMY GAEIIFSPAA GGSNQAVATA KQVAAEHPDW VLLYQYGNEA NAQAHYLTTG PELLQDLPTI THFVAGLGTT GTLMGTGRYL REKVEGVQVV AAEPRYGELV YGLRNIDEGY VPELYDASVL SRRFSVGTRD AVLRSRQLVE VEGLFAGLST GAILHAALAM AHEAAREGRR ADVAFVVADG GWKYLSTGAY GGTLAEAEEA LEGQLWA
|
| |