Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1756 |
Symbol | |
ID | 5705083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2028746 |
End bp | 2029957 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641271259 |
Product | cysteine desulfurase family protein |
Protein accession | YP_001536634 |
Protein GI | 159037381 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01976] cysteine desulfurase family protein, VC1184 subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.115577 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0371464 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGTTCG ACATCGCCCG CGCCCGGGCC GCCTATCCCG CCCTGGCCGA GGGACACGTC CACTTCGACG GTGCCGGTGG CACCCAGACC GCCGCGCCGG TGATCGCCGC GGTGGCCGAG ACGATGGGTA CGGCGCTCGG CAACCGCAGT GGTGGCAACC TACCCGGCCG ACGCTCGGTG GAACTGGTGT CCGCCGCCCG GACGGCCGTG GCCGACCTGC TCGGCGCGGT CCCGGAGGGG GTGGTGCTGG GCCCGAGCGC GACCGCGCTG ACGTACACCC TGGCCCGCGC CCTCGGGGCG ACCTGGCGGC CGGGCGACGA GGTGGTGGTG TCCCGACTCG ACCACGATGC CAACGTTCGG CCGTGGATCC AGGCGGCCGA GGCGGCCGGC GCGACGGTAC GGTGGGCCGA GTTCGACGAG CACACCGGCG AACTGCCCGC CGGCCAGTAC GCCGACCTGG TCAACGAGCG GACCCGGCTG GTGGCGGTCA CGGCCGGCAG CAACGCGATC GGCACGATCC CGGACGTGGC GGCGATCGCC AAGTCGGCTC ACGCCGCCGG CGCGTTGGTC TGCGTGGACG GCGTGCACTC GGTACCGCAC GGTCCGACCG ACCTCACCGC GCTGGGAGCG GACTTCCTCG TCACCAGTGC CTACAAGTGG TCCGGCCCGC ACCTGGCCGC GGTAGCAGCG GACCCGACGT GCTGGCAGCA CCTGCACCCG GCGAAGCTGC GCCCCTCCGC CGACACGGTG CCCGACCGGT TCGAGTACGG CACGCCCAGC TTTCCCCTGT TGGCCGGGGT GGCCGTCGCC GTGGACCACC TCGCCGGGCT GGACCCGACG GCTACCGGAA CCCGGCGGGA GCGGCTGCGG ACCAGTCTGA GCGCAGTCCG CACGTACGAG GAGGGACTGT TGGACCGGCT GCTCGACGGT CTCGCCGCGG TGTCCGGGGT CACCGTGCTC GGCTCACCGG GCCGGCGCTG CCCCACGGTC TCGTTCCGGT TGGCCGGCCG GTCTCCGGCC GACACCCAGG CGGCGCTGGG CGCGGCGGGG GTCTGCCTGT CCGCCGGCGA CTACTACGCC TACGAGTACT TCCAGACGTT GGGACTGCGG GACAGCGGCG GGGCGGTGCG GGTCAGCCTG TACCACTACA ACACCGTCGC CGAGGTGGAT CGCCTGCTCA ACGAGTTGGC GACCCTGACC ACCGGCGGCT GA
|
Protein sequence | MPFDIARARA AYPALAEGHV HFDGAGGTQT AAPVIAAVAE TMGTALGNRS GGNLPGRRSV ELVSAARTAV ADLLGAVPEG VVLGPSATAL TYTLARALGA TWRPGDEVVV SRLDHDANVR PWIQAAEAAG ATVRWAEFDE HTGELPAGQY ADLVNERTRL VAVTAGSNAI GTIPDVAAIA KSAHAAGALV CVDGVHSVPH GPTDLTALGA DFLVTSAYKW SGPHLAAVAA DPTCWQHLHP AKLRPSADTV PDRFEYGTPS FPLLAGVAVA VDHLAGLDPT ATGTRRERLR TSLSAVRTYE EGLLDRLLDG LAAVSGVTVL GSPGRRCPTV SFRLAGRSPA DTQAALGAAG VCLSAGDYYA YEYFQTLGLR DSGGAVRVSL YHYNTVAEVD RLLNELATLT TGG
|
| |