Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3533 |
Symbol | |
ID | 5704601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4074175 |
End bp | 4075326 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641272960 |
Product | cysteine desulfurase |
Protein accession | YP_001538326 |
Protein GI | 159039073 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000228802 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGAGCACCT CTCCGGTCTA CCTGGACGCC GCCTCCGCCG CACCGCTGCA CCCCGTCGCA CGGCAGGCGC TGCTGGCCGC GCTCGACGAC GGCTGGGCCG ACCCGCAACG GCTCTACACC CAGGCCCGCC GCGCACGGCA ACTCCTCGAC GCGGCGCGGG AGGCCGCCGC GGCCACGCTC GGCGTACGGC CGGACGAACT CTCCTTCGCC CCGAGCGGTA CTGCTGCCGC GCATTCGGCC GTCCTCGGCG GGCTCACCGG ACGTCGCCGG GTCGGTTCCG GCCTCGTGCA CTCGGCGATC GAGCACTCAG CGGTACTACA TGCCGCCGAT CGGCATGCGG CCGGCGGCGG CGCGGTGACG TCGGTCCCGG TGGATCGGAT CGGCCGCATC GACCCGGATA CCTGGTCCGC GGCGGTTCGG GCGCCCGGCG TGGCGCTCGC CGCACTGATC GCCGCGAGTC ACGAAGTGGG CACGGTGCAG CCCGTCGCCG CGGCGGGCGC CGCCTGCGCC GCGGCCGGGG TACCGCTCTA CGTTGACGCA GCGCAGGTGG TCGGGCACGG GCCGGTGCCG GTCGGCTGGT CGCTGCTGAC CGCGAGTGCC CACAAGTGGG GCGGGCCGCC GGGAGTCGGG CTGCTGGTGG TTCGCAAGGG CACCCGCTGG GAGTCGCCGT GGCCGGTGGA CGAACGCGAG GCCGGGCGTG TCCCGGGAGT GGTGAACCTG CCGGCGGTCG TCGCGGCGGC GGCGAGCCTG CGCGCGGCTG CCGCCGACGC GGACGCGCGG GCGGCCCGAC TCACCCCCCT GGTGGATCGG ATCCGTACCC GGGTGGTGAC GGACGTACCG GACGTGGAGG TGGTCGGCGA TCCCGATCAC CGGCTCCCCC ACCTGGTGAC CTTCTCCTGC CTGTACGTCG ACGGTGAGGC GCTGCTGCAG GCGCTGGACC GGCGGGGCTT CGCCGTCTCC TCCGGGTCGT CGTGCACGTC GTCGACGCTG CGTCCGTCGC ACGTGCTGGC GGCGATGGGG GTGCTGTCGC ACGGCAATGT TCGGGTCTCG CTGCACCGGG ACACCACCGA GGCCGAGGTG GAACGGTTCC TGGCCGAGTT GCCGGGGGTC GTGGCTGAGC TGCGGGCCGA GGCGGGCGTG GTGGGGCTGT GA
|
Protein sequence | MSTSPVYLDA ASAAPLHPVA RQALLAALDD GWADPQRLYT QARRARQLLD AAREAAAATL GVRPDELSFA PSGTAAAHSA VLGGLTGRRR VGSGLVHSAI EHSAVLHAAD RHAAGGGAVT SVPVDRIGRI DPDTWSAAVR APGVALAALI AASHEVGTVQ PVAAAGAACA AAGVPLYVDA AQVVGHGPVP VGWSLLTASA HKWGGPPGVG LLVVRKGTRW ESPWPVDERE AGRVPGVVNL PAVVAAAASL RAAAADADAR AARLTPLVDR IRTRVVTDVP DVEVVGDPDH RLPHLVTFSC LYVDGEALLQ ALDRRGFAVS SGSSCTSSTL RPSHVLAAMG VLSHGNVRVS LHRDTTEAEV ERFLAELPGV VAELRAEAGV VGL
|
| |