Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4271 |
Symbol | |
ID | 5705776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4847338 |
End bp | 4848288 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641273690 |
Product | luciferase family protein |
Protein accession | YP_001539043 |
Protein GI | 159039790 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03560] probable F420-dependent oxidoreductase, Rv1855c family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0653657 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0110143 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGTCT CCGTGTTCAC CGAGCCACAC CGCGGAGCCA GCTACGACGA CCAGCTCCGG TTCGCACGCC ACACGGAGAG CTGCGGCTAC GACGGATTCC TCCGCGCCGA CCACTACCGG GCGATGGGTG ACGAGCCAGC GCTACCCGGC CCGACGGATG CCTGGCTGAC ACTGGCCGCT CTGGCTCGCG AGACCAGCCG TATCCGGCTC GGAACGCTGG TCACGTCGGC GACCTTCCGG CTACCCGGCC CGCTGGCCGT CATGGTGGCC CAGGTCGACC AGATGAGTGG TGGCCGAATC GAGCTGGGCA TCGGCGCCGG CTGGTACGAA CGGGAGCACA CCGCCTACGG GATCCCGTTC CCCGGTGTCA GGGAACGGTT CGACCGGCTG GCCGAGCAAC TCGAGATCGT CACCGGGCTG TGGCGTACCC CTTCAGAAGG CAGGTTCAAC TACAGCGGAG AGCACTATCA GCTCGTTGAC GCGCCAGCGT TGCCGAAGCC GGTCCAGCGA CCCGGGCCAC CGGTGATCGT GGGCGGTCGC GGCCCGAAGC GTACCCCCGA GCTGGCCGCC CGGTACGCCG ACGAGTTCAA CATGCCGTTC ACCTCCGTCC CGCAGGCCGC AGCCGCGTAC GAGCGGGTCC GAGAGGCGTG CGACGCCGCC GGTCGGACCG ACTCCGGCCG GCAGCCGCTG GTGCTCTCGG CAGGAATCGT GGTGGCCGTT GGCCGCACGG ACGCCGAGGC ACGTCGCCGC GCCGCGCCAC TGCACCGGCC CAGCGCCCTG CCCCCGGAGG ACCCGGTCGT CGGTTCGCCC GCCCAACTCG TCGAACGGAT CGGTGAGTTC GCCGCGGTCG GAGCGACCCG CGTCCATCTG CGCCTCATCG ACCTCGCAGA CCTCGACCAC CTGGAGCTCA TCGCCGCCGA CGTGCTCCCC CAACTGGATG GACCACAATG A
|
Protein sequence | MRVSVFTEPH RGASYDDQLR FARHTESCGY DGFLRADHYR AMGDEPALPG PTDAWLTLAA LARETSRIRL GTLVTSATFR LPGPLAVMVA QVDQMSGGRI ELGIGAGWYE REHTAYGIPF PGVRERFDRL AEQLEIVTGL WRTPSEGRFN YSGEHYQLVD APALPKPVQR PGPPVIVGGR GPKRTPELAA RYADEFNMPF TSVPQAAAAY ERVREACDAA GRTDSGRQPL VLSAGIVVAV GRTDAEARRR AAPLHRPSAL PPEDPVVGSP AQLVERIGEF AAVGATRVHL RLIDLADLDH LELIAADVLP QLDGPQ
|
| |