Gene Information |
Locus tag | Sare_3149 |
Symbol | |
ID | 5706207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3586860 |
End bp | 3588086 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641272581 |
Product | cytochrome P450 |
Protein accession | YP_001537948 |
Protein GI | 159038695 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2124] Cytochrome P450 |
TIGRFAM ID | |
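
The coordinate and length fields above agree with each other if the start and end positions are read as inclusive coordinates and the final codon of the CDS is a stop codon. Both readings are assumptions; the record does not state them. A minimal Python sketch of that check follows, with hypothetical function names chosen for illustration:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3586860, 3588086)
print(length, protein_length(length))  # 1227 408
```

The printed values match the Gene Length (1227 bp) and Protein Length (408 aa) fields of this record.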
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.58668 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0958953 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCATCG GTCAAACCCT GCCGGACCTG GTCTACAGCC CGGAGTTCAC CCGTGACCCG TACGCGATCT TCGCCCGGCT GCGCGAGCAG GCCCCGGTCT GCCGCGTGAC GACCCACCGT GGGATGAGCG CCTGGATGGT GACCCGTCAC GCCGACGTGC GGGCGCTGCT CGCGGACAAC CGGTTGGCAA AGGACGGCAA CCGAATCGGC GAGTTGATGC CCCGGCACAG CACGCTCACG GGTGCGGCCA CCGGGTTCCC GCCCGGACTG ACGACCAACA TGGTCAACAG TGACCCGCCC GACCACACCC GGCTGCGGCA CCTGGTCGGC CGCGAGTTCA CCGGGCACCG CGTCGAGGGC CTGCGCCCGC GGATCGAGGA GATCGTTGAC GACCTGCTCG ACGGCGTCGC CGCCTGCGGG GACGAGGCTG ACCTGGCGGA GACCCTCGCA CGGCGCCTGC CGATCGCGGT GATCGGCGAA CTGCTCGGCG TGCCCGAAGC CGACCGCGCG GAGTTCTTCC GCTGGGCCGA CACCCTGTAC GGCGGCACTG CGTCACCGGA AGCGCTGGGC CAGGCGTACA ACGCGATCGT CGACTACCTC GGCCGGCTCT GCGACGCCAA ACGTGACGTG CCCGCCGACG ACCTGCTCAC CGCGCTGGTG CAGGTCAGCG CCGACGAGGA CCGGCTGTCA CGCGAGGAAC TCGTGTCGAT GGCCCTGCTG CTGTTGGTGG CCGGGCACGA GACGACCAGC AAGCAGATCA GCAACGGGGT GCTGGCCCTG CTGCTCAACC CGGAGCAGCT GAAGCTGCTG AAGGCGCAAC CCGCGCGAAC CGCCGGTGCG GTCGAGGAAC TGCTGCGGTT CGAGGGCCCG AGCCTCTCGG CCAGCCTGCG CTTCACCACC GAGCCGGTGG AGGTAGCCGG TGTGGTCATC CCCGAGGGGG AGTTCGTCCT GCTGTCGCTG GCGTCGGGCA ACCGTGACCC GGAGAAGTTC CCCGACCCCG ACCGGCTCGA CATCACCCGC TCCACCCAGG GCAATCTGGC AATGGGACAC GGCATCCACC ACTGTGTCGG CGCTGCCCTC GCCCGCCTCG AACTGGAGAT CGTCCTCAGC CGTCTGGTGG CGCGGTTCCC GCAGATGCAA CTGGCCGTCG AGGCGGATGA CCTTGAGTGG CTGGTGAATT CCTTCTTTCG CGCGCCCCTG CACCTGCCGG TGTCACTCCG GCGGTGA
|
Protein sequence | MTIGQTLPDL VYSPEFTRDP YAIFARLREQ APVCRVTTHR GMSAWMVTRH ADVRALLADN RLAKDGNRIG ELMPRHSTLT GAATGFPPGL TTNMVNSDPP DHTRLRHLVG REFTGHRVEG LRPRIEEIVD DLLDGVAACG DEADLAETLA RRLPIAVIGE LLGVPEADRA EFFRWADTLY GGTASPEALG QAYNAIVDYL GRLCDAKRDV PADDLLTALV QVSADEDRLS REELVSMALL LLVAGHETTS KQISNGVLAL LLNPEQLKLL KAQPARTAGA VEELLRFEGP SLSASLRFTT EPVEVAGVVI PEGEFVLLSL ASGNRDPEKF PDPDRLDITR STQGNLAMGH GIHHCVGAAL ARLELEIVLS RLVARFPQMQ LAVEADDLEW LVNSFFRAPL HLPVSLRR
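
Both sequences above are displayed in space-separated blocks of 10 characters, so the spaces need to be removed before any analysis. The sketch below is one possible way to do that in Python and to compute a GC fraction; the helper names are hypothetical and not part of any database tool. It only uses the first two blocks and the last block of the gene sequence, copied from the record, to show that the CDS begins with GTG (an alternative start codon in translation table 11, read as methionine at the start position) and ends with the TGA stop codon.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks and last block of the gene sequence shown above.
head = clean_sequence("GTGACCATCG GTCAAACCCT")
tail = clean_sequence("GCGGTGA")

print(head[:3], tail[-3:])  # GTG TGA
```

Applying `gc_fraction` to the full cleaned gene sequence would be expected to give a value close to the 70% listed in the GC content field.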