Gene Information |
Locus tag | Sare_2105 |
Symbol | |
ID | 5704719 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2426466 |
End bp | 2427689 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641271590 |
Product | cytochrome P450 |
Protein accession | YP_001536961 |
Protein GI | 159037708 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2124] Cytochrome P450 |
TIGRFAM ID | |
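
The coordinate, length, and protein-length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical and not part of any database API. The strand (here `-`) does not affect the length calculation.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(2426466, 2427689)
print(length_bp)                  # 1224
print(protein_length(length_bp))  # 407
```
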
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.34341 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0115341 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAAAT CAATGCCGGT TCAGGACCTG CCGGCGTTCC CGATCCCCCG GGAGTGCCCG TACCGGCCCT CGGCGCAGCA CGTGTCACTG CGATCCGGCG GCCCGATGGC GAAAGTGCGG CTCTACAACG GCCGCACCGC GTGGCTGGTG ACCGACTCCG CGCATGCGCG CGCGGTCCTG TCTGACTATC GCCGTGTGTC GATCAAGCCC TACCACGGCA ACTACCCACT CCTGAACGAG GAGTTCGAGA AGGTCGTCGA CAGCGGGTAC GCGGACGTGT TGTTCGGCGT CGACCCGCCC GAGCACACCC GCCAACGACA GATGATCATG CCGAGCTTCA CGTTGCGGCG AACGGCGGTG CTCCGCCCGG ACATCCAGCG CATCGTTGAC GAAAAGCTCG ACGAGATGAT GCGCCACGGC GCCCCCGGCG ACCTGGTCAC CGAATTCGCC CAGCCCGTGC CGTCGATGGT GATGAGTTTC CTGCTCGGCG TTCCGTGGGA GGACCACGAG GAGTTCGAGA CCCCGGCGCA CAAGCTGTTC GTCCCGGAAC TCGCCGAGGA GGCAACCACC GAACTCGGCG CATACCTCGA ACGGCTGATC CAGAAGAAGG AACAGCCTGG TGGAACCCCC GGCGGGACCG GCCTGCTCGA CGACCTGATC CGGGATCACC TGCGGGCCGG CGCGCTGAGC CGGGACGAAC TCGTCCACAT CGCGATGGCG ATGCTGGTCG CCGGCACCGA CACGACCACC AATGTGATCT CCCTCGGCAC GCTCGCGCTG CTGGACAACC CGGACCAGTG GGCGGCCCTG CGCGACAACC CGGACGAGCT GATCCCCGGC GCGGTCGAGG AGATCCTGCG GTACACATCA CTGATCGAGG CGTTCGCCCG CGTCGCGGTG TCGGACATCG AGTTGAACGG TGCTGTCATC AAGGAGGGCG AGGGCATCCT GATCAGCTCC GCGGGCGTCA ACTTCGACCC GGCGCTGGCA CCGGACCCGG GCCGGTTCGA CATCCGCCGC CCACCCCGCC CAAGCTTCTC GTTCAGCCAC GGCATCCACC GCTGCCCAGG CGACAACCTG GCCCGCCTCG AACTCGAGAT TGCGTTTCGG AGCCTGGTCA CCCGCATGCC GAACCTCCGC ACCGCCAAGC CGATCGACCA GATTCCCAGC AACAACAACG ACGGGACGTT GCAGCGGCTG TACGAGCTCC CGGTTGTCTG GTAG
|
Protein sequence | MTKSMPVQDL PAFPIPRECP YRPSAQHVSL RSGGPMAKVR LYNGRTAWLV TDSAHARAVL SDYRRVSIKP YHGNYPLLNE EFEKVVDSGY ADVLFGVDPP EHTRQRQMIM PSFTLRRTAV LRPDIQRIVD EKLDEMMRHG APGDLVTEFA QPVPSMVMSF LLGVPWEDHE EFETPAHKLF VPELAEEATT ELGAYLERLI QKKEQPGGTP GGTGLLDDLI RDHLRAGALS RDELVHIAMA MLVAGTDTTT NVISLGTLAL LDNPDQWAAL RDNPDELIPG AVEEILRYTS LIEAFARVAV SDIELNGAVI KEGEGILISS AGVNFDPALA PDPGRFDIRR PPRPSFSFSH GIHRCPGDNL ARLELEIAFR SLVTRMPNLR TAKPIDQIPS NNNDGTLQRL YELPVVW
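
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to run basic sanity checks on a coding sequence. It is an illustration under stated assumptions: the helper names (`normalize`, `gc_percent`, `looks_like_complete_cds`) are hypothetical, the start-codon check accepts only `ATG` (translation table 11 also permits alternative starts such as `GTG` and `TTG`), and the demo input is a short toy sequence rather than the full gene.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize(seq: str) -> str:
    """Remove all whitespace (the 10-character block spacing) and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = normalize(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if length is a multiple of 3, starts with ATG, and ends in a stop codon."""
    seq = normalize(seq)
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# Toy input in the same block-spaced layout as the record (not the real gene).
toy = "ATGACG AAATAG"
print(normalize(toy))                # ATGACGAAATAG
print(looks_like_complete_cds(toy))  # True
```

Passing the full `Gene sequence` value to `gc_percent` gives a figure that can be compared with the `GC content` field, and `len(normalize(...))` can be compared with `Gene Length`.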