Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2538 |
Symbol | |
ID | 5706860 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2891511 |
End bp | 2892737 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641272001 |
Product | cytochrome P450 |
Protein accession | YP_001537371 |
Protein GI | 159038118 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2124] Cytochrome P450 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.107795 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000443731 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGAGA CTGCCTCCAG CCGGCTCACC GACACCGAGT TTCCGGTCCA GCGCGAATGC CCATTCGCCG AGCCCGTCGA GTACGAGCAG ATCAGGGAAC AATCGTCGAT CGCCATGGTC CGCCTGACGG GTGGTGGTGA GGCGTGGTGG ATCTCCGGAC ACGAGCAGGG GCGCGCCGTC CTGGCCGACC GACGGTTCTC CTCCGACCGC CGTAAGGCCA ACTTCCCGTT CGTCAGCACC GATCCGGCGA TAAGGAAACG GTTACACGCC CAGCCCCTGT CGCTGATCAG CATGGACGGC GCCGAGCACA CCCAGGCACG GCGGGCCCTC ATCGGCGAGT TCACCGTCCG GCGCCTGGCC GCGCTGCGAC CGCGGATCCA GCAAATCGTC GACCAGTGCA TCGACGAGAT GCTGACCACC GACCAGCACC GCGCCGATCT GGTCAAAACG CTGTCGCTGC CAGTGCCATC GCTGGTCATC TGTGAGTTGC TCGGCGTCCC CTATGCTGAC CACGACTTCT TCCAGGAACA CACCGCCACC TTGGTCCGCC GCAACACCGC ATCGGAGGTT CGACAACACA GCATCGACGA GCTGAACGCA TACCTCGGCG CGCTGATCGA CCGCAAGCTC GCCAGCCCCG ACGACGACCT GCTCGGTCGG CAGATCGCCA GACAACACCG GGACGGCACC TTCGATCGAT CGAGCATGGT CAGTCTGGCC TTCCTACTGC TCGTCGCCGG TCACGAAACC ACGGCGAACA TGATCTCCCT GGGCGTTGTC GGGCTGCTAC AGCATCCCGA GCAGTTGGCC ATGATCAAGG ACGACCCGGA CAAGACGCCG CTGGCGATCG AGGAACTGCT GCGCTTCTTC ACCATCGTCG ACAGTGTCAC CTCCCGCGTG GCCACCGAGG ACGTACGGTT CGGCGACACC ACCATCAACG CGGGCGACGG AGTGGTCGTC TCCGGACTGT CCGCCGACTG GGATCCCACG GTCTTCGCAG ACCCGGACCG ACTCGACCTC GAACGCGGCG CCCGCCACCA CCTTGCTTTC GGCTTCGGTC CGCACCAGTG CCTCGGCCAG AACCTGGCCC GCCTCGAGCT GCAGATTGTG TTCGACACAC TGTTCCACCG CATTCCCACC CTCCGCCTGG CCGCACCGCT CGACAAGATC CCGTTCAAGA CGGACGCGGC CATCTACGGC GCCCGGGAAC TCCCGGTCGC CTGGTGA
|
Protein sequence | MTETASSRLT DTEFPVQREC PFAEPVEYEQ IREQSSIAMV RLTGGGEAWW ISGHEQGRAV LADRRFSSDR RKANFPFVST DPAIRKRLHA QPLSLISMDG AEHTQARRAL IGEFTVRRLA ALRPRIQQIV DQCIDEMLTT DQHRADLVKT LSLPVPSLVI CELLGVPYAD HDFFQEHTAT LVRRNTASEV RQHSIDELNA YLGALIDRKL ASPDDDLLGR QIARQHRDGT FDRSSMVSLA FLLLVAGHET TANMISLGVV GLLQHPEQLA MIKDDPDKTP LAIEELLRFF TIVDSVTSRV ATEDVRFGDT TINAGDGVVV SGLSADWDPT VFADPDRLDL ERGARHHLAF GFGPHQCLGQ NLARLELQIV FDTLFHRIPT LRLAAPLDKI PFKTDAAIYG ARELPVAW
|
| |