Gene Information |
Locus tag | Sare_4902 |
Symbol | |
ID | 5707418 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5568641 |
End bp | 5569771 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641274297 |
Product | cytochrome P450 |
Protein accession | YP_001539642 |
Protein GI | 159040389 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2124] Cytochrome P450 |
TIGRFAM ID | |
|
|
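The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself): coordinates are 1-based
    and inclusive, and the CDS ends in exactly one stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the record: Start bp 5568641, End bp 5569771.
print(cds_lengths(5568641, 5569771))  # → (1131, 376)
```

Under these assumptions the result agrees with the listed Gene Length (1131 bp) and Protein Length (376 aa). The strand field does not affect the length calculation.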
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000546373 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGACAGCCG AGCCGACTCC GATTCCGCGC TCCGGCGCGC GACTCGGTCA GGAGTACGAC CAGCTTCGCA AGACCGGTGA CGTGCATCAG GTACTGCTTC CCGACGCCTC CCTGGCCTGG CTGGTCACCA ATCCGGAGGT GGCGGCCCGG GCGCTGGCGG ACCCCCGTTT GGCCCTGAAC CGTAGGAACA GCCGGGGTGG TTGGTCCGGT TTCGCGCTCC CGCCGGCCCT CGACGCGAAC CTGCTCAACC TGGACGCACC CGATCACACC CGGCTGCGCC GTCTGGTCGG CCCGGCGTTC AGCCCGCAAC GGGTCGCCGC GCTGCGTCCC GGCATCCGTC GTGCCGCCGA GCACCTCCTG GACACGCTCG TCGCCACGAG CGGGCCCACC GATCTGGTCA CCGGCTACTG CAACCCGCTG TCGGTCCAGG TCATCGCCGA TCTGATGGGC GTACCGGAGG CGGGACGGAC GAATCTGCGG GCCTGGACCG ACACGATGCT CACCAGCTAT CCACCGGACC GAGACGCGAT CCGGCGGGCG GTCACCGAAC TGCACGGGTA CGTCGTCGAC CTCATCGACA TCAAGCAGCA GCAACCCGGC GACGACCTGC TGAGTACGCT CGTCACCATC GAGCAGGACG GCGACCGGCT CAGCCGGGAC GAACTGACCT CACTGGCCTT CCTGATCCTG TTCGCCGGGT ATGAGAACAC CGCCAACCTG ATCGCCTCGG CCGTGCTGTG GCTGCTCGAC CACGGTGGGC TCAACGTGGT ACCCATCTCC GAGGCGATCG AGGCAACGCT GCGCCACGAG CCGCCCGCAC CCGTCGCCAT CCGTCGCTTT CCCACCGAGG ACATCATCAT CGGGGGCGTC ACGATTCCGG CCGGTGACAC CGTCCTGCTC AGTGTCGCCG CCGCCACCAG GGGCGCGGAC GGCAACGCCG CGAGGCTGGC CTTCGGCAAC GGCCCCCACT ACTGCCTCGG TGCGGCCCTG GCCCGGGTCG AGGCCGAGGA GGCACTCACC GTGCTGGCCC GTCGGCTACC CGGTCTGACG CTCGCCGTGC CCCCCTCCCA GGTCCGGTGG CGTCCGACGT TTCGCACCCA CGGCCCCGCC GAACTCCTGG TCGGCTGGTA G
|
Protein sequence | MTAEPTPIPR SGARLGQEYD QLRKTGDVHQ VLLPDASLAW LVTNPEVAAR ALADPRLALN RRNSRGGWSG FALPPALDAN LLNLDAPDHT RLRRLVGPAF SPQRVAALRP GIRRAAEHLL DTLVATSGPT DLVTGYCNPL SVQVIADLMG VPEAGRTNLR AWTDTMLTSY PPDRDAIRRA VTELHGYVVD LIDIKQQQPG DDLLSTLVTI EQDGDRLSRD ELTSLAFLIL FAGYENTANL IASAVLWLLD HGGLNVVPIS EAIEATLRHE PPAPVAIRRF PTEDIIIGGV TIPAGDTVLL SVAAATRGAD GNAARLAFGN GPHYCLGAAL ARVEAEEALT VLARRLPGLT LAVPPSQVRW RPTFRTHGPA ELLVGW
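The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any analysis. The following is a minimal sketch, not the database's own tooling: the helper names (`clean_sequence`, `gc_fraction`, `looks_like_complete_cds`) are hypothetical, and the start-codon set is a simplification that lists only the most common initiators for translation table 11. The toy input reuses the first block of the gene sequence followed by an invented ending, purely for illustration.

```python
# Stop codons for translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Simplification: only the most common start codons are listed here.
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Check length divisible by 3, a common start codon, and a stop codon."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in COMMON_START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Toy example (invented ending, not the real gene).
toy = clean_sequence("ATGACAGCCG AGTAG")
print(len(toy), looks_like_complete_cds(toy), round(gc_fraction(toy), 3))
```

Applied to the full gene sequence, the same helpers could be used to compare the computed length and GC fraction with the Gene Length and GC content fields listed in the record.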