Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_4119 |
Symbol | |
ID | 5060601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 4685200 |
End bp | 4686411 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640476380 |
Product | globin |
Protein accession | YP_001160927 |
Protein GI | 145596630 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0543] 2-polyprenylphenol hydroxylase and related flavodoxin oxidoreductases [COG1017] Hemoglobin-like flavoprotein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGGT CGCACATTCT GTGCGACCGA TTGACACGCT CCGTGGCCAT TCTATGTGTG CCCGATCCCC GTATGGAGAA GGAGCGCGCT CCCGTGGACG ACGTCGCGCG GCTACTCAAG GAGAGTTGGA CCCTGGTTGA GGAGCACCGG GACCGGCTGA GCGAGCACTT CTACGCCCGG CTGTTCCTGC TCGACCCCGA GCTGCGTTCG CTTTTTCCGG CGCAGATGGC GGGTCAGGGT GATCGTCTGT TGGAGGCGAT CATCACCGCC GCCCACACGG TGGACGACCC GGAGGGCTTC GACGAGTTCC TCCGTTCGCT GGGCCGGGAC CACCGCAAGT ACCACGTCGA GGCGACGCAC TACGAGACCA TGGGCGTCGC CCTGCTGGAC GCCTTGCGCA GCACTGCCGG CGACGGCTGG AATCTGACCT TCGACCAGGC CTGGCGGGAC GCGTACGCGG CGATCTCGGG CAAGATGCTC GCGGGGGCGG CGGCGGACGA CAACCCGCCG TTCTGGCATG CCGAGGTGCT GACCCATGCC CGGTACGGGC CGGACACGGC GGTGTTGACG GTCCGGGCCC TCCAGCATCC GCTGCGGTGG CAGGCGGGCC AGTACGTCAG CATTGAGGCG CCCCGATACC ACCCGCGGGT GTGGCGGACC TATTCGGTGG CGAACGCGCC AAACGACGAG AACGTGCTGG AGTTCCACGT TCGGACTCCG CCGGGCGCGG GGTGGTTGTC CGGCGCGTTG GTCCGTCGGG TGAAGCCGGG TGACCTGTTG CGGTTGGCGG CGCCGATGGG GTCGATGACG TTGGATCGGG CGTCGGACCG GGACATCCTC TGTGTCGCCG GCGGGGTTGG GTTGGCTCCG GTGAAGGCGC TGGTCGAGGA GCTGGCGGGC TATAACCGGA CCCGCTGGGT GCACGTCTTC TACGGCGCCC GTACGCCGCT TGACCTCTAT GGCCTGGCCG GGCTCCAGGA GATGGTCGCC CGGCATCCGT GGTTGTCGGT GACGCCGGCG TGCAGTGCGG ACACGGGCTT CGACGGTGAA CTGGGCGATA TCTCCGAGGT GGTCGGCCGG TACGGCCCGT GGACGGCGCA CGACTGCTAC GTCTCCGGGG CGGCGCCGAT GGTCCGGGCC ACACTGCGGG TCCTGTCCGG CGACGAGGTG CCGGCGGAGC GTACTCGGTA CGACACCTAT GGTGATTTGT AG
|
Protein sequence | MNRSHILCDR LTRSVAILCV PDPRMEKERA PVDDVARLLK ESWTLVEEHR DRLSEHFYAR LFLLDPELRS LFPAQMAGQG DRLLEAIITA AHTVDDPEGF DEFLRSLGRD HRKYHVEATH YETMGVALLD ALRSTAGDGW NLTFDQAWRD AYAAISGKML AGAAADDNPP FWHAEVLTHA RYGPDTAVLT VRALQHPLRW QAGQYVSIEA PRYHPRVWRT YSVANAPNDE NVLEFHVRTP PGAGWLSGAL VRRVKPGDLL RLAAPMGSMT LDRASDRDIL CVAGGVGLAP VKALVEELAG YNRTRWVHVF YGARTPLDLY GLAGLQEMVA RHPWLSVTPA CSADTGFDGE LGDISEVVGR YGPWTAHDCY VSGAAPMVRA TLRVLSGDEV PAERTRYDTY GDL
|
| |