Gene Strop_4119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_4119 
Symbol 
ID5060601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4685200 
End bp4686411 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content68% 
IMG OID640476380 
Productglobin 
Protein accessionYP_001160927 
Protein GI145596630 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0543] 2-polyprenylphenol hydroxylase and related flavodoxin oxidoreductases
[COG1017] Hemoglobin-like flavoprotein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGGT CGCACATTCT GTGCGACCGA TTGACACGCT CCGTGGCCAT TCTATGTGTG 
CCCGATCCCC GTATGGAGAA GGAGCGCGCT CCCGTGGACG ACGTCGCGCG GCTACTCAAG
GAGAGTTGGA CCCTGGTTGA GGAGCACCGG GACCGGCTGA GCGAGCACTT CTACGCCCGG
CTGTTCCTGC TCGACCCCGA GCTGCGTTCG CTTTTTCCGG CGCAGATGGC GGGTCAGGGT
GATCGTCTGT TGGAGGCGAT CATCACCGCC GCCCACACGG TGGACGACCC GGAGGGCTTC
GACGAGTTCC TCCGTTCGCT GGGCCGGGAC CACCGCAAGT ACCACGTCGA GGCGACGCAC
TACGAGACCA TGGGCGTCGC CCTGCTGGAC GCCTTGCGCA GCACTGCCGG CGACGGCTGG
AATCTGACCT TCGACCAGGC CTGGCGGGAC GCGTACGCGG CGATCTCGGG CAAGATGCTC
GCGGGGGCGG CGGCGGACGA CAACCCGCCG TTCTGGCATG CCGAGGTGCT GACCCATGCC
CGGTACGGGC CGGACACGGC GGTGTTGACG GTCCGGGCCC TCCAGCATCC GCTGCGGTGG
CAGGCGGGCC AGTACGTCAG CATTGAGGCG CCCCGATACC ACCCGCGGGT GTGGCGGACC
TATTCGGTGG CGAACGCGCC AAACGACGAG AACGTGCTGG AGTTCCACGT TCGGACTCCG
CCGGGCGCGG GGTGGTTGTC CGGCGCGTTG GTCCGTCGGG TGAAGCCGGG TGACCTGTTG
CGGTTGGCGG CGCCGATGGG GTCGATGACG TTGGATCGGG CGTCGGACCG GGACATCCTC
TGTGTCGCCG GCGGGGTTGG GTTGGCTCCG GTGAAGGCGC TGGTCGAGGA GCTGGCGGGC
TATAACCGGA CCCGCTGGGT GCACGTCTTC TACGGCGCCC GTACGCCGCT TGACCTCTAT
GGCCTGGCCG GGCTCCAGGA GATGGTCGCC CGGCATCCGT GGTTGTCGGT GACGCCGGCG
TGCAGTGCGG ACACGGGCTT CGACGGTGAA CTGGGCGATA TCTCCGAGGT GGTCGGCCGG
TACGGCCCGT GGACGGCGCA CGACTGCTAC GTCTCCGGGG CGGCGCCGAT GGTCCGGGCC
ACACTGCGGG TCCTGTCCGG CGACGAGGTG CCGGCGGAGC GTACTCGGTA CGACACCTAT
GGTGATTTGT AG
 
Protein sequence
MNRSHILCDR LTRSVAILCV PDPRMEKERA PVDDVARLLK ESWTLVEEHR DRLSEHFYAR 
LFLLDPELRS LFPAQMAGQG DRLLEAIITA AHTVDDPEGF DEFLRSLGRD HRKYHVEATH
YETMGVALLD ALRSTAGDGW NLTFDQAWRD AYAAISGKML AGAAADDNPP FWHAEVLTHA
RYGPDTAVLT VRALQHPLRW QAGQYVSIEA PRYHPRVWRT YSVANAPNDE NVLEFHVRTP
PGAGWLSGAL VRRVKPGDLL RLAAPMGSMT LDRASDRDIL CVAGGVGLAP VKALVEELAG
YNRTRWVHVF YGARTPLDLY GLAGLQEMVA RHPWLSVTPA CSADTGFDGE LGDISEVVGR
YGPWTAHDCY VSGAAPMVRA TLRVLSGDEV PAERTRYDTY GDL