Gene Arth_1940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1940 
Symbol 
ID4445524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2187831 
End bp2189300 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content69% 
IMG OID639689750 
Productmonooxygenase, FAD-binding 
Protein accessionYP_831422 
Protein GI116670489 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCGACG TGATTATTAG CGGCGGCGGC CCGACCGGGA TGATGCTGGC CAGCGAGCTG 
CGACTGCACG AGGTGGACGT GCTCGTGCTG GAGAAGGACG CGGAGCCAGG AAATGCAGTC
CGCTCGCTCG GCCTGCACCC GCGCAGCATC GAGATCATGG ACCAGCGCGG GCTGCTGGAC
AGGTTCCTCG CACACGGGCA GCAGTATCCC GGTGTCGGCG GCTTCGCCGC GATCGACAAA
CCCCGAACCG CGAACCTGGA CACCGCACAC GGTTACGTCC TCGGCATCCC CCAGCCGGTC
ACCGACGGCC TGCTGGCCGA GCGCGCCGGC GAGCTCGGCG CGCAGATCCG CCGCGGCTGC
GAGGTGACGA TGGTTGAGCA GGGCAGGGAC GGGGTGGACG TCGGGCTCGG CGACGGCACA
CGGTTGCGCT CACGCTGGCT GGTCGGGTGC GACGGCGGAC GCAGCCTGGT GCGCAGACTG
CTCGGCATCG GCTTCCCCGG CGAGCCCGCC ACTACCGAGT GGCTCCTCGG CGAGGTGGAG
GTGACCACGC CGCCGGGCGA GCTCGCGGAG GTGGCGGGCG AGGTGCGAAA GACCCACAAG
GGGTTCGGCA TCGGTCCCAC CGGTAACGGC CTGTACCGGG CCGTCGTTCC GGCGGCGACG
GTGGCCGAGG ACCGTTCGGT CCCGCCTACT CTGGAAGAGT TCCGGACGCA GCTGCGGGCT
TATGCGGGAA CTGACTTCGG CGCCCACTCA CCACGGTCGC TGTCCAGGTT TAGCGACGCG
ACACGGCTTG CCGAGCGATA TCGAGTGGGC CGGGTTCTGC TCGCTGGCGA CGCGGCCCAC
GTCCACCCGC CGTTGGGAGG TCAGGGTTTG AACCTGGGAA TCCAGGACGC GTTCAACCTC
GGATGGAAGC TCGCGGCCGA GGTCAACGGG TGGGCACCGG AGGGGCTGCT GGACAGTTAT
TACGCCGAGC GTCACCCTGT CGCCGAGGAC GTGCTGACCA TCACCCGGGC CCAGAGCGAG
TTGCTCTCCA CCGAGCCCGG CCCGCAGGCT GTGCGCCGGC TGATGACCGA GCTGATGGAC
TTCGAGGACG TCCGCCAGTT CCTGGCCGAG AAGATCACCG CGATCGGGAT CCGCTACAAC
TTCGGCGAAG GGCCCGAGCT GCTTGGCAAA CGGCTACGCG ATATTCCTCT CTCACGCGGT
CGTCTCTACG AGCTCACTCG TGAAGGCCGC GGGCTCCTGC TGGACCAGAC CGGCGAACTC
TCCGTGACGG GATGGACGGA TCGGATCGAC CATGTCGCGG ACGTCAGCGG AGAACTGGAC
GCACCTGCGG TCCTGCTCCG ACCGGACGGC CACGTCGCAT GGATCGGCGA AGACCAGACC
GACCTGCTCC GCCACCTGCC CACATGGTTC GGAGCCGCCT CGGAGGACCC GCTCGTCCGC
GCAGCCCCAG GCGCCGGCCA GAACCAATGA
 
Protein sequence
MFDVIISGGG PTGMMLASEL RLHEVDVLVL EKDAEPGNAV RSLGLHPRSI EIMDQRGLLD 
RFLAHGQQYP GVGGFAAIDK PRTANLDTAH GYVLGIPQPV TDGLLAERAG ELGAQIRRGC
EVTMVEQGRD GVDVGLGDGT RLRSRWLVGC DGGRSLVRRL LGIGFPGEPA TTEWLLGEVE
VTTPPGELAE VAGEVRKTHK GFGIGPTGNG LYRAVVPAAT VAEDRSVPPT LEEFRTQLRA
YAGTDFGAHS PRSLSRFSDA TRLAERYRVG RVLLAGDAAH VHPPLGGQGL NLGIQDAFNL
GWKLAAEVNG WAPEGLLDSY YAERHPVAED VLTITRAQSE LLSTEPGPQA VRRLMTELMD
FEDVRQFLAE KITAIGIRYN FGEGPELLGK RLRDIPLSRG RLYELTREGR GLLLDQTGEL
SVTGWTDRID HVADVSGELD APAVLLRPDG HVAWIGEDQT DLLRHLPTWF GAASEDPLVR
AAPGAGQNQ