Gene Arth_2055 details

Gene Information

Locus tag: Arth_2055
Symbol:
ID: 4445429
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2316558
End bp: 2317808
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 67%
IMG OID: 639689863
Product: monooxygenase, FAD-binding
Protein accession: YP_831535
Protein GI: 116670602
COG category: [C] Energy production and conversion; [H] Coenzyme transport and metabolism
COG ID: [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases
TIGRFAM ID:
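
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene is an unspliced CDS ending in a stop codon; the function names are illustrative and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons minus the stop codon; assumes an unspliced CDS with a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2316558, 2317808)
print(length)                     # 1251
print(protein_length_aa(length))  # 416
```

Both results agree with the Gene Length and Protein Length fields listed in the record.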


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.6459
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGAGC ACGCCACGTC CACCGATGTC CTCGTCATCG GAGGGGGTAT GGCCGGACTC 
GCCGGAGCCC TCGCGCTACG CGAAAACGGT GCCGACGTCA CCCTCGTGGA ACGGGCACCG
GAGTTCGGCG AGGTGGGCGC CGGGCTGCAG ATGGCCCCCA ACGCCTCCCG CGTCCTCCGG
AGCTGGGGCC TGCTGGAGAA AGCGCTCGAA ATCGGTGTCC AGCCCAAGCA CCTGGTCTTC
CGCGACGCCG TCACCGGAGA AGAGCTCACC CGCCAGACGC TCGGCACCGA ATTCGAGGAA
CGCTACGGTG CGCCGTATGT CGTGATCCAC CGCAGCGACC TCCACCGGGT CCTCCTCGAG
GGCTGTGAAG CAGCCGGGGT CAAGCTCGTC AACGACGTCA TGGTGGAGAG CGTTGAGACT
GTGAACGGCC GCGGCGTGGC CCACACCGCC GGAGGCGTTG ACTACGAGGC CGACGTCGTT
ATCGGCGCCG ACGGTCTCCG GTCCACCCTC CGTCCGCTCG TGGCAAACGA CGAACCCGTC
TCCTCGGCGT ACGTCGCCTA CCGCGGCACG GTGCCGATCA CCGAGAACAC ACCCAAGGCC
GACCTTGAAG ACGTCATCGT CTACCTCGGA CCGGACTGCC ACCTGGTGCA GTACCCGTTG
CGCAAGGGCG AACTGCTGAA CACCGTGGCC GTCTTCAAGT CGCCGTCCTT CGAAGCCGGC
GTCGAACAGT ACGGGGGAGT GGACGAGCTC GAGGCAGCCT ACAAGGACTG CGTCCCGGCG
GTTCAGGAAG CGCTGAAGAA CCTGGCCACC GGCATCCGCT GGCCCATGTA CGACCGTGAC
CCGATCGAGA ACTGGGTTGC CGGCCGCATG GTGCTGATGG GCGACGCCGC GCACCCGATG
CTGCAGTACC TGGCGCAGGG CGCTTGCCAG GCCCTCGAGG ACGCAGCCGT GCTGCAGGAC
GTCAGCGCCG GCACCGTCTT CACCGCGGAC GGCGTCAACC CCGAAGCCTG GGACGGCGCC
ATCAAGGAGT TCAACGCAAT CCGCGCCGGC CGCACCGCCC GGGTCCAGCG CACCGCCCGC
GTCTGGGGCG AATCCTGGCA TGTCTCCGGA CTGGCCCGGA CCCTGCGCAA CCTGCTCTTT
AAGAGCCGGA AGGACAACGA CTTCCAGTAC AACGACTGGC TGTACGGCCA GGCCGGCGAC
GGAGTTCCTG CCGCCGTCCG TAAGGAGACC GCGGTGAAAG TCCCCGCCTG A
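
The gene sequence is laid out in blocks of 10 bases, so it has to be stripped of whitespace before any analysis. A minimal sketch of that cleanup, with a GC-fraction helper that could be compared against the GC content field, might look like this; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGTCTGAGC ACGCCACGTC CACCGATGTC CTCGTCATCG GAGGGGGTAT GGCCGGACTC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing all rows of the gene sequence to `clean_sequence` should give a string of 1251 bases, matching the Gene Length field.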
 
Protein sequence
MSEHATSTDV LVIGGGMAGL AGALALRENG ADVTLVERAP EFGEVGAGLQ MAPNASRVLR 
SWGLLEKALE IGVQPKHLVF RDAVTGEELT RQTLGTEFEE RYGAPYVVIH RSDLHRVLLE
GCEAAGVKLV NDVMVESVET VNGRGVAHTA GGVDYEADVV IGADGLRSTL RPLVANDEPV
SSAYVAYRGT VPITENTPKA DLEDVIVYLG PDCHLVQYPL RKGELLNTVA VFKSPSFEAG
VEQYGGVDEL EAAYKDCVPA VQEALKNLAT GIRWPMYDRD PIENWVAGRM VLMGDAAHPM
LQYLAQGACQ ALEDAAVLQD VSAGTVFTAD GVNPEAWDGA IKEFNAIRAG RTARVQRTAR
VWGESWHVSG LARTLRNLLF KSRKDNDFQY NDWLYGQAGD GVPAAVRKET AVKVPA
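
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below builds the codon-to-amino-acid mapping in the usual TCAG ordering; translation table 11 uses the same amino acid assignments as the standard code and differs in which start codons it permits, which this sketch does not model. The `translate` function is illustrative and is not the tool used to produce the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a (possibly block-formatted) DNA sequence until the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are NOT converted to M here.
    """
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in the record.
print(translate("ATGTCTGAGC ACGCCACGTC CACCGATGTC"))  # MSEHATSTDV
print(translate("GTCCCCGCCTGA"))                      # VPA
```

The two outputs match the first ten and last three residues of the protein sequence listed above.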