Gene Arth_1994 details

Gene Information

Locus tag: Arth_1994
Symbol:
ID: 4445473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2248152
End bp: 2249429
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 66%
IMG OID: 639689803
Product: cytochrome P450
Protein accession: YP_831475
Protein GI: 116670542
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
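
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    # Values copied from the Start bp / End bp fields of this record.
    length_bp = gene_length(2248152, 2249429)
    print(length_bp, "bp")
    print(protein_length(length_bp), "aa")
```

Under these assumptions the computed values agree with the Gene Length (1278 bp) and Protein Length (425 aa) fields.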


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.269689
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCACCCT CGACGGAAAC GGCCGGCCGC TGTCCCTTTG GACACGGCGC CGAAGCACCC 
GCAGGGCACC ACGGCTACGA GCCGTTCCAG ATGAAGGACC CCTTCCCGGC CTACGCCGAA
CTCCGGGCCG AGCAGCCCGT GATGTTCGAC GAGCGGGTCG GCCTCTACGT CGTCTCCCGC
TATGACGACA TCAAGGCTGT CTTCGAGGAC TGGGAGACCT TCTCCAGCGA AAACGCCCAG
GCCCCCGTCC GCGAACGCGG CCCCGCCGCG AAGAAAATCA TGGAAGACGG CGGCTTCACC
GCCTACTCGG GCCTGTCCGC CCGCCGCCCT CCGGAGCACA CCCGCATCCG CGCCGTGGTC
CAAAAGGCCT TCACGCCGCG CCGCTACAAG GCGCTGGAGC CCTTCATCCG GCAGAACGTC
ATCGAACTGA TCGAAAAGAT GCTGGCGCGC CCGGAACACC GCGGGGACAT GGTCAAGGAC
CTCGCCTACG ACGTCCCCAC CATCACCATC CTCACCCTCA TCGGAGCGGA TGTCTCCCAG
GTGGACACCT TCAAGCGCTG GAGCGACTCC CGCGCGGCCA TGACCTGGGG CGACCTCAGC
GACGAGGAAC AAATTCCGCA CGCCCACAAC CTCGTGGAGT ACTGGCAGGA ATGTCTCCGG
CTGGTCAAGG TGGCCCACGA GCAAGGCGGC GACAACCTCA CCGCGGACCT GGTGAAGTCG
CAGCAGGAGG GTGCGGAGAT TTCCGACCAC GAGATCGCCT CCGTACTCTA CAGCCTGCTC
TTCGCCGGGC ACGAGACCAC CACCACGCTG ATCTCCAACG CCCTGCGCGA GCTCCTGTCC
CGGCCCGAGC AGTGGCAGCA GCTCGTCGAG GACCCCAAGA AGATCCCCGC CGCCATTGAC
GAGGTCCTGC GCTACGCCGG CTCGATCGTC GGCTGGCGCC GCAAGGCGCT CAAGGACACC
GAGGTTGGTG GCGTGCCCAT TGAGGAGGGC GCGCAGCTGC TGCTCCTGAT GGGCTCCGCC
AACCGGGACG AGGCCAAATT CAACGCCGGC GAAGACTTCG ACATCACCCG CCCCAACGCC
CGCGAGCACC TCTCCTTCGG GTTCGGCATC CACTACTGCC TGGGCAACAT GCTCGCCAAA
CTCCAGGCCA AGATCGCACT CGAGGAAGTG GCACGGCTCG CCCCGGCGCT GCAGCTGGAG
AACCCGGAAG CCATCACCTT CCGGGAAAAC CTCTCCTTCC GAGTCCCCGA GTCCGTCCCC
GTCAGCTGGA AGGCCTGA
 
Protein sequence
MSPSTETAGR CPFGHGAEAP AGHHGYEPFQ MKDPFPAYAE LRAEQPVMFD ERVGLYVVSR 
YDDIKAVFED WETFSSENAQ APVRERGPAA KKIMEDGGFT AYSGLSARRP PEHTRIRAVV
QKAFTPRRYK ALEPFIRQNV IELIEKMLAR PEHRGDMVKD LAYDVPTITI LTLIGADVSQ
VDTFKRWSDS RAAMTWGDLS DEEQIPHAHN LVEYWQECLR LVKVAHEQGG DNLTADLVKS
QQEGAEISDH EIASVLYSLL FAGHETTTTL ISNALRELLS RPEQWQQLVE DPKKIPAAID
EVLRYAGSIV GWRRKALKDT EVGGVPIEEG AQLLLLMGSA NRDEAKFNAG EDFDITRPNA
REHLSFGFGI HYCLGNMLAK LQAKIALEEV ARLAPALQLE NPEAITFREN LSFRVPESVP
VSWKA
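
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal sketch. It uses only the Python standard library and builds the codon table by hand. NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits, so the standard assignments are used here. The helpers `translate` and `gc_content` are hypothetical names introduced for illustration. The two fragments are the first 21 and last 18 bases of the gene sequence in this record.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty string)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    five_prime = "ATGTCACCCT CGACGGAAAC G"  # first 21 bases of the gene
    three_prime = "GTCAGCTGGA AGGCCTGA"     # last 18 bases, including the stop
    print(translate(five_prime))
    print(translate(three_prime))
    print(f"{gc_content(five_prime):.0%}")  # GC fraction of the fragment only
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the protein sequence and the GC content field of the record.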