Gene Arth_1978 details


Gene Information

Locus tag: Arth_1978
Symbol:
ID: 4445492
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2230510
End bp: 2231676
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 63%
IMG OID: 639689787
Product: cytochrome P450
Protein accession: YP_831459
Protein GI: 116670526
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
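
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the final codon is a stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2230510
end_bp = 2231676
reported_gene_length = 1167      # bp
reported_protein_length = 388    # aa

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should contain a whole number of codons.
# Assumption: the last codon is a stop codon and adds no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1167 0 388
```

Under these assumptions the computed values agree with the reported gene length and protein length.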


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.449143
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATTTCG CCGCCGCCAA TGAGAACCCG CTGGATCCCT TCCCCTATTA CGAGCGAATG 
AGGGAAGCCG CACCCGTCTT CCACGACGAG CAATCGGGAA GTTGGCATGT CTTCAGGTAT
GACGACGTGC AGCGGGTCCT GTCCGAATAC GCCACCTTTT CTTCCCGGAT GGGCGGCGAC
GATCCTTCCG AGACAGGCCA GCTGTTCGCC TCGAGCCTGA TCACCACAGA TCCGCCGCGG
CACCGTCATT TACGCTCGCT GGTGACCCAG GCGTTCACAC CGAAAGCCGT GGACGCGCTT
GCTCCCCGCA TTTCGGAACT CACGGAAGAG CTTCTGGACG GGATCGTTTC CCGCGGTGGC
GCCGACCTGA TCGAGGAGTT GGCGTACCCG CTGCCGGTTA TCGTGATTTC GGAACTCATG
GGTATCCCCG CGGATGACCG GGACCGCTTC AAGCAGTGGT CCGATGTCAT CGTCAGCCAA
ACGCGGACCA ATGCGGCAAC GGAAGACCAC CAGGCCACTA ACCGGGAAAT GACGGGATAC
TTCCTGGACC TCATCGAACA GCGACGGCGG CGGCCCGGCG ACGACTTGAT CAGCAACCTG
CTCAGCGCCG AGATTGACGG GCAGAAACTG AACGTGGCCG AACTGCTGGG CTTCTGCGCC
CTGCTGCTCG TCGCCGGCAA CGAAACAACC ACGAACCTGA TCGGCAACGC GGTCCTTTGC
TTTACCGAGG TGCCTGGCAC CATCGATCGG TTAGTGATGG AGCCGGCACT GCTCCCTCAG
GCCATCGAAG AAGTGCTTCG CTTTCGGTCC CCGGTCCAGT CCATGTACCG GGTGACGGTC
ACCGACACCA TCCTCGGCGA CGTTCAGATG CCTGCCGGCG CACCCGTGGT GGCGTGGATC
GGCTCCGCAA ACCGCGACGA ACGGCAATTC CAACGCCCTG CCGAGTTCGA CGTCGACCGG
GGCCAGATCC GTCACTTGGC ATTCGGCCAC GGCGTCCACT TCTGCCTCGG TGCGCCGCTT
GCGAGGCTTG AAGCAAGGAT CGCACTGGAA GCCATCCTGT CCCGGCTGCC TGGACTGGCA
CTCGCCCCGG GCGCGCACCT GGAACGGATG GACAGCACCA TTGTCTACGG GCTGAAGGCG
CTGCCTGCGG GCTGGCAGGC AGCCTGA
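
GC content is the share of G and C bases in a sequence. The sketch below shows one way to compute it from text laid out as above, in space-separated groups of ten bases. The function name `gc_percent` is hypothetical, and the demonstration uses only the first line of the gene sequence, so its result is not the whole-gene figure reported in this record; pasting the full sequence into the call would be needed to compare against that figure.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only (60 bases), used as a small demo input.
first_line = "ATGGATTTCG CCGCCGCCAA TGAGAACCCG CTGGATCCCT TCCCCTATTA CGAGCGAATG"
print(round(gc_percent(first_line), 1))
```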
 
Protein sequence
MDFAAANENP LDPFPYYERM REAAPVFHDE QSGSWHVFRY DDVQRVLSEY ATFSSRMGGD 
DPSETGQLFA SSLITTDPPR HRHLRSLVTQ AFTPKAVDAL APRISELTEE LLDGIVSRGG
ADLIEELAYP LPVIVISELM GIPADDRDRF KQWSDVIVSQ TRTNAATEDH QATNREMTGY
FLDLIEQRRR RPGDDLISNL LSAEIDGQKL NVAELLGFCA LLLVAGNETT TNLIGNAVLC
FTEVPGTIDR LVMEPALLPQ AIEEVLRFRS PVQSMYRVTV TDTILGDVQM PAGAPVVAWI
GSANRDERQF QRPAEFDVDR GQIRHLAFGH GVHFCLGAPL ARLEARIALE AILSRLPGLA
LAPGAHLERM DSTIVYGLKA LPAGWQAA
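
The protein sequence can be related back to the gene sequence by translating codon by codon. The following is a minimal sketch, not the database's own method: it builds the codon table from the standard 64-codon layout, which translation table 11 shares with the standard code for amino acid assignments (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative. Only the first and last lines of the gene sequence are translated; the last line is in frame because each preceding line holds 60 bases.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a full codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGGATTTCG CCGCCGCCAA TGAGAACCCG CTGGATCCCT TCCCCTATTA CGAGCGAATG"
last_line = "CTGCCTGCGG GCTGGCAGGC AGCCTGA"

print(translate(first_line))  # MDFAAANENPLDPFPYYERM
print(translate(last_line))   # LPAGWQAA
```

The two outputs match the first twenty and the last eight residues of the protein sequence above.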