Gene Arth_2846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2846 
Symbol 
ID4444678 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3201744 
End bp3202793 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content69% 
IMG OID639690668 
Product2-nitropropane dioxygenase, NPD 
Protein accessionYP_832325 
Protein GI116671392 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0398163 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGGCA CCATTTTCGG TACACGGATC ATCGCCGCAC CCATGGCAGG GGGAACATCG 
ACGCCCCGGT TTGTGCGGGC CGCGCACCGG GCAGGCGGGT TGGGCTTCCT GGCCGCGGGC
TACAAAACCG TGGCTGCCAT GCAGTCCGAC ATCAGCGAGG TGCGCTCCGC AGGGGCGCGC
TTCGGCATGA ACCTCTTCGT TCCCGACCGG ACGCAGCTCA ATTCCCCGCA AGCTGTCAGG
GAACAGTTGG AGAACTACCG GCAGAGCCTC GAGCCGGACG CCCGGCGGTA CGGGGTGGCC
CTTCCCGGGC TGCGCCTCGA CGACGACGAC GAGTGGCAGG GCAAGATCGA CGCCCTGCTG
GCGGATCCCG TGGAGTTTGT CAGCTTCACG TTCGGGTTCC CCGAGCAGGC CGAAGTGCGC
GCGCTCCAGC GCGCCGGTTC CACGATCATT GCCACGGTCA CCAGCCCGGC GGAAGCCTCG
CTGGCCGTGG AGCGCGGAGC GGACATCCTT GTGGTCCAGC ACGGCAGCGC CGGCGGCCAC
AGCGCCGCCT TCCTGGCCCG GGAAAGCACC ACGGGCTCCC CCGGCCCCGC GACGACGGCG
GAGCTGCTCG CCGCTGTGCG CACCGCCGTC GGCGTACCCC TCGTGGCGGC CGGCGGAATC
ATGGATTCCG CCGGCCTCGA CTCGGTGTTG GCCGCCGGCG CAACGGCAGC CCAGCTGGGT
ACGGCATTTC TGCGCAGCGA CGAAAGCGGC GCCCGCCAGC TCCACAAGGA CGCCCTCGCC
AGCCAAAGCT TCACGGAAAC ACGACTGACC CGTGCGTTCA CCGGCAGGCA GGCGCGTGCC
CTGGTCAATT CGTTCGTCCG CGACCACGAT GACGCACCGG AAGGGTACCC GGCCCTCCAC
CATCTCACCG CTCCGATCCG TGCAGCAGCC GCCGCGGCGG GTGATCCGGA AAGGCTGAAC
CTGTGGGCGG GAACCGGATG GCGGAAGGCT GGCGCCGGTC CGGTCTCGGA CGTCATCAAC
GGGATACTTG CGGGTTCGGC CCTTTCCTGA
 
Protein sequence
MTGTIFGTRI IAAPMAGGTS TPRFVRAAHR AGGLGFLAAG YKTVAAMQSD ISEVRSAGAR 
FGMNLFVPDR TQLNSPQAVR EQLENYRQSL EPDARRYGVA LPGLRLDDDD EWQGKIDALL
ADPVEFVSFT FGFPEQAEVR ALQRAGSTII ATVTSPAEAS LAVERGADIL VVQHGSAGGH
SAAFLAREST TGSPGPATTA ELLAAVRTAV GVPLVAAGGI MDSAGLDSVL AAGATAAQLG
TAFLRSDESG ARQLHKDALA SQSFTETRLT RAFTGRQARA LVNSFVRDHD DAPEGYPALH
HLTAPIRAAA AAAGDPERLN LWAGTGWRKA GAGPVSDVIN GILAGSALS