Gene Arth_3948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3948 
Symbol 
ID4447766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4461659 
End bp4462732 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content67% 
IMG OID639691779 
Productaldo/keto reductase 
Protein accessionYP_833423 
Protein GI116672490 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCTCA ACCAGCTACG GGTGTTCGGG CGCAGCGGGA CATTGATCAG CCCGCTCACT 
TTGGGGACCA TGAACTTCGG CGAGGGCGCA CGGGCGGACC CGGCGGGCGC CGGCGGAACC
GCCCAGGGCT ATGCGCCCAC CGGCGCCGAT GAAAGCATCC GCATCATCAA CGCTGCCCTG
GACGCCGGCA TCACCGCCGT GGACACGGCG GACGTCTACT CGCAGGGGCA GAGCGAACAG
GTGGTGGGCC GCGCCCTGAA GGGCCGCCGC GACGATGTTT TCATTGCCAC CAAATTCCAC
GGCCAGATGA GCGCCAACCC GGCCCACTCC GGCAACTCGA GGCGCTGGAT CATGCAGGCA
GTGGAGGGCA GCCTCCGCCG CCTGCAGACG GACCGGATCG ACCTGTACCA GGCGCACCGT
CCCGACTACA ACACCGACGT CCTGGAAACC ATCACGGCAC TGAACGACCT CATCCGCCAA
GGCAAGATCC TCTACTACGG AACGTCCGTT TTCACTCCGG CGCAGCTGGT GGAGGCCCAG
TGGCTGGCAA CCACCAACCA CCTCATCCCG CCCGTCGCCA ACCAGGTCCC CTATTCCATG
CTGGTCCGGG GCACCGAGCG TGATGTCCTG CCGATCGCCC AGCAGTACGG GCTCGGAGTG
CTGGCCTACG GTCCGCTGGC CGGCGGCTGG CTGTCCGGGA GCTTTGTCCT GGATGCCGGG
AAGCCGCCCA CGCGCGTTCA CTCGCTTCCC GGACGGTACG ACATTTCCGG CCCGGCGAGC
GAGCGCAAGC TGCACGCCGC AGATGCCCTG GCCAGGCTGG CGGACAAGCT GGAACTTCCG
CTGGTGGACC TGGCAGTCGG CTTCGCGCTG AACCACCCGG CTGTCAGCAG TGTGATCATT
GGGCCGCGGA GCGAGGAGCA CCTGCACGCC TACCTGAAGG CTGCGGACAC GGTGCTGGAC
GAATCCGTGC TGGATGCCAT CGACGAGCTG GTGCCCCCGG GCACCAATTT CGTGGAGCGG
GACGCCGGCG CCGTGGTCCC CTCCCTGGAG TATGCGGAGC TCCGGCGAAG GTAG
 
Protein sequence
MSLNQLRVFG RSGTLISPLT LGTMNFGEGA RADPAGAGGT AQGYAPTGAD ESIRIINAAL 
DAGITAVDTA DVYSQGQSEQ VVGRALKGRR DDVFIATKFH GQMSANPAHS GNSRRWIMQA
VEGSLRRLQT DRIDLYQAHR PDYNTDVLET ITALNDLIRQ GKILYYGTSV FTPAQLVEAQ
WLATTNHLIP PVANQVPYSM LVRGTERDVL PIAQQYGLGV LAYGPLAGGW LSGSFVLDAG
KPPTRVHSLP GRYDISGPAS ERKLHAADAL ARLADKLELP LVDLAVGFAL NHPAVSSVII
GPRSEEHLHA YLKAADTVLD ESVLDAIDEL VPPGTNFVER DAGAVVPSLE YAELRRR