Gene Arth_1041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1041 
Symbol 
ID4446476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1118158 
End bp1119147 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content69% 
IMG OID639688844 
Productaldo/keto reductase 
Protein accessionYP_830535 
Protein GI116669602 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAGC AGCCCGAAGG CGCCCAGGTG GACGGCCTGA AACTTCCCAT TTCCCGGCTG 
GTCCTGGGGA CCATGACGTT CGGCGACACG GTCGACGAGG CCACCGCGGG GCGGATGGTG
GAGGAAGCGC TCGACGCCGG CATCACCACG ATCGACACCG CCAACGCCTA CGTCGGGGGA
ACCACCGAGG AAATGCTCTC CCGCCTCCTC AAGGGTCGCC GCGGCGACGT TATTCTCGCC
TCCAAGGCGG GCATGCCGCA CGCGGACCAC GGCTCCAACT CGCCGTTGTC GCCCGCGGGT
CTGCGTGCCA GCGTGGAGGG GAGCCTCCGC CGGCTCGGCG TGGACAGCAT CGACCTGTTC
TACCTGCACC AGCCGGACCG CGCCACGCCG CTGCGCGACA CACTGGCCAC CGTGGCCGAG
CTGTTCGCCG AGGGGAAGAT CTGCGCGCTG GGCGTGTCCA ACTTCGCGGC CTGGCAGATT
GCCGACGTCA TCCACACGGC ACGCGAAGTG GGGGCGCCGC GGCCGGTGGT CGCGCAGCAG
CTGTACAACC TGGTGGCACG CCGGGTGGAG GAGGAATACC TCGAATTCGC CGCCACCCAC
AACGTGCACA CCATGGTCTA CAACCCCCTG GGCGGCGGGC TGCTCACCGG CAAGCACAGC
TTCGACGCCA AGCCCACCGA GGGCCGCTAC GGCGACTCCA AGCTGGCCGC CATGTACACC
CAGCGGTACT GGGACAAGCA GCTGTTCGAC GCCATTGAAG AGCTCTCCCG CATCGCTGAC
GGTGCAGGGA TTTCCCTGGC CGAGCTGTCG CTGCGCTGGC TGGCCTACCG GGACGGCGTG
GGCTCCATGC TGCTGGGCGG CTCCAAGGTG GAACAGCTGC AGTCCAACAT CGCCGCCGTC
GCCAACGGGC CGCTGCCCGC CGACGTCGTG GACGCCTGCG ACGCCGTGGG CACCTCGCTG
CGGGGCCCCA TGCCCGCCTA CAACCGCTGA
 
Protein sequence
MSKQPEGAQV DGLKLPISRL VLGTMTFGDT VDEATAGRMV EEALDAGITT IDTANAYVGG 
TTEEMLSRLL KGRRGDVILA SKAGMPHADH GSNSPLSPAG LRASVEGSLR RLGVDSIDLF
YLHQPDRATP LRDTLATVAE LFAEGKICAL GVSNFAAWQI ADVIHTAREV GAPRPVVAQQ
LYNLVARRVE EEYLEFAATH NVHTMVYNPL GGGLLTGKHS FDAKPTEGRY GDSKLAAMYT
QRYWDKQLFD AIEELSRIAD GAGISLAELS LRWLAYRDGV GSMLLGGSKV EQLQSNIAAV
ANGPLPADVV DACDAVGTSL RGPMPAYNR