Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3770 |
Symbol | |
ID | 4447855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 4249160 |
End bp | 4250179 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639691594 |
Product | aldo/keto reductase |
Protein accession | YP_833245 |
Protein GI | 116672312 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.730632 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACCGG CACCCGCACC ATTGGTCCAC CTCGGCGACG GATTGAACGT CAGCCCCCTG GGGTTCGGCG GCATGGCCCT GACCCCGGTC TACGGCGAAG TGGACGAGGC CGAGGCCCTG CGGACCCTGC ACCATGCGGT GGATGCGGGC GTCAGCTTTA TCGACACTGC GGACATCTAC GGCGGCGGCA GCAACGAGGA ACTGATTTCC CGGTTGCTCA AGGAGCGCCG GGACGAAGTA CAGCTGGCCA CCAAGTTCGG GCTGGTGGGC ACCCCGTCAA CCGGCTATAC GGACATCCGG GGTGACGCCG CCTATGTGCG GGAGGCGGTG GAAGCCAGCC TCCGCCGCCT GGGCACCGAC ACCATTGACC TGTACTACAT GCACCGCCGC GACCTCCGCG TTCCGATTGT GGAAACCGTG GAGGCCATGG CGGCGCTGGT GCAGCAAGGC AAGGTGCGGC ACCTCGGCCT GTCGGAAGTG ACGGCCGAGG AACTGCGGGA AGCCAGCAGC GTCCACCCGA TCGCGGCGGT CCAGAGCGAG TGGTCCATCT GGAGCCGCGA TGTGGAACGC CACGTTGTTC CCGCGGCGGC CGCACTTGGG GTGGGTTTCG TGCCGTATTC ACCGCTGGGC CGGGGATTCC TCACCGGCAC AGTGGACGCT TCGAAGCTCG GGAGCGGCGA CTTCCGCCGC AACATCCCCC GCTTCGCCGC TGACGCGGCG GATGCCAACC GGGGAGTGGT GGCCGCGGTT CAGGCCGTCG CAGCCGAGCT CACCGTCGCC GGCGAGCCGG CGACTCCGGC ACAAGTGGCC CTGGCCTGGC TGTTTGCCCA GGGCAAAAAG CTTGGCCTGC CTGTGGTTCC CATCCCCGGC ACGCGCAAGG CGGAACGGAT CGACGAGAAC CTGGGTGCGC TGTCGCTCAA CTTCACCAGT GCGCAACTGG AGAAGCTCGA CGCCGCCGCG GACGCCGTCG TCGGCTCCCG CTCGGCGGAC CCCAAGTGGG TGTCCCAGGG CCGCGAATAG
|
Protein sequence | MAPAPAPLVH LGDGLNVSPL GFGGMALTPV YGEVDEAEAL RTLHHAVDAG VSFIDTADIY GGGSNEELIS RLLKERRDEV QLATKFGLVG TPSTGYTDIR GDAAYVREAV EASLRRLGTD TIDLYYMHRR DLRVPIVETV EAMAALVQQG KVRHLGLSEV TAEELREASS VHPIAAVQSE WSIWSRDVER HVVPAAAALG VGFVPYSPLG RGFLTGTVDA SKLGSGDFRR NIPRFAADAA DANRGVVAAV QAVAAELTVA GEPATPAQVA LAWLFAQGKK LGLPVVPIPG TRKAERIDEN LGALSLNFTS AQLEKLDAAA DAVVGSRSAD PKWVSQGRE
|
| |