Gene Information |
Locus tag | Arth_4059 |
Symbol | |
ID | 4447790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 4581726 |
End bp | 4582721 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639691890 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_833534 |
Protein GI | 116672601 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
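The coordinates and lengths in the table above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced here.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons; the stop codon, if counted
    in the gene length, does not contribute a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record for Arth_4059.
length = gene_length_bp(4581726, 4582721)
protein = protein_length_aa(length)
print(length, protein)  # prints "996 331"
```

Under these assumptions, 996 bp corresponds to 332 codons, of which the last is the stop codon, which matches the 331 aa protein length reported in the record.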
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATGC TCCAGGGGGC GCACGTCCTG GTCACCGGCG GAGCCGGCAC TATCGGATCC ACTATTGTCG ACCACCTGGT CACCGCCGGC GTTGAACGGA TAACCGTCCT GGACAACCTG GTCCGGGGCC GCCGGGCCAA CCTGGACGAC GCGGTGGCCA CCGGCAGGGT GGAACTCGTC GAAGGGGACC TGCGCGACCG CGACCTCGTC CACGACCTCA CCCGCGGCAA GGACATCGTC TTCCATCAGG CGGCCATCAG GATCACCCAG TGCGCCGAGG AGCCGCGGCT CGCGCTCGAA GTGCTGGTGG ACGGCACGTT CAACGTCTTC GAGGCGGCGG CCGAACACGG TGTGGGCAAG CTGGTGGCGG CATCCAGCGC GTCGGTTTAC GGCATGGCGG AGGAATTTCC CACCAGCGAA CGCCACCACC ACCACAACAA CGACACGTTC TACGGCGCGG CGAAGTCCTT CAACGAGGGA ATGGCCCGCA GCTTCCGTGC GATGACCGGC CTGGACTACG TCCTCCTGCG CTACTTCAAC GTCTACGGGC CGCGGATGGA TGTGCACGGC CTCTACACAG AGGTCCTGGT GCGCTGGATG GAGCGCATCG CGGACGGGCA GCCGCCGCTG ATCTTTGGTG ACGGACGGCA GACCATGGAT TTCATCCACA CCCGTGACGT TGCCCGGGCC AACATCCTGG CCGCCGGAAG CGGCGCGCGC GAGGGGGTCT ACAACGTGGC CAGCGGGGAA GAAACAAGCC TCCTGCAACT CGCCGAGGCG CTATTGCGGG CCATGGATTC CGAACTGCAC GTGGAACACG GACCCGACCG CGCCATCAAC GGCGTTGTCC GCCGCCTCGC GGATACTTCC GCGGCCCGGC TTGACCTTGG CTTCGCGGCC GAAACCGGAC TTGAGGACGG GCTCCGCGAA CTCGTGGACT GGTGGCGTCC GCTTCGCGGC GAAATTGCCG CCGCCCGGGT CGGAGGCGTG CGGTGA
|
Protein sequence | MSMLQGAHVL VTGGAGTIGS TIVDHLVTAG VERITVLDNL VRGRRANLDD AVATGRVELV EGDLRDRDLV HDLTRGKDIV FHQAAIRITQ CAEEPRLALE VLVDGTFNVF EAAAEHGVGK LVAASSASVY GMAEEFPTSE RHHHHNNDTF YGAAKSFNEG MARSFRAMTG LDYVLLRYFN VYGPRMDVHG LYTEVLVRWM ERIADGQPPL IFGDGRQTMD FIHTRDVARA NILAAGSGAR EGVYNVASGE ETSLLQLAEA LLRAMDSELH VEHGPDRAIN GVVRRLADTS AARLDLGFAA ETGLEDGLRE LVDWWRPLRG EIAAARVGGV R
|
| |
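The gene sequence above is displayed in space-separated groups of ten bases, so it needs light cleaning before any analysis. The sketch below shows one way to strip the grouping, compute GC content, and run a basic sanity check for a complete coding sequence. It is an illustration under stated assumptions, not the method used by the source database. The helper names are hypothetical, and the set of start codons (ATG, GTG, TTG) is an assumed subset of the start codons commonly seen with translation table 11.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between display groups and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: whole codons, a plausible start codon, and a stop codon."""
    start_codons = {"ATG", "GTG", "TTG"}  # assumed common bacterial starts
    stop_codons = {"TAA", "TAG", "TGA"}
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in start_codons
        and seq[-3:] in stop_codons
    )


# Only the first three display groups of the gene sequence, for brevity.
fragment = clean_sequence("ATGAGCATGC TCCAGGGGGC GCACGTCCTG")
print(len(fragment), gc_percent(fragment))
```

Passing the full gene sequence from the record through `clean_sequence` and then `gc_percent` gives a value that can be compared with the 67% GC content listed in the gene information, and `looks_like_complete_cds` checks that the sequence begins with a start codon and ends with a stop codon.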