Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1942 |
Symbol | |
ID | 4445526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 2191412 |
End bp | 2192662 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639689752 |
Product | N-acylglucosamine 2-epimerase |
Protein accession | YP_831424 |
Protein GI | 116670491 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2942] N-acyl-D-glucosamine 2-epimerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGTGGC TGAACAGCGC CGCTCACGCG CGATGGCTCG AGGCCGAAAC CGATCGGCTG ATCAATTTCG CCGCGGGGTC AAAGGTGCCC ACCGGATTCG GCTGGCTCGA CAACAACGGT GCTGTGTTCA CCGACAAGCC GACCCACCTC TGGATTACCG CGCGGATGGT TCATAGTTTC GCGGTCGCTG CCCTGATGGG GCGGCCAGGC GCTGCGACAC TCGTCGACCA CGGCATCGCC GCCCTCAACG GAGTTTTCCA CGATGATGAG TTCGGCGGCT GGTATGCCGA GGTAGACGGG AACGGACCGG TAAACGACAC CAAGTCCGGT TACCAGCACT CATTCGTGCT CCTCGCCGCT GCCAGCGCCG TTGCGGCCAA CCGCCCGGGC GCCAGGGAAT TGCTCGACGA GGCACTGCGA ATCGCTGACA CGAGGTTCTG GGACCACAAC GCCGGCATGT GCTTTGACTC GTGGAACCGG GAATTTACGG AAACGGAAGC ATACCGTGGC GGGAACGCGA GCATGCACTC CGTCGAGGCG TACCTCATCG TCGCCGATGT GACCGGGGAC AACCGCTGGC TCGAACGGGC GCTCCACATC GCCGAGGTGC TCATCCACGA CTTTGCTCGC AACAACGACT ACCGGGTCTT CGAGCACTTC GACCCGGAGT GGAACCCCAT GCCGGAGTAC AACACCGATG ACCGGGCCAG CCAGTTCCGC GCGTACGGCG GAACGCCCGG CCACTGGGTT GAGTGGGCAC GCCTGCTCCT GCACGTCCGC GCCGGGCTCG AAGCCCGCGG CATGGAGGTC CCGGCCTGGC TGCTGGACGA CGCACGGGGC CTGTTCGACG CCGCCATCCG CGACGCCTGG GAACCGGACG GTCATCCGGG CTTTGTTTAC ACCGTCGACT GGGAAGGCAA GCCGGTGGTC ACCACCCGCA TTCGCTGGGT TCCCGCCGAG GCAATCGGGG GCGCGGCAGC ACTCTACATC GCCACCGGCG ACCAGAAGTA CGCGGACTGG TACGAGCGAA TCTGGGACCA CGCCCGCGAC TGGTTTATCG ACTACGAGCA CGGGTCCTGG AAGCAGGAAC TCGACGAGAA CGGCAATGTC ACCTCCACCG TGTGGTCCGG CAAGGCGGAC ATCTATCACC TCTGGCACTG CCTGGTGGTC CCGCGACTTC CACTGGCACC AGGGCTTGCT CCAGCAGTCG CGGCCGGCCT GCTGGATGCA CGGCTTGCTG CTTCCAGGTA A
|
Protein sequence | MTWLNSAAHA RWLEAETDRL INFAAGSKVP TGFGWLDNNG AVFTDKPTHL WITARMVHSF AVAALMGRPG AATLVDHGIA ALNGVFHDDE FGGWYAEVDG NGPVNDTKSG YQHSFVLLAA ASAVAANRPG ARELLDEALR IADTRFWDHN AGMCFDSWNR EFTETEAYRG GNASMHSVEA YLIVADVTGD NRWLERALHI AEVLIHDFAR NNDYRVFEHF DPEWNPMPEY NTDDRASQFR AYGGTPGHWV EWARLLLHVR AGLEARGMEV PAWLLDDARG LFDAAIRDAW EPDGHPGFVY TVDWEGKPVV TTRIRWVPAE AIGGAAALYI ATGDQKYADW YERIWDHARD WFIDYEHGSW KQELDENGNV TSTVWSGKAD IYHLWHCLVV PRLPLAPGLA PAVAAGLLDA RLAASR
|
| |