Gene Arth_1942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1942 
Symbol 
ID4445526 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2191412 
End bp2192662 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content65% 
IMG OID639689752 
ProductN-acylglucosamine 2-epimerase 
Protein accessionYP_831424 
Protein GI116670491 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2942] N-acyl-D-glucosamine 2-epimerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGTGGC TGAACAGCGC CGCTCACGCG CGATGGCTCG AGGCCGAAAC CGATCGGCTG 
ATCAATTTCG CCGCGGGGTC AAAGGTGCCC ACCGGATTCG GCTGGCTCGA CAACAACGGT
GCTGTGTTCA CCGACAAGCC GACCCACCTC TGGATTACCG CGCGGATGGT TCATAGTTTC
GCGGTCGCTG CCCTGATGGG GCGGCCAGGC GCTGCGACAC TCGTCGACCA CGGCATCGCC
GCCCTCAACG GAGTTTTCCA CGATGATGAG TTCGGCGGCT GGTATGCCGA GGTAGACGGG
AACGGACCGG TAAACGACAC CAAGTCCGGT TACCAGCACT CATTCGTGCT CCTCGCCGCT
GCCAGCGCCG TTGCGGCCAA CCGCCCGGGC GCCAGGGAAT TGCTCGACGA GGCACTGCGA
ATCGCTGACA CGAGGTTCTG GGACCACAAC GCCGGCATGT GCTTTGACTC GTGGAACCGG
GAATTTACGG AAACGGAAGC ATACCGTGGC GGGAACGCGA GCATGCACTC CGTCGAGGCG
TACCTCATCG TCGCCGATGT GACCGGGGAC AACCGCTGGC TCGAACGGGC GCTCCACATC
GCCGAGGTGC TCATCCACGA CTTTGCTCGC AACAACGACT ACCGGGTCTT CGAGCACTTC
GACCCGGAGT GGAACCCCAT GCCGGAGTAC AACACCGATG ACCGGGCCAG CCAGTTCCGC
GCGTACGGCG GAACGCCCGG CCACTGGGTT GAGTGGGCAC GCCTGCTCCT GCACGTCCGC
GCCGGGCTCG AAGCCCGCGG CATGGAGGTC CCGGCCTGGC TGCTGGACGA CGCACGGGGC
CTGTTCGACG CCGCCATCCG CGACGCCTGG GAACCGGACG GTCATCCGGG CTTTGTTTAC
ACCGTCGACT GGGAAGGCAA GCCGGTGGTC ACCACCCGCA TTCGCTGGGT TCCCGCCGAG
GCAATCGGGG GCGCGGCAGC ACTCTACATC GCCACCGGCG ACCAGAAGTA CGCGGACTGG
TACGAGCGAA TCTGGGACCA CGCCCGCGAC TGGTTTATCG ACTACGAGCA CGGGTCCTGG
AAGCAGGAAC TCGACGAGAA CGGCAATGTC ACCTCCACCG TGTGGTCCGG CAAGGCGGAC
ATCTATCACC TCTGGCACTG CCTGGTGGTC CCGCGACTTC CACTGGCACC AGGGCTTGCT
CCAGCAGTCG CGGCCGGCCT GCTGGATGCA CGGCTTGCTG CTTCCAGGTA A
 
Protein sequence
MTWLNSAAHA RWLEAETDRL INFAAGSKVP TGFGWLDNNG AVFTDKPTHL WITARMVHSF 
AVAALMGRPG AATLVDHGIA ALNGVFHDDE FGGWYAEVDG NGPVNDTKSG YQHSFVLLAA
ASAVAANRPG ARELLDEALR IADTRFWDHN AGMCFDSWNR EFTETEAYRG GNASMHSVEA
YLIVADVTGD NRWLERALHI AEVLIHDFAR NNDYRVFEHF DPEWNPMPEY NTDDRASQFR
AYGGTPGHWV EWARLLLHVR AGLEARGMEV PAWLLDDARG LFDAAIRDAW EPDGHPGFVY
TVDWEGKPVV TTRIRWVPAE AIGGAAALYI ATGDQKYADW YERIWDHARD WFIDYEHGSW
KQELDENGNV TSTVWSGKAD IYHLWHCLVV PRLPLAPGLA PAVAAGLLDA RLAASR