Gene Arth_4059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4059 
Symbol 
ID4447790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4581726 
End bp4582721 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content67% 
IMG OID639691890 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_833534 
Protein GI116672601 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATGC TCCAGGGGGC GCACGTCCTG GTCACCGGCG GAGCCGGCAC TATCGGATCC 
ACTATTGTCG ACCACCTGGT CACCGCCGGC GTTGAACGGA TAACCGTCCT GGACAACCTG
GTCCGGGGCC GCCGGGCCAA CCTGGACGAC GCGGTGGCCA CCGGCAGGGT GGAACTCGTC
GAAGGGGACC TGCGCGACCG CGACCTCGTC CACGACCTCA CCCGCGGCAA GGACATCGTC
TTCCATCAGG CGGCCATCAG GATCACCCAG TGCGCCGAGG AGCCGCGGCT CGCGCTCGAA
GTGCTGGTGG ACGGCACGTT CAACGTCTTC GAGGCGGCGG CCGAACACGG TGTGGGCAAG
CTGGTGGCGG CATCCAGCGC GTCGGTTTAC GGCATGGCGG AGGAATTTCC CACCAGCGAA
CGCCACCACC ACCACAACAA CGACACGTTC TACGGCGCGG CGAAGTCCTT CAACGAGGGA
ATGGCCCGCA GCTTCCGTGC GATGACCGGC CTGGACTACG TCCTCCTGCG CTACTTCAAC
GTCTACGGGC CGCGGATGGA TGTGCACGGC CTCTACACAG AGGTCCTGGT GCGCTGGATG
GAGCGCATCG CGGACGGGCA GCCGCCGCTG ATCTTTGGTG ACGGACGGCA GACCATGGAT
TTCATCCACA CCCGTGACGT TGCCCGGGCC AACATCCTGG CCGCCGGAAG CGGCGCGCGC
GAGGGGGTCT ACAACGTGGC CAGCGGGGAA GAAACAAGCC TCCTGCAACT CGCCGAGGCG
CTATTGCGGG CCATGGATTC CGAACTGCAC GTGGAACACG GACCCGACCG CGCCATCAAC
GGCGTTGTCC GCCGCCTCGC GGATACTTCC GCGGCCCGGC TTGACCTTGG CTTCGCGGCC
GAAACCGGAC TTGAGGACGG GCTCCGCGAA CTCGTGGACT GGTGGCGTCC GCTTCGCGGC
GAAATTGCCG CCGCCCGGGT CGGAGGCGTG CGGTGA
 
Protein sequence
MSMLQGAHVL VTGGAGTIGS TIVDHLVTAG VERITVLDNL VRGRRANLDD AVATGRVELV 
EGDLRDRDLV HDLTRGKDIV FHQAAIRITQ CAEEPRLALE VLVDGTFNVF EAAAEHGVGK
LVAASSASVY GMAEEFPTSE RHHHHNNDTF YGAAKSFNEG MARSFRAMTG LDYVLLRYFN
VYGPRMDVHG LYTEVLVRWM ERIADGQPPL IFGDGRQTMD FIHTRDVARA NILAAGSGAR
EGVYNVASGE ETSLLQLAEA LLRAMDSELH VEHGPDRAIN GVVRRLADTS AARLDLGFAA
ETGLEDGLRE LVDWWRPLRG EIAAARVGGV R