Gene Arth_2684 details

Gene Information

Locus tag: Arth_2684
Symbol:
ID: 4444738
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3007458
End bp: 3008552
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 64%
IMG OID: 639690504
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_832163
Protein GI: 116671230
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
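
The coordinates and lengths listed in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record.
start_bp = 3007458
end_bp = 3008552

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the result agrees with the listed gene length of 1095 bp and protein length of 364 aa.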


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACTTCCT CTTCATCTTC CACCCGTCCC GGAGGTACCG TCCTGGTCAC GGGCGGCGCC 
GGGTTCATCG GCTGTGCGAT CTCCGATGCC CTGGTCAATG AGTTCGACCG CGTGGTCGTC
GTCGACAATC TCCACCCGCA GATTCATGCC ACGGGCCAGC GGCCCGAGCA GCTTAACGCG
GCAGCGGAAC TGGTGGTTGC GGATGTGACG GAAGCGAAGA CCTGGGACAC CGTCCTCCAG
GACGTAACTC CCGACGTCGT TATCCACCTG GCGGCGGAAA CCGGCACCGG CCAGTCTCTG
GAGGAGTCCA CCCGGCACGC GCACGTCAAT GTCGTCGGCA CCTCCCAGCT CCTCGACGGC
CTCAACCGCC ACGGCAAGCT GCCCCGACGG ATCGTCTTGT CCTCCAGCCG TGCCGTGTAT
GGCGAAGGCG CCTGGAAGGA TGCTCACGGC CGGGTCTTTT ACCCCGGTCA GCGGACAAGC
GAAACCCTCG ACAAGGCACA GTGGGATTTC CCGGATGCCT CGCCCGTCGC GATGAAGGCG
TCGGAGACGT TCCCGGCGCC CGTGAGCGTC TACGGTGCCA CGAAGCTCGC CCAGGAAAAT
GTCCTCCAGG CATGGGCGAA GTCCTACGGC GTGGAGACCG TGATCCTCCG CCTGCAGAAT
GTCTATGGTC CGGGCCAGTC CCTGATCAAC CCGTACACCG GCATCATGAG CCTCTTTTGC
CGGATGGCCA TGGGCGGCAA GTCGATACCC CTTTATGAGG ACGGCGAAGT TCGCCGCGAC
TTCATCCTGA TCGACGATGT CGCGTCGGCC ATTGTTGCCG GGGCGGTCTC CACCACCGTC
CAGGCCGAAC CGATGGACAT CGGATCTGGC GAGTTCCAGA CCATCGGCAC CGCTGCAAAG
CTGATCGCCG AACACTACAA AGCTCCTGCG TCGCACGTCA CCGGCCAGTA CAGGCAGGGC
GACGTTCGTC ATGCCTGGGC TGACATCACG GCCGCCGAGA AGGTGCTGGG ATGGACCCCG
AAGTACAACC TTGCCCAGGG AATCGAACGA CTGGCCACGT GGATTGACGC GCAGCCGGAT
GTCAAGCCTG CCTGA
 
Protein sequence
MTSSSSSTRP GGTVLVTGGA GFIGCAISDA LVNEFDRVVV VDNLHPQIHA TGQRPEQLNA 
AAELVVADVT EAKTWDTVLQ DVTPDVVIHL AAETGTGQSL EESTRHAHVN VVGTSQLLDG
LNRHGKLPRR IVLSSSRAVY GEGAWKDAHG RVFYPGQRTS ETLDKAQWDF PDASPVAMKA
SETFPAPVSV YGATKLAQEN VLQAWAKSYG VETVILRLQN VYGPGQSLIN PYTGIMSLFC
RMAMGGKSIP LYEDGEVRRD FILIDDVASA IVAGAVSTTV QAEPMDIGSG EFQTIGTAAK
LIAEHYKAPA SHVTGQYRQG DVRHAWADIT AAEKVLGWTP KYNLAQGIER LATWIDAQPD
VKPA
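
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It builds the codon table in the standard NCBI base order (T, C, A, G), whose amino acid assignments translation table 11 shares with the standard code. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, not part of any database tool, and `translate` assumes its input begins at the start codon of a bacterial CDS, so an alternative start codon such as GTG is reported as methionine.

```python
from itertools import product

# Amino acids for all 64 codons in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a CDS up to the first stop codon.

    Assumption: seq starts at the initiation codon, so the first
    residue is reported as M even for alternative starts like GTG.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    if protein:
        protein[0] = "M"
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_bases = "GTGACTTCCT CTTCATCTTC CACCCGTCCC"
print(translate(first_bases))
```

Applied to the first 30 bases, this yields the first ten residues of the listed protein sequence (MTSSSSSTRP). Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field of the record.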