Gene Arth_2910 details

Gene Information

Locus tag: Arth_2910
Symbol:
ID: 4444432
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3280490
End bp: 3282004
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 72%
IMG OID: 639690733
Product: carbohydrate kinase, YjeF related protein
Protein accession: YP_832389
Protein GI: 116671456
COG category: [G] Carbohydrate transport and metabolism
              [S] Function unknown
COG ID: [COG0062] Uncharacterized conserved protein
        [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
            [TIGR00197] yjeF N-terminal region
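
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAA, which is consistent with that). The function names `span_length` and `coding_codons` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3280490
END_BP = 3282004
GENE_LENGTH_BP = 1515
PROTEIN_LENGTH_AA = 504


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    print("span length matches gene length:",
          span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("codon count matches protein length:",
          coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3282004 - 3280490 + 1 gives 1515 bp, and 1515 / 3 - 1 gives 504 aa, so the reported fields agree with one another.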


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.899807
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCAGCG CCTACACCGG CACCCAGATT CGTGAGGCCG AAAAGCCCCT TCTTCTTTCC 
GGGGCAGGCG CTGTTCTGAT GCAGCGGGCC GCCTACGGCC TGGCCAACGC CGTTGTCCGT
GAACTCGTTG CCCGGGGGAT CCGTCCCTAC GGGGCCAGCG TGGCGGTTCT TACCGGCAAA
GGTAACAACG GGGGAGACGG GCTCTTCGCC GCGGCCTTCC TGGCCGCCCG GGGACTGCGT
ACGACGGCGG TGCTCACCGC CGGTGAGGCC CACCCGGACG GGCTGGCCGC CTTTGAACGG
GCCGGCGGGC GCGCCCGGAC CCTCACCGAC CACAACGCCG GTGAGCTGGC AGCGGCAGCC
GCCAGCGCCG ACGTCGTGAT CGACGCGGTA CTGGGGACCG GAGCCCAAGG CGGGCTCCAC
GGCGCCACGG CATCCCTCAT CGGGAAGCTG CGCGCCGCCG CCCACGGGTT TGTGGTGGCC
TGCGACATTC CCAGCGGCGT GAACGCCGAC ACCGGCGAGG CCTATGATCC GGTCCTTCCG
GCCGACCTCA CCGTGACGTT CGGCGGAGCG AAAGCCGGGC TGCTCGCCGA TCCCGGCGCC
GACCATGCCG GGCGCGTCCT GGTCATCCCC ATCGGCATTG AAACCGAACT GCCGTCGCCG
GTGCTGAGGC GCCTGGAATC CGCCGACCTT GCCCGGTTGC TGCCCCCGCC GACGCGCCGC
TCCCACAAAT ACACCCGGGG CGTCCTGGGC GTCGTGGCGG GATCGCAACA GTATCCGGGA
GCTGCCGTCC TCGCCTGCCG CGGCGCCCTC GCAGCGGGCG CCGGGATGGT CCGGTACCTT
GGCCCGCCCG AGCCGACCCG CCTGGTGCGC CAGGCCTGCC CGGAGGTGGT GTGCGGACCG
GACAACGTGG CAGACGCGCA CGTCCAGGCG TGGCTGGTCG GTTCAGGGAT CGCCGAAGGC
GACCGCGAAC AGCTGCAGCG GGTCCGCGAC GCCGTGGAGA CGGGACTGCC AGTGGCCGCC
GACGCCGGTG CGCTGCCTGC CCTTCCTGAT GCCCTGCCTC CGCACGTGGT GCTGACACCG
CACGGCGGCG AGCTGGCGCG CGTCCTGCAG CGGTACGGGA TCGACCTGGG CCGGCAGGGA
GTTGACGGTG CCACCCTCGA CGCCGTGCGC CAGGCGGCTG AAAGGACCGG AGCCACAGTC
CTGCTCAAGG GCGCCACCAC GCTGGTGGCT GCGCCGTACG GCCCCGTTTT CAGCCAGGCT
GAAGCCACGC CGTGGATGGC AACTGCCGGA AGCGGTGACG TGCTGGCCGG GGTGCTCGGG
TCCCTGCTGG CCCAGCATTC GGATGACGAG GAAAGATTTG CGGCCTGCGG GATCTCCGCG
GACCAGCGCT GGGCGGCCAT CGGCGCCATG GCGGCGAGCC TGCACGGCCG CGCAGGGACC
CTTGCCTCAG CCGGCGGCCC CGTGACCGCC GGTGCCATCG CTCAATCCCT GCCCGAGGTG
ATGCGGACCT TGTAA
 
Protein sequence
MISAYTGTQI REAEKPLLLS GAGAVLMQRA AYGLANAVVR ELVARGIRPY GASVAVLTGK 
GNNGGDGLFA AAFLAARGLR TTAVLTAGEA HPDGLAAFER AGGRARTLTD HNAGELAAAA
ASADVVIDAV LGTGAQGGLH GATASLIGKL RAAAHGFVVA CDIPSGVNAD TGEAYDPVLP
ADLTVTFGGA KAGLLADPGA DHAGRVLVIP IGIETELPSP VLRRLESADL ARLLPPPTRR
SHKYTRGVLG VVAGSQQYPG AAVLACRGAL AAGAGMVRYL GPPEPTRLVR QACPEVVCGP
DNVADAHVQA WLVGSGIAEG DREQLQRVRD AVETGLPVAA DAGALPALPD ALPPHVVLTP
HGGELARVLQ RYGIDLGRQG VDGATLDAVR QAAERTGATV LLKGATTLVA APYGPVFSQA
EATPWMATAG SGDVLAGVLG SLLAQHSDDE ERFAACGISA DQRWAAIGAM AASLHGRAGT
LASAGGPVTA GAIAQSLPEV MRTL
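
The sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-fraction calculation, using only the Python standard library. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the final line of the gene sequence, not the full gene, so its GC fraction is not expected to match the gene-wide value reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # Final line of the gene sequence, copied from this record.
    last_line = "ATGCGGACCT TGTAA"
    seq = clean_sequence(last_line)
    print("bases:", len(seq))
    print("ends with stop codon TAA:", seq.endswith("TAA"))
    print("GC fraction of this fragment:", round(gc_fraction(last_line), 3))
```

To check the reported GC content, the same functions could be applied to the complete gene sequence pasted in as one multi-line string.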