Gene Arth_1548 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1548 
Symbol 
ID4445915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1725726 
End bp1726817 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content70% 
IMG OID639689363 
Productglucokinase 
Protein accessionYP_831042 
Protein GI116670109 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID[TIGR00744] ROK family protein (putative glucokinase) 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0569057 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACACAC CTTCGCCCGG CGACCTGCCC AGACCCTACC GACGGACGGC CGCGTGGCGG 
CGCAGGAAGC CTTCCGGGCA GGATGCGCTG GCCGCCAGCC AGGGCTTCCG GGAGCACCTC
CGGCTCGGGC GCAAAGGGTT GGCGATCGGT GTGGACATCG GAGGCACCAA GGTCGCGGCC
GGTGTGGTGG ACGCCGACGG CCGGATCCTC AGCCAGGCCA GGCGCTCCAC ACCCGGGAAC
GACCCCCGGG CCGTGGAACA GGTGATCGTC GAACTCGTGG AGGAGCTGAG CCGCGGCCAC
CGCATCTGGT CGGTGGGAAT CGGCGCGGCC GGGTGGATGG ACCTCGACGG CGGCACGGTG
CTGTTCAGCC CGCACCTCGC GTGGCGCAAC GAACCGTTGC GGGACAACCT CCAGCGACTG
CTGCGCCGTC CGGTCCTGCT GACCAACGAC GCCGATGCCG CCGCGTGGGC GGAATGGCGC
TTTGGTGCCG GGCAGGGGCA AAGCAGGCTC GTCTGCATCA CCCTGGGCAC GGGGATCGGC
GGTGCCATGG TGATGGACGG CCGGCTGGAA CGCGGCCGCT TTGGGGTTGC GGGAGAATTC
GGCCACCAGA TCATCATGCC CGGCGGGCAC CGTTGCGAAT GCGGAAACCG CGGCTGCTGG
GAACAGTATG CGTCTGGAAA CGCGCTGGGC CGCGAAGCCC GCGAACTGGC TGCTGCCAAT
TCGCCCGTGG CGCAGGAACT CCTGAAGGCC GTGGACGGTC AGGTGGACCG CATCACCGGG
GCCATCGTGA CGGAACTGGC CAAGGCGGGG GACCCCACGT CCCGGGAACT GCTGGAGGAC
GTTGGGGAGT GGCTGGGCCT GGGGCTGGCC AACCTGGCTG CCGCGCTGGA CCCTGGAAAG
TTCGTCATCG GTGGAGGCCT GTGCGATGCG GGTGAACTGC TGGTGGCGCC GGCACGGAAG
GCCTTTGCGC GGAACCTGAC AGGGCGCGGA TTCCGGCCGG CTGCCGAAAT CGCGCTCGCA
GCCCTGGGCC CCAACGCCGG GCTGATCGGG GCCGCAGACC TGTCCCGTGT CAGCAGCAGG
ATGCACGGCT GA
 
Protein sequence
MHTPSPGDLP RPYRRTAAWR RRKPSGQDAL AASQGFREHL RLGRKGLAIG VDIGGTKVAA 
GVVDADGRIL SQARRSTPGN DPRAVEQVIV ELVEELSRGH RIWSVGIGAA GWMDLDGGTV
LFSPHLAWRN EPLRDNLQRL LRRPVLLTND ADAAAWAEWR FGAGQGQSRL VCITLGTGIG
GAMVMDGRLE RGRFGVAGEF GHQIIMPGGH RCECGNRGCW EQYASGNALG REARELAAAN
SPVAQELLKA VDGQVDRITG AIVTELAKAG DPTSRELLED VGEWLGLGLA NLAAALDPGK
FVIGGGLCDA GELLVAPARK AFARNLTGRG FRPAAEIALA ALGPNAGLIG AADLSRVSSR
MHG