Gene Arth_1161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1161 
Symbol 
ID4446327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1260331 
End bp1261536 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content69% 
IMG OID639688968 
Productgalactokinase 
Protein accessionYP_830655 
Protein GI116669722 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.415843 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGCCC CAACAGGCAG CAACACTCCT GGCAACAGCA ACCCCCGCGA CAGCAAGCTG 
GGAAACAAGG ACCTCGAAGC GCGGTTCCGG GACGCCTTCG GTTCCGTCCC CGACGGCGTC
TGGCAGGCGC CGGGCCGCGT GAACCTGATT GGCGAGCACA CCGACTACAA CGACGGCTTC
GTGCTCCCGT TCGCTATCGA CCGGACAGCC CGGGTGGCTG TCCGGGTGCG CCCCGACTCC
ACGCTCCGGC TGCTGTCCAC CTACGGAGAC CAGGGCATGA CCACGGCGGA CACCTCGTCA
CTGGACGGCT CGCGCGCCAA GGGCTGGACC AAGTATCCGC TCGGCGTCAT CTGGGCGCTC
CAGGAGCGCG GGGTCGCGGT GCCCGGCCTT GACCTGCTGC TGGACTCCAA CGTTCCCCTC
GGCGCCGGAC TCTCCTCGTC CCACGCCATT GAGTGCGCCG TCATCTCCGC CCTCAACGAG
CTGACCGGCG CAGGGCTTAC AGCCGAGGAA ATGGTCCTGG CCACCCAGCG CGCCGAGAAT
GACTTTGTCG GAGCGCCCAC GGGCATCATG GACCAGTCCG CCTCGCTGCG CGGCGCGAGG
GGCCATGCCG TCTTTTTGGA CTGCCGGGAC CAGAGTGCCC GCCTGGTGCC GTTCGAGACG
GAGCCGGCAG GCCTGGTGCT GCTGGTGATC GACACCAAGG TTTCCCACTC CCACGCCGAC
GGCGGCTACG CGTCGCGCCG CGCCTCGTGC GAGCTCGGGG CCGAAGTGCT GGGCGTCAAA
GCGCTGCGGG ACGTCGGGGT GAAGGACCTG GACGAAGCCA GCGGTCTCCT GGATGAGGTC
ACGTTCCGCC GGGTCCGGCA CGTCGTCACC GAGAACGACC GCGTCCTGCA GACCGTGGAA
CTCCTCGGCT CAGCCGGACC GGGCGCCATC GGGCCACTCC TCGATGCCAG TCACCTCTCC
ATGCGTGACG ACTTCGAGAT TTCGTGCCCC GAACTGGACC TTGCCGTCGA CACCTCCCGG
GCCAGCGGAG CCATCGGCGC CCGCATGACA GGCGGCGGTT TTGGCGGCGC AGCTATCGCC
CTCACACCGG TGGCCGCCGA GCAGCAGGTA CGCACCGCCG TCGAGCAGGC TTTCTCCCGC
GCGGGGTTCA GGAAGCCGGA CATCTTCACG GTTACCCCTG CGGCCGGGGC GATGCGGGTC
GTATAG
 
Protein sequence
MTAPTGSNTP GNSNPRDSKL GNKDLEARFR DAFGSVPDGV WQAPGRVNLI GEHTDYNDGF 
VLPFAIDRTA RVAVRVRPDS TLRLLSTYGD QGMTTADTSS LDGSRAKGWT KYPLGVIWAL
QERGVAVPGL DLLLDSNVPL GAGLSSSHAI ECAVISALNE LTGAGLTAEE MVLATQRAEN
DFVGAPTGIM DQSASLRGAR GHAVFLDCRD QSARLVPFET EPAGLVLLVI DTKVSHSHAD
GGYASRRASC ELGAEVLGVK ALRDVGVKDL DEASGLLDEV TFRRVRHVVT ENDRVLQTVE
LLGSAGPGAI GPLLDASHLS MRDDFEISCP ELDLAVDTSR ASGAIGARMT GGGFGGAAIA
LTPVAAEQQV RTAVEQAFSR AGFRKPDIFT VTPAAGAMRV V