Gene Arth_2512 details


Gene Information

Locus tag: Arth_2512
Symbol:
ID: 4444918
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2813736
End bp: 2814761
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 68%
IMG OID: 639690327
Product: thiamine-monophosphate kinase
Protein accession: YP_831991
Protein GI: 116671058
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0611] Thiamine monophosphate kinase
TIGRFAM ID: [TIGR01379] thiamine-monophosphate kinase
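
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon. The variable and function names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2813736
end_bp = 2814761
gene_length_bp = 1026
protein_length_aa = 341

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is read in triplets. The final codon is the stop codon, which is
# assumed not to be counted in the protein length.
codons = gene_length_bp // 3


def check_record():
    """Return True if coordinates, gene length and protein length agree."""
    return (
        span == gene_length_bp
        and gene_length_bp % 3 == 0
        and codons - 1 == protein_length_aa
    )


print(check_record())
```

Under these assumptions the three fields agree: 1026 bp gives 342 codons, one of which is the stop codon.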


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.112077
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCCTGAAT CTCACCTCAC CGTTGACGGA CTTTCCGAAT CCGAGCTCCT CGCCAGGATC 
TTTCCGCGCC TGAACAAAGG TCCAGCCGAG GGCACGGCGC TCCTGCTGGG ACCCGGGGAT
GACGCCGCCA TTGTGGCAGC CCCGGACGGC CGGACCGTGG TCAGCATTGA CACTCAGGTC
CAGGACCAGG ATTTCCGGCT GGTGTGGCGA AACGGGTACC GGACCACCGG CTTCGACGTC
GGCTGGAAGG CCGCGGCGCA GAACCTGAGC GACATCAACG CCATGGGTGC GCGGTCCGTG
TCCATGGTGG TGAGCCTGAC CCTGCCTCCG GAGACGCCGG TTTCCTGGGT TGAGGATTTC
GCGGACGGGC TGTCCCACGC CATCAGCGGC CTTGGCGCCG CTGGATGTTC CGTGGCCGGC
GGGGACCTGG GCCGGGGCCG CGAACTGGCC GTGACCGTGG CCATCCTGGG CACCCTGGAC
GGGCGGGAGC CGGTATTGCG CTCCGGGGCC CGTCCCGGGG ACACCGTCGC GCTGGCCGGA
ACGCTGGGGC TCGCGGCGGC GGGCCTTGCC CTGCTGGAGT CGGCATTGGA TGTTGAACGG
TTAACGCCGG AGCAGCGGAC CATTATGGAC AGGCAATGCC GGCCGCTGCC GCCGCTGGAT
GCCGGGCCGT CCGCACTCGC GGCAGGCGCC TCGGCCATGA TGGATGTTTC CGACGGACTG
ATCCGCGACG GCAACCGCCT GGCCGCCGCC AGCGGCGTGG TCCTGGACCT TGATCCCGAC
GCCCTGAAGC AGCTCGCAGA GCCTTTGGCC GCTGTCGCGG ACGCGGTGGA CGGCGACCCC
ATGGTCTGGG TGCTCGGCGG AGGGGAAGAT CATGGACTTC TTGCCACATT CCCGGCGGAC
GTTCAGCTGC CTCCGGGTTT CGCTGCGATA GGCTCAGTAG AAGCCCTTGC ACCAATGGAA
AGCACTGGCG TGACGATAGC GGGCCGGCCC GCGGACACTG TGGGATGGGA TCACTTTGCA
GACTAA
 
Protein sequence
MPESHLTVDG LSESELLARI FPRLNKGPAE GTALLLGPGD DAAIVAAPDG RTVVSIDTQV 
QDQDFRLVWR NGYRTTGFDV GWKAAAQNLS DINAMGARSV SMVVSLTLPP ETPVSWVEDF
ADGLSHAISG LGAAGCSVAG GDLGRGRELA VTVAILGTLD GREPVLRSGA RPGDTVALAG
TLGLAAAGLA LLESALDVER LTPEQRTIMD RQCRPLPPLD AGPSALAAGA SAMMDVSDGL
IRDGNRLAAA SGVVLDLDPD ALKQLAEPLA AVADAVDGDP MVWVLGGGED HGLLATFPAD
VQLPPGFAAI GSVEALAPME STGVTIAGRP ADTVGWDHFA D
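
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a rough sketch, not the method used to produce the record. It assumes the standard codon assignments (which translation table 11 shares) and treats ATG, GTG and TTG at the first position as start codons read as methionine, which is only a common subset of the starts that table 11 permits. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical.

```python
# Compact encoding of the standard codon assignments, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a common subset of bacterial start codons, read as Met
# when they occur at the first position of the CDS.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a CDS, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence listed above.
first_groups = "GTGCCTGAAT CTCACCTCAC CGTTGACGGA"
print(translate_cds(first_groups))
```

The GTG at the first position is read as methionine here, which is why the protein sequence begins with M although GTG otherwise codes for valine.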