Gene Arth_2094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2094 
Symbol 
ID4445363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2361336 
End bp2362898 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content65% 
IMG OID639689902 
Productglucose-6-phosphate 1-dehydrogenase 
Protein accessionYP_831574 
Protein GI116670641 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0364] Glucose-6-phosphate 1-dehydrogenase 
TIGRFAM ID[TIGR00871] glucose-6-phosphate 1-dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.175031 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGAAA CTGAAAACGG CAGCAGGAAG TCAGCCGGGC GGGGCCGCAA CCCGTTGCGG 
GATCCCCGGG ACCGCCGGCT AAACCGCATC GCCGGCCCCT CCTCGCTGGT CCTCTTTGGA
GTGACCGGCG ACCTTGCCCG CAAGAAGCTC ATGCCTGCCG TCTACGACCT GGCCAACCGG
GGCCTGCTGC CGCCGAGCTT TGCCCTGGTG GGCTTCGCCC GGCGGCAGTG GGAAAACGAG
GATTTCGCCG CCGAGGTCAA GGAGTCCGTC AAGGCCTACG CCCGGACCCC GTTTGACGAG
GCAGTGTGGA ATCAGCTGTC CGAGGGCATC CGTTTTGTCC AGGGAGAGTT CGACGACGAC
GACGCCTTTG AGCGGCTCGG TGACACGATC GATGAACTGG ACGAGCAGCG GGGTACGCGC
GGGAACCACG CGTTTTACCT GTCGATTCCG CCGAAGGCGT TTGAACAGGT CTGCCGCCAG
CTGTCGAAGC ACGGCCTGGC CCAGGCTGAG GGTGACAAGT GGCGCCGTGT GGTCATCGAG
AAGCCGTTCG GGCACGACCT CGAGTCCGCC CGGCAGCTTA ATGACATCGT GGAGTCGGTG
TTCCCGCCGG ATGCTGTGTT CCGGATCGAC CACTACCTGG GCAAGGAAAC GGTCCAGAAC
ATCCTGGCGC TGCGTTTCGC GAACCAGTTG TTCGAACCGC TCTGGAACGC GAACTATGTC
GACCACGTCC AGATCACCAT GGCCGAGGAC ATCGGGACCG GCGGCCGGGC AGGCTACTAC
GACGGCGTGG GCGCGGCCCG CGACGTCATC CAGAATCACC TGCTGCAGCT CCTGGCGCTG
ACGGCCATGG AGGAACCCAT TTCCTTCAAC GCCGATGACC TCCGTGCCGA AAAGGAAAAG
GTTCTGGCTG CGGTAAAGCT CCCGGACGAT CTCTCCACCC ATTCAGCGCG CGGTCAGTTC
GCCGGCGGCT GGCAGGGCGG CGAGCAGGTT CTTGGGTACC TGGAGGAGGA AGGCATCCCT
GCCGACTCCA CCACGGAAAC CTACGCCGCC ATCCGGGTGG ATATCCACAC CCGGCGCTGG
TCCGGTGTGC CGTTCTACCT GCGTGCCGGC AAGCGGCTGG GACGCCGCGT GACGGAAATC
GCAGTTGTCT TCAAGCGCGC CCCCAACCTG TTGTTCCGCG ATCACGGCGA GGACGACTTC
GGCCAGAACG CCGTGGTGAT CCGCGTGCAG CCTGACGAGG GCGCCACGAT CCGCTTCGGT
TCGAAGGTTC CGGGGACGCA GATGGAAGTC CGCGACGTCA CGATGGACTT CGGCTACGGG
CACTCGTTCA CCGAGTCCAG CCCCGAAGCC TATGAGCGGC TCATCCTGGA CGTCCTGCTG
GGCGAACCGC CGCTGTTCCC GCGGCACCAG GAGGTGGAAC TGTCCTGGAA GATCCTGGAT
CCCTTTGAAG AGTACTGGGC AAGTTTGGGC GAACAACCTG AGCCCTACGC CCCCGGAAGC
TGGGGCCCCG CCTCGGCGGA CGAGCTGCTT GCCCGCGATG GACGAACCTG GAGAAGGCCA
TGA
 
Protein sequence
MPETENGSRK SAGRGRNPLR DPRDRRLNRI AGPSSLVLFG VTGDLARKKL MPAVYDLANR 
GLLPPSFALV GFARRQWENE DFAAEVKESV KAYARTPFDE AVWNQLSEGI RFVQGEFDDD
DAFERLGDTI DELDEQRGTR GNHAFYLSIP PKAFEQVCRQ LSKHGLAQAE GDKWRRVVIE
KPFGHDLESA RQLNDIVESV FPPDAVFRID HYLGKETVQN ILALRFANQL FEPLWNANYV
DHVQITMAED IGTGGRAGYY DGVGAARDVI QNHLLQLLAL TAMEEPISFN ADDLRAEKEK
VLAAVKLPDD LSTHSARGQF AGGWQGGEQV LGYLEEEGIP ADSTTETYAA IRVDIHTRRW
SGVPFYLRAG KRLGRRVTEI AVVFKRAPNL LFRDHGEDDF GQNAVVIRVQ PDEGATIRFG
SKVPGTQMEV RDVTMDFGYG HSFTESSPEA YERLILDVLL GEPPLFPRHQ EVELSWKILD
PFEEYWASLG EQPEPYAPGS WGPASADELL ARDGRTWRRP