Gene Arth_3304 details


Gene Information

Locus tag: Arth_3304
Symbol:
ID: 4443998
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3708610
End bp: 3710109
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 70%
IMG OID: 639691128
Product: carbohydrate kinase, FGGY
Protein accession: YP_832780
Protein GI: 116671847
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1070] Sugar (pentulose and hexulose) kinases
TIGRFAM ID:
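
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. The record does not state either convention, and the function names are hypothetical, chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the gene length
    includes one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(3708610, 3710109)
print(length)                  # 1500
print(protein_length(length))  # 499
```

Both results match the Gene Length and Protein Length fields of the record.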


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCCCAG CACCGGACGG TGCATCAGCG GACACTGTCC CGGCCAACGC CGGGGCAGTG 
GCCGCCGGCA GTGTTTTCGC GGCCGTCGAC ATTGGCGCCT CTTCCGGACG GGTCATCCTC
GGCCGTGTCT CCGGCGTCGC CGGTTCGGAA AGTGCCACGC TGGAGACGGT CCACCGCTTC
CCGAACGGTG TGGTGGAGTC CGACGACGGT CTACGCTGGG ATTTCGACGC CCTCTTCGCC
GAGGTACTCA CCGGCCTCGC TGCCGCGGCC CGCGTCGCCG GGGAGCGGGG CGAGGCCATC
AGCAGCATCG GGATCGACAC CTGGGCGGTG GACTACGGCC TGGTGAACGC TGCCGGGGAG
CTCATTGCGC AGCCCTTCAG CTACCGCGAT GACCGCAGCC GCGCCGCCGT CGCCCGGGTC
CACCGGAAAC TGGACCCGGC CCGGCTCTAC GCCACCACCG GGCTGCAGTT CCTGCAGTTC
AACACCCTCT ACCAGCTGGC CAGCGAACCG GACCTGGACG GTCTGCAGGC GCTGCTCATC
CCGGACCTGA TCGCGTTCCT GCTCACGGGT CAGCGCCGCA CCGAGGCCAC CAACGCTTCC
ACTACGGGGC TCTTTGATGC CGTCGCGGGG GAGTGGGCCA CCGAATTCCT TACGGCCCTC
GGGCTCCCGA AGAACCTGTT CCCGCCGCTA ATCCAGCCCG GCGAAACCGT CGGTACCCTG
CTGCCCGGCC TCGCCGCCCG CACCGGGCTG CACCAGGCCA CGAAGGTGGT GGCCGTCGGC
TCGCACGACA CCGCCTCCGC CGTCGCCGCC GTGCCCGCCG AACACGGGAA CTTCGCCTAC
ATCTCTTCAG GGACCTGGTC TCTGGTGGGC GTCGAACTCC GGAAGCCGGT GCTCACCGAG
GCGAGCCGGC AGGCCAACTT CACCAACGAA CGCGGCGTGG ACGGCACCGT CCGCTACCTC
CGCAACGTCG GCGGGCTCTG GCTGCTCAGC GAATGCCAGC GCACCTGGGC GCAGCAGGGA
TATACGGCGA CGCTGGACGA CCTGCTGGCC GGCGCCGCCG CGCTGCCTTT CGGCGGACCC
CAGATCAACG CCGACGATCC CTACTTCATC GCCCCGGACA ACATGCCCGA ACGCATCCAG
GCCGCCGTCC GCAACACCGG CGACGTCCTC ACCGGCAACC CCGCGGCGAT CACCCGCTGC
ATTCTGGACA GCCTCGCGGC CGGCTACGCC CGGACCATCG CCGACGCCGA ACGCCTGGCG
GACGTGCCCG TCGACGTGGT CCACATCGTA GGCGGCGGCT CGCAAAACCG GCTCCTCTGC
CAGCTCACCG CCGACGCCAC CGGCAAGCGC GTCATCGCGG GACCGGTCGA GGCCACCGCC
TTGGGCAACG TCCTGATCCA GGCACGGGCG GCCGGTGTGG TGTCCGGAGG CCTGGCTGAC
CTGAGGGCAC TGGTGCGCGG CTCGCAGCCA TTGGAAAACT ACCAGGCGGC GCTGGTCTGA
 
Protein sequence
MPPAPDGASA DTVPANAGAV AAGSVFAAVD IGASSGRVIL GRVSGVAGSE SATLETVHRF 
PNGVVESDDG LRWDFDALFA EVLTGLAAAA RVAGERGEAI SSIGIDTWAV DYGLVNAAGE
LIAQPFSYRD DRSRAAVARV HRKLDPARLY ATTGLQFLQF NTLYQLASEP DLDGLQALLI
PDLIAFLLTG QRRTEATNAS TTGLFDAVAG EWATEFLTAL GLPKNLFPPL IQPGETVGTL
LPGLAARTGL HQATKVVAVG SHDTASAVAA VPAEHGNFAY ISSGTWSLVG VELRKPVLTE
ASRQANFTNE RGVDGTVRYL RNVGGLWLLS ECQRTWAQQG YTATLDDLLA GAAALPFGGP
QINADDPYFI APDNMPERIQ AAVRNTGDVL TGNPAAITRC ILDSLAAGYA RTIADAERLA
DVPVDVVHIV GGGSQNRLLC QLTADATGKR VIAGPVEATA LGNVLIQARA AGVVSGGLAD
LRALVRGSQP LENYQAALV