Gene Arth_3211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3211 
Symbol 
ID4444201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3619123 
End bp3620094 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content62% 
IMG OID639691035 
ProductUDP-glucose pyrophosphorylase 
Protein accessionYP_832687 
Protein GI116671754 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1210] UDP-glucose pyrophosphorylase 
TIGRFAM ID[TIGR01099] UTP-glucose-1-phosphate uridylyltransferase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.464773 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCAGGC CTCCCCATGA GCCGCGTAAT CTTGGCCGTG AATTGCCAGT TTCCTCAAAC 
GATTTGTATG GTTCAGCTAT GACTTTGGGG AAATCAGTAA GAAAAGCCGT CATTCCTGCT
GCCGGTTTGG GAACTCGCTT CCTGCCCGCC ACCAAGGCGA TGCCGAAGGA AATGTTGCCG
GTTGTTGACC AGCCCGCAAT CCAGTACGTG GTGGAGGAAG CCGTCAAGGC AGGGCTGACG
GACCTCCTGA TGATCACCGG ACGCCAGAAG CGGGCCCTGG AGGACCACTT TGACCGGGCA
CCTGCCCTGG AGCGGACCTT GGAGCTTAAG GGCGACCTGG ACCGGCTGGA GGCTGTCCAG
CACGCCTCCA GCCTCGCTCC GCTGCACTAC CTGCGCCAGG GAGATCCCAA GGGTCTGGGC
CACGCGGTGC TGTGCGCGCG CCAGCACGTG GGGGACGAGC CGTTCGCCGT CCTGCTTGGT
GACGACCTCA TCGACGAACG GGATGAGCTG CTGAGCACCA TGATCGACGT GCAGGCCAAG
ACCGGAGGCT CCGTCATCGC ACTGATCGAA GTGGACCCGT CCCAGATCAG CGCCTACGGC
TGCGCGGACA TCACGCCCGT GGACGGCGAG AACTATTTTC AGGTGAACCG CCTGGTGGAA
AAGCCCTCTG TAGACGAAGC CCCCTCCAAC CTGGCAGTCA TCGGCCGTTA CGTGCTGCAC
CCGGCCGTGT TTGATGTGCT GGAAGAAACC GAGCCGGGCC GCGGCGGTGA GATCCAGCTG
ACGGACGCCC TGCAGACCCT GGCAACGTCT GACGGCGAAG GCGGGGGCGT TTATGGCGTG
GTGTTCCGCG GGCGCCGCTA CGACACCGGA GACAAGCTCA GCTACATCAA GGCGGTTATT
TCCATCGCCT CGGAGCGCGT CGACTTCGGC GAGGACCTCA AGGCCTGGAT GAAGGAATTC
GTGAACGACT AA
 
Protein sequence
MGRPPHEPRN LGRELPVSSN DLYGSAMTLG KSVRKAVIPA AGLGTRFLPA TKAMPKEMLP 
VVDQPAIQYV VEEAVKAGLT DLLMITGRQK RALEDHFDRA PALERTLELK GDLDRLEAVQ
HASSLAPLHY LRQGDPKGLG HAVLCARQHV GDEPFAVLLG DDLIDERDEL LSTMIDVQAK
TGGSVIALIE VDPSQISAYG CADITPVDGE NYFQVNRLVE KPSVDEAPSN LAVIGRYVLH
PAVFDVLEET EPGRGGEIQL TDALQTLATS DGEGGGVYGV VFRGRRYDTG DKLSYIKAVI
SIASERVDFG EDLKAWMKEF VND