Gene Rru_A0943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A0943 
SymbollpxC 
ID3834400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp1124295 
End bp1125338 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content67% 
IMG OID637825031 
ProductUDP-3-O-[3-hydroxymyristoyl] N-acetylglucosamine deacetylase 
Protein accessionYP_426031 
Protein GI83592279 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0774] UDP-3-O-acyl-N-acetylglucosamine deacetylase 
TIGRFAM ID[TIGR00325] UDP-3-0-acyl N-acetylglucosamine deacetylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0149581 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGGTC TCAATTACGA TATCGACACC TCCTTTGAGG ACCGCTCTTC GTCCCGCAGT 
TCCAGCCGCG CCAAAGCCGG CCTTGGCGTG ATCGCCAGCG GCATCGCCCG CCAGCACACC
TTGAAAACCG CCATCAGTTG CACCGGCGTC GGCGTCCATC GCGGCCGCGA GGCCACCTTG
ACCCTGCGTC CGGCCCCCGT CGATACGGGC ATCGTTTTCA ACCGCACCGA CATCCTTGAT
GACAACGCCG CCATCCGCGT CAGCGCCCAG GCGGTGATCG ACGGCCGGCT GTGCACGACC
ATCGCCAACG AGGCCGGCGC CACGGTTTCC ACGGTCGAGC ACCTGATGGC CGCCTTCGCC
GCCCAGGGCA TCGACAATGT GATCGTCGAT GTCAACGGCC CGGAAGTGCC GATCATGGAC
GGCAGCGCCG CCCCCTTCGT TTTCCTCATC GATTGCGCCG GGGTGGTCGA CCAGGCGCTT
CCGCGCAAGG CGATCCGCGT CCGCAAGGCG GTGACCGTGG TCGAAGGCCC GGTTCTCGCC
AGCCTGATGC CGGCCGAGCG CGGCCTGTCG GTTGATTTCG AAATCGATTT CGCCGCCCGC
GCCATCGGTC GCCAGGGCTG CCACGTCGAT CTCACCCCCG ACCTGTTCCG CGCCCATATC
GCCCGCGCCC GCACCTTCGG CCTGCGCTCC GATGTCGATA TGATGCGCGC CGCCGGCCTT
GGCCTGGGCG GCTCGCTGGA AAACGCCGTG GTCGTCGACG ACGATCTGAT CCTCAACGAC
GAGGGCCTGC GCTACGAAGA GGAATTCGTG CGTCACAAGG CGCTGGACGC CATCGGCGAT
CTTTATATGG CCGGGGCGCC GATCATCGGC CGCTATCACG GCGTGCGCTC CAGCCACGCC
CATAACAACA AGCTGGTCCG CGCCCTGCTG GCCGATCCGG CCAATTACAG CCTGGAAACC
GTTGACGAGA CCGATCTCGC CGCCCCGGGC CAGGGCCGTT TCGACGGCTG GCGCGATACC
GACCGCATCG CCGCCACCGC CTGA
 
Protein sequence
MDGLNYDIDT SFEDRSSSRS SSRAKAGLGV IASGIARQHT LKTAISCTGV GVHRGREATL 
TLRPAPVDTG IVFNRTDILD DNAAIRVSAQ AVIDGRLCTT IANEAGATVS TVEHLMAAFA
AQGIDNVIVD VNGPEVPIMD GSAAPFVFLI DCAGVVDQAL PRKAIRVRKA VTVVEGPVLA
SLMPAERGLS VDFEIDFAAR AIGRQGCHVD LTPDLFRAHI ARARTFGLRS DVDMMRAAGL
GLGGSLENAV VVDDDLILND EGLRYEEEFV RHKALDAIGD LYMAGAPIIG RYHGVRSSHA
HNNKLVRALL ADPANYSLET VDETDLAAPG QGRFDGWRDT DRIAATA