Gene RPC_4203 details


Gene Information

Locus tag: RPC_4203
Symbol:
ID: 3972658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4673547
End bp: 4674680
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 69%
IMG OID: 637927305
Product: UDP-N-acetylglucosamine 2-epimerase
Protein accession: YP_534046
Protein GI: 90425676
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0381] UDP-N-acetylglucosamine 2-epimerase
TIGRFAM ID: [TIGR00236] UDP-N-acetylglucosamine 2-epimerase
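
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 4673547
END_BP = 4674680
GENE_LENGTH_BP = 1134
PROTEIN_LENGTH_AA = 377


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def coding_residues(gene_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_residues = coding_residues(computed_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_length == GENE_LENGTH_BP, computed_residues == PROTEIN_LENGTH_AA)
# prints: True True
```

Under these assumptions, 4674680 - 4673547 + 1 = 1134 bp, and 1134 / 3 - 1 = 377 residues, which agrees with the listed gene and protein lengths.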


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.360025
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.194888
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTAACA CCAAGAAGAT CATGGTGGTG TTCGGTACGC GGCCCGAGGC CATCAAGCTC 
GCGCCGGTGA TCGCGGCGCT GCAAGCCCGC CCGCGTCAGT TCGACACCGT GGTCTGCTCC
TCCGGGCAGC ATCGCGAAAT GTTGCTGCAG ACGCTGGAGA CGTTCGGCAT TCAGCCGCAG
ATCAACCTCG ACGTCATGCG CCCCGACCAG ACCCTGCCGG ATCTGACCGC GCTGCTGATT
TCACGGCTGA CCGCGACGCT TGCTGCGGAG CGGCCGGATC GCGTCATCGT CCAGGGCGAC
ACCACCACGG CGTTCGCCGC CGCGTTGGCC GGCTATTATG CCCGGGTGCC CGTCGCGCAT
GTCGAGGCCG GGCTGCGCTC GCACGACCGG CACAATCCGT TTCCGGAAGA GATCAACCGC
CGGCTGATCA GCGCGATCGC CGATCTGCAT TTTGCGCCGA CGCAAGGCGC CGCGGACGCG
CTGCGCGCCG AGGCGATCGC CCAGGCCACG ATCCATGTCA CCGGCAACAC CATCGTCGAT
GCCTTGCTGG CGCTGCGCGG CCGGCTCGAG ACGCCGGACG GCCTCGCCCT TGTGCCGGCC
GCGATCCGCG GCTTCGGCAC CGATGGGCAA CCGCTGATCC TGGTGACCTG CCATCGTCGC
GAGAGCTTCG GCGACGATCT CGCGGCGATC TGCCGCGCGC TGAAGCGCAT CGCGCTCGGC
CACCCCGATC ATCGCATCGT CTTTCCGGTG CACCTCAACC CCAACGTCCG CGCCCAGGTG
ATGCCGCTGC TCGGCGACAC GCCGAACATC GCGCTGCTGG AGCCGGTGAG CTACCCGTCG
CTGGTCTATC TGCTGTCGCG CGCGGTGCTG GTGCTGTCGG ATTCCGGCGG CATTCAAGAG
GAGGCGCCGA GCTTCGGCGT GCCGATCCTG GTGCTGCGCT GGAAGACCGA ACGGCCGGAG
GGCGTCGCCG CCGGCGTCGC GCAGCTGGTC GGCGCCGACG AGGAGCTGAT CGTCGACCGC
GCCGGCGCCT TGCTGGCGCA AGCCGGCGAT CGCGGCGCGG CGGTGATCGC CAATCCCTAT
GGCGATGGCC GCGCCAGCGA ACGGATCGCC GACATCCTGG CGAGCGCGTC ATGA
 
Protein sequence
MTNTKKIMVV FGTRPEAIKL APVIAALQAR PRQFDTVVCS SGQHREMLLQ TLETFGIQPQ 
INLDVMRPDQ TLPDLTALLI SRLTATLAAE RPDRVIVQGD TTTAFAAALA GYYARVPVAH
VEAGLRSHDR HNPFPEEINR RLISAIADLH FAPTQGAADA LRAEAIAQAT IHVTGNTIVD
ALLALRGRLE TPDGLALVPA AIRGFGTDGQ PLILVTCHRR ESFGDDLAAI CRALKRIALG
HPDHRIVFPV HLNPNVRAQV MPLLGDTPNI ALLEPVSYPS LVYLLSRAVL VLSDSGGIQE
EAPSFGVPIL VLRWKTERPE GVAAGVAQLV GADEELIVDR AGALLAQAGD RGAAVIANPY
GDGRASERIA DILASAS
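
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block could be cleaned, translated, and checked for GC fraction is given below. It uses only the standard library; the helper names are hypothetical, and the codon table is the standard one, which translation table 11 shares for codon-to-amino-acid assignments (table 11 differs only in the permitted start codons). The example input is the first line of the gene sequence.

```python
# Standard codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence, copied as printed.
first_line = "ATGACTAACA CCAAGAAGAT CATGGTGGTG TTCGGTACGC GGCCCGAGGC CATCAAGCTC"
dna = clean_sequence(first_line)

print(translate(dna))  # prints: MTNTKKIMVVFGTRPEAIKL
print(gc_fraction(dna))
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence through the same helpers would give the whole-gene GC fraction, which can be compared with the GC content field in the Gene Information table.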