Gene RPD_0749 details

Gene Information

Locus tag: RPD_0749
Symbol:
ID: 4021222
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 840033
End bp: 841154
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 54%
IMG OID: 637960938
Product: UDP-N-acetylglucosamine 2-epimerase
Protein accession: YP_567888
Protein GI: 91975229
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0381] UDP-N-acetylglucosamine 2-epimerase
TIGRFAM ID: [TIGR00236] UDP-N-acetylglucosamine 2-epimerase
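
The coordinate and length fields above are consistent with one another, which can be checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon; the variable names are illustrative and not part of the source record.

```python
# Values copied from the Gene Information fields above.
start_bp = 840033
end_bp = 841154

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1122
print(protein_length)  # 373
```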


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTAAGA TTGCCACGAT CGTTGGAACA CGGCCGGAAA TCATCCGACT GTCTCGCATT 
ATCGCAAAAT TCGACGCCCA TTTCGAACAT GTCCTGATCC ATACTGGTCA AAATTACGAC
TACGAGTTGA ACCAGGTCTT TTTCGATCAG TTGGGCGTCC GAGCGCCGGA CTTCTTCATG
CATGCGGCTG GTGCTACAGC CGCGGAAACA ATCGGCAATG TCATCATCGC TTCAGACAGA
ATCCTGGAGG AGACAAAGCC TGACGCCGTG CTGATTTTGG GAGACACCAA CAGCTCCCTG
GCAGCCATTG CCGCGAAAAG GCGCAAGATT CCGATATTTC ACATGGAAGC GGGGAACCGT
TGCTTTGATG CGCGGGTGCC GGAGGAAATC AACCGCAAAA TTGTGGATCA CACGGCTGAT
ATCAATCTAA CCTACAGCTC TATTGCCCGC GAGTATCTGC TACGTGAGGG CTTCCCGCCA
GACCAGGTGA TCCGCACGGG TTCCCCCATG CGGGAGGTTC TAGACTACTA TGCGTCAGGC
ATTGCCGCTT CGACAGTCCT GAGCGACTTA TCGCTCCAGC CTCACCAGTT TTTCGTTGTG
AGCTCACATC GGGAAGAGAA CGTCGATTCA CCGCAAAGAC TAAACAACCT GCTGCTCATC
CTGAACGAGC TGGCCGATCG ATATGGTCTG CCTATTATTG TTTCGACTCA CCCCCGGACT
AAAAATCGCC TGGCCGAAAA CAAGATCCAG ATGAACGGCC TGGTACAATT TCATCCGCCC
TTCGGATTTC TTGACTACGT AAAACTGCAG GCGCAGGCCA AGGCCGTTCT TTCCGACAGC
GGAACTATTA CCGAAGAATC GTCAATACTG AATTTCCCCG CGCTTAATTT ACGGGAAGTC
CAGGAGCGCC CTGAGGGCTT CGAAGAGGCC TCGGTCATGA TGGTCGGCCT TGATCTCACG
CGCATCCTGA CCGGCCTGCG CATTCTGGAG GACCAGCCGC GCAGCCCTGA GCGAACGCTG
CGTATGGTCG CCGACTACAC GCCTGACAAT GTGTCGGACA AGATGGTACG CATAATTCTA
AGCTACACCG ATTTCGTCAA CAGCCGCACG TGGCGTCAAT AG
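
The reported GC content can be recomputed from a sequence laid out in this 10-base block format. The following is a minimal sketch, assuming the whitespace is layout only; `gc_content` is an illustrative helper, and only the first line of the gene sequence is used here rather than the full record.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above; paste the full sequence to
# compare the result against the reported GC content.
first_line = "ATGCTTAAGA TTGCCACGAT CGTTGGAACA CGGCCGGAAA TCATCCGACT GTCTCGCATT"
print(f"{gc_content(first_line):.1%}")
```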
 
Protein sequence
MLKIATIVGT RPEIIRLSRI IAKFDAHFEH VLIHTGQNYD YELNQVFFDQ LGVRAPDFFM 
HAAGATAAET IGNVIIASDR ILEETKPDAV LILGDTNSSL AAIAAKRRKI PIFHMEAGNR
CFDARVPEEI NRKIVDHTAD INLTYSSIAR EYLLREGFPP DQVIRTGSPM REVLDYYASG
IAASTVLSDL SLQPHQFFVV SSHREENVDS PQRLNNLLLI LNELADRYGL PIIVSTHPRT
KNRLAENKIQ MNGLVQFHPP FGFLDYVKLQ AQAKAVLSDS GTITEESSIL NFPALNLREV
QERPEGFEEA SVMMVGLDLT RILTGLRILE DQPRSPERTL RMVADYTPDN VSDKMVRIIL
SYTDFVNSRT WRQ
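
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the tables differ in their permitted start codons, which does not matter here because the gene starts with ATG). `translate` and `CODON_TABLE` are illustrative names, not part of the source record.

```python
# Standard codon assignments, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCTTAAGA TTGCCACGAT CGTTGGAACA"))  # MLKIATIVGT
```

Applied to the last line of the gene sequence, the same function yields SYTDFVNSRTWRQ, which matches the end of the protein sequence.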