Gene RPD_1571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1571 
Symbol 
ID4022051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1764242 
End bp1765303 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content59% 
IMG OID637961766 
ProductdTDP-glucose 4,6-dehydratase 
Protein accessionYP_568709 
Protein GI91976050 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1088] dTDP-D-glucose 4,6-dehydratase 
TIGRFAM ID[TIGR01181] dTDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.285914 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAAGA AATTCATGAT TACGGGCGGC GCGGGCTTCA TCGGTTCGGC CGTCGTTCGT 
CGACTGATCG AAACCAGCGA TCATGAGGTT CTTGTCGTCG ACAAGCTGAC CTATGCCGGA
AATCTGGAAT CGCTTGCGCC GGTGTCGGCC AGTCCGAAGT TCAGCTTCGA ACGGGTCGAC
ATTACTGATG TCGAGGCCAT GCGCCGGGTC TTCGCGGAGT TTTCGCCCGA CATCGTGATG
CACCTCGCAG CCGAAAGTCA CGTCGACCGA TCGATTGACG GTCCCGGTGA ATTTATCCAG
ACCAACTTGG TCGGGACGTT CGTGTTGTTA CAGGCGGCCT TGAACCATTG GCGTACGCTG
CCGGCTGGTC GCAAGCCTGG TTTTCGCTTC CACCATGTCT CGACCGACGA GGTGTTTGGA
TCGCTTGGAC CGTCCGGCTC TTTCAACGAG GAGACGGCCT ACAGACCGAA TTCGCCCTAT
TCAGCCTCGA AAGCCGGATC CGACCATCTG GTGCGCGCCT GGCATCATAC TTACGGCCTG
CCGATGGTGA TGACCAATTG CTCGAACAAC TATGGTCCCT ATCAGTTTCC GGAGAAGCTG
ATTGCTCTGA TGATCATCAA TGCGCTGGAA GGCAGGCCGC TGCCGGTCTA TGGGACCGGC
GAGAATGTTC GCGACTGGCT TTACGTGGAG GATCATGCCG AGGCGCTGCT GCTCGTTGCC
GAAACCGGAG GCGTTGGCGA AAGCTATAAT ATCGGCGGAG ATAGCGAGCG CACCAACATA
TCGGTCGTTC GCTCGATCTG CCGGATCGTC GATGAGCTCG CGCCCGACGC AGCGATCGGT
CCGCGCGACA AGCTGATCGA ATTTGTGGTC GACCGGCCGG GTCACGATCT GCGTTATGCC
ATCGACGCAA CGAAGATCGA GCGCGAACTA GGTTGGAAGC CGCGCCATAG CTTCGAGACC
GGGTTGCGCC ACACAGTCCA ATGGTATCTC GACAATATGG GCTGGTGGAA ACGGGTCCGC
TCCGGTGCGT ACCGCGGCGA GCGACTCGGC ATCACCGTCT GA
 
Protein sequence
MTKKFMITGG AGFIGSAVVR RLIETSDHEV LVVDKLTYAG NLESLAPVSA SPKFSFERVD 
ITDVEAMRRV FAEFSPDIVM HLAAESHVDR SIDGPGEFIQ TNLVGTFVLL QAALNHWRTL
PAGRKPGFRF HHVSTDEVFG SLGPSGSFNE ETAYRPNSPY SASKAGSDHL VRAWHHTYGL
PMVMTNCSNN YGPYQFPEKL IALMIINALE GRPLPVYGTG ENVRDWLYVE DHAEALLLVA
ETGGVGESYN IGGDSERTNI SVVRSICRIV DELAPDAAIG PRDKLIEFVV DRPGHDLRYA
IDATKIEREL GWKPRHSFET GLRHTVQWYL DNMGWWKRVR SGAYRGERLG ITV