Gene RPB_1602 details

Gene Information

Locus tag: RPB_1602
Symbol:
ID: 3910073
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1805262
End bp: 1806242
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 64%
IMG OID: 637883498
Product: ADP-L-glycero-D-manno-heptose-6-epimerase
Protein accession: YP_485223
Protein GI: 86748727
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID: [TIGR02197] ADP-L-glycero-D-manno-heptose-6-epimerase
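
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single untranslated stop codon. The variable names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information record.
start_bp = 1805262
end_bp = 1806242
gene_length_bp = 981
protein_length_aa = 326

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon that is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # 981 327 326
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.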


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCTGG TAACCGGAGG CGCCGGCTTT ATCGGATCGA ACATCGCTGC GGCGCTGAAC 
GATGCGGGGC GCAGTGATGT GGCGGTGTGC GACTTTCTCG GCCACGAGGG CAAATGGAAG
AACCTCGCCA AGCGCCAGCT CGCCGATGTC GTGCCGCCCG CCGAACTCTC GGAATGGCTG
AGGGGTCGCC GGCTCGACGC GGTGTTTCAC ATGGGCGCGA TCTCGGAAAC GACCGCGACC
GATGGCGATC TGGTGATCGA CACCAACTTC CGGTTGTCGA TGCGGCTGCT CGACTGGTGC
ACTGAAAACC GGGTGCCGTT CATCTATGCC TCCTCCGCCG CCACCTATGG CGACGGCGCG
CAGGGCTTCA GCGATGATCC ATCGCTCGCC GCGCTGAAGC AATTGCGGCC GATGAATCTC
TACGGCTGGA GCAAGCACCT GTTCGATCTC GTGGTGGCCG AGCGCGCCGC ACGCGGCGAG
CGGCTGCCGC CGCAATGGGC CGGGTTGAAG TTCTTCAACG TGTTCGGCCC CAACGAGTAT
CACAAGGGGA CGATGGCGAG CGTGCTGGCG CGGCGCTTCG ACGACATCAG GGCCGGGCGC
GTGGTGCAGC TGTTCAAGTC GCATCGCGAC GGCATCGCCG ATGGCGACCA GCGCCGCGAT
TTCATCTATG TCGACGACGT GGTCCGGGTG ATGATGTGGC TGTTCGCGAC GCCGTCGGTG
AGCGGCCTGT TCAATGTCGG CACCAGCCAC GCCCGCAGTT TCCGGGATCT GATCCTCGCC
GCCTATTCGG CGCTCGGAAC CCCGCCGCAA ATCGACTACA TCGACATGCC GGAACAGATT
CGCGGCAGCT ATCAGTATTT CACCGAGAGC GAAGGCGACC GGTTGCGCGC CGCAGGCTAC
AATGGCGGCT TCACGCCGCT CGAAGATGCG GTCGCTTGCT ATGTCAGGGG GTACCTTGAC
GGCAGTGATC GCTTCCGCTG A
 
Protein sequence
MLLVTGGAGF IGSNIAAALN DAGRSDVAVC DFLGHEGKWK NLAKRQLADV VPPAELSEWL 
RGRRLDAVFH MGAISETTAT DGDLVIDTNF RLSMRLLDWC TENRVPFIYA SSAATYGDGA
QGFSDDPSLA ALKQLRPMNL YGWSKHLFDL VVAERAARGE RLPPQWAGLK FFNVFGPNEY
HKGTMASVLA RRFDDIRAGR VVQLFKSHRD GIADGDQRRD FIYVDDVVRV MMWLFATPSV
SGLFNVGTSH ARSFRDLILA AYSALGTPPQ IDYIDMPEQI RGSYQYFTES EGDRLRAAGY
NGGFTPLEDA VACYVRGYLD GSDRFR
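
As a consistency check between the two sequences, the following minimal sketch translates the first and last lines of the gene sequence and compares the result with the ends of the protein sequence. It assumes that the amino acid assignments of translation table 11 are the same as those of the standard genetic code (the tables differ in which start codons are permitted). The `translate` helper and the table construction are hypothetical illustrations, not the database's own tooling.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace and stopping at a stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence as listed in the record.
# Each full line holds 60 bases, so the last line also starts in frame.
first_line = "ATGCTGCTGG TAACCGGAGG CGCCGGCTTT ATCGGATCGA ACATCGCTGC GGCGCTGAAC"
last_line = "GGCAGTGATC GCTTCCGCTG A"

print(translate(first_line))  # MLLVTGGAGFIGSNIAAALN
print(translate(last_line))   # GSDRFR
```

The two outputs agree with the first 20 and the last 6 residues of the listed protein sequence.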