Gene RPC_1431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_1431 
Symbol 
ID3973699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp1554978 
End bp1555958 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content65% 
IMG OID637924546 
ProductADP-L-glycero-D-manno-heptose-6-epimerase 
Protein accessionYP_531312 
Protein GI90422942 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID[TIGR02197] ADP-L-glycero-D-manno-heptose-6-epimerase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.596658 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCTGG TGACCGGAGG GGCCGGTTTT ATCGGGTCGA ACGTCGTGGC CGCGCTGAAC 
GACGCCGGCC GCGCCGACGT CGCGGTGTGC GACGTGCTCG GCCACGACGG CAAATGGAAG
AATCTGGCCA AGCGCCAGCT CGCGGATGTC GTGCCGCCGG GCGAGCTGAT GGGCTGGCTG
CAGGGCCGCA GGCTCGACGC CATTATCCAC ATGGGCGCGA TCTCGGAGAC CACCGCGACC
GACGGCGACC TGGTGATCGA GACCAATTTC CGGCTGTCGA TGCGGCTGTT GGACTGGTGC
ACCGCCAACA AGGTGCCGCT GATCTATGCC TCGTCGGCCT CGACCTATGG CGATGGCGAG
CAGGGCTTCA AGGACGATCA ATCCGTCGCC GCGTTGAAAC AGCTGCGGCC GATGAACCTG
TATGGCTGGA GCAAGCATCT GTTCGATCTT GCGGTCGCCG AACGCGCCGC GCGCGGCGAC
CAGTTGCCGC CGCAATGGGC CGGGCTGAAG TTTTTCAACG TGTTCGGCCC CAACGAATAT
CACAAGGGCA GCATGATGAG CGTGCTGGCC AAGCGGTTCG ACGACGTCAA ATCCGGCCGC
GTGGTGCAGT TGTTCAAGTC GCACCGCGCC GGCATCGAAG ACGGCGACCA GCGCCGCGAC
TTCATCTATG TCGACGACGT GGTGCGGGTG ATGACCTGGC TGTTGGCGAC GCCGTCGGTC
AGCGGCATCT TCAACGTCGG CACCGGGCAT GCCCGCAGCT TCCGCGACCT GATCCTGTCG
GCCTATGCGG CGCTCGGCGC CAAGCCGAAC ATCGAATATA TCGACATGCC GGAAAGCATT
CGCGGCAGCT ACCAATACTT CACCGAGAGC GAAGGCGAGC GGTTGCGCGC CGCCGGCTAC
AATGGCGGCT TCACCGCGCT GGAAGACGCG GTCGCGCACT ACGTCAAAGG CTTCCTCGAC
GCCGAGGACC GCTTCCGGTG A
 
Protein sequence
MLLVTGGAGF IGSNVVAALN DAGRADVAVC DVLGHDGKWK NLAKRQLADV VPPGELMGWL 
QGRRLDAIIH MGAISETTAT DGDLVIETNF RLSMRLLDWC TANKVPLIYA SSASTYGDGE
QGFKDDQSVA ALKQLRPMNL YGWSKHLFDL AVAERAARGD QLPPQWAGLK FFNVFGPNEY
HKGSMMSVLA KRFDDVKSGR VVQLFKSHRA GIEDGDQRRD FIYVDDVVRV MTWLLATPSV
SGIFNVGTGH ARSFRDLILS AYAALGAKPN IEYIDMPESI RGSYQYFTES EGERLRAAGY
NGGFTALEDA VAHYVKGFLD AEDRFR