Gene RPD_1614 details

Gene Information

Locus tag: RPD_1614
Symbol:
ID: 4022094
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 1807964
End bp: 1808944
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 62%
IMG OID: 637961809
Product: ADP-L-glycero-D-manno-heptose-6-epimerase
Protein accession: YP_568752
Protein GI: 91976093
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID: [TIGR02197] ADP-L-glycero-D-manno-heptose-6-epimerase
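
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1807964, 1808944)
print(length, protein_length_aa(length))  # prints: 981 326
```

Under those assumptions, 1808944 - 1807964 + 1 = 981 bp, and 981 / 3 - 1 = 326 aa, matching the Gene Length and Protein Length fields.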


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCTGG TAACCGGAGG CGCCGGTTTT ATCGGATCGA ACATCGTGGC CGCGCTGAAC 
GACGCCGGCC GCAGCGACAT CGCCGTGAGC GATGTCCTGG GTCATGACGG CAAGTGGAAG
AACCTCGCCA AGCGTCAGCT TGCCGATGTC GTGCCGCCGG CCGAACTGGC CGACTGGCTG
AAAGGCCGCA GGCTCGAGGC GGTGTTCCAC ATGGGCGCGA TCTCGGAGAC GACCGCGACT
GATGGCGATT TGGTGATTGA CACCAACTTC CGGTTGTCAT TACGACTGCT CGACTGGTGT
ACCGAGAACC GGGTACCGTT CATCTACGCA TCTTCGGCCG CGACCTATGG CGACGGCGCG
CAAGGCTTCA GCGATGATGC CTCGCTTGCC GCCTTGAAGC AGTTGCGGCC GATGAATCTC
TACGGCTGGA GCAAACACCT GTTCGATCAG GTGGTCGCGG AGCGCGCCGC GCGCGGCGAC
CGGCTGCCGC CGCAATGGGC GGGGCTCAAG TTTTTCAACG TTTTCGGCCC CAATGAATAC
CACAAGGGCA CGATGGCGAG CGTGCTCGCG CGGCGTTTCG ACGACATCAA GGCCGGGCGC
GTGGTGCAGC TGTTCAAGTC GCATCGCGAC GGCATTGCCG ACGGCGACCA GCGCCGCGAT
TTCATCTATG TCGACGATGT CGTTCGGGTC ATGTTGTGGT TGTTCGCGAC GCCGTCGGTG
AGCGGCCTGT TCAATGTCGG CACCAGCCAT GCCCGCAGTT TCCGCGACCT GATCCTTGCG
GCCTATTCGG CGCTCGGAAC CCCACCGCAT ATCGAGTACA TCGACATGCC GGAACAGATT
CGCGGCAGCT ATCAGTATTT CACCGAGAGC GAAGGCGACC GGTTGCGTGC CGCAGGCTAT
AATGGCGGCT TCACGCCGCT CGAAGATGCG GTCGCTTCCT ATGTCAGGGG CTACCTTGAC
GGTAACGATC GCTTCCGCTG A
 
Protein sequence
MLLVTGGAGF IGSNIVAALN DAGRSDIAVS DVLGHDGKWK NLAKRQLADV VPPAELADWL 
KGRRLEAVFH MGAISETTAT DGDLVIDTNF RLSLRLLDWC TENRVPFIYA SSAATYGDGA
QGFSDDASLA ALKQLRPMNL YGWSKHLFDQ VVAERAARGD RLPPQWAGLK FFNVFGPNEY
HKGTMASVLA RRFDDIKAGR VVQLFKSHRD GIADGDQRRD FIYVDDVVRV MLWLFATPSV
SGLFNVGTSH ARSFRDLILA AYSALGTPPH IEYIDMPEQI RGSYQYFTES EGDRLRAAGY
NGGFTPLEDA VASYVRGYLD GNDRFR
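
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the database's own method: it strips the spaces and line breaks of the 10-base block layout, computes the G+C fraction, and translates the reading frame up to the first stop codon. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, whose sense-codon assignments translation table 11 shares; the alternative start codons that table 11 allows are not handled here, which is acceptable for this gene because it begins with ATG.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated as TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and newlines used in blocked sequence listings."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate frame 1 until the first stop codon (stop not included)."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "ATGCTGCTGG TAACCGGAGG CGCCGGTTTT"
print(translate(first_block))    # prints: MLLVTGGAGF
print(gc_fraction(first_block))  # prints: 0.6
```

The first ten translated residues match the start of the listed protein sequence. Passing the full gene sequence to the same functions gives values to compare with the Protein Length and GC content fields.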