Gene RPC_4163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4163 
Symbol 
ID3972306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4625387 
End bp4627318 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content64% 
IMG OID637927266 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_534007 
Protein GI90425637 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.737802 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.887363 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGGA TCTCCGCTTT TACTTACCGG AACTGGCTGA TCGCGATTCA CGATGCCGGG 
GTGACGGCGT TCGCGGTGAT CGCCAGTTTT TTCTTGCGGT TCCAAGGCGA GAACTTCACC
GAGCGACTGC CGCTGCTGCT GACGATTCTG CCTTATTTCG TGGTTTTCAG CTTTTTCGTC
TGTTACTTTT TCCATCTGAC CACCACCAAG TGGCGGTTCA TTTCCGTCCC CGATCTTTTG
AACATCCTGC GCGCGGCCAG CGTACTGACG GTCGCGCTGC TGGTGCTCGA TTATATCTTC
CTGGCGCCGA ACGTGTTCGG AACGTTCTTT CTCGGCAAGA CCACCATCAT CATCTATTGG
TTTCTCGAGG TGTTCTTCCT CAGCGGTTCG CGGTTGGCCT ACCGGCATTT CCGCTACACC
CGCACCCGCA ACCAGGCCAA GCTCAAGGAC GCCGCGGCGA CCTTGCTGAT CGGGCGCGCC
GCCGACACCG AAGTGTTTCT GCGCGCGGTC GAGAGCGGCG CGGTGAAGCG GCAATGGCCG
GTCGGCATCC TGTCGCCGTC CAGTTCGGAT CGCGGGCAGC TGATCCGCGG CATCCCGGTG
CTGGGCACGA TCGACGATCT GGCCAACGTG GTCGACGATT TCGCCGGCCG TAACAAGCCG
ATCAAGCTCG TGGTGATGAC GCCGTCGGCG TTCGAGAGCG AGGCGCAACC GGAATCCGTG
CTGATGCGGG CGCGCAAGTT GGGCCTCGCG GTCAGCCGGT TGCCGTCGCT GGAGGAGAGC
CGCGATACGC CGCGGCTGGC GCCGGTGGCG GTGGAGGATC TGTTGCTGCG GCCGAGCGTC
AAGATCGACT ACGCCCGGCT GGAGGCGTTC GTGCGCGGCA AATCCGTGGT GGTCACCGGC
GGCGGCGGAT CGATCGGTTT CGAGATCTGC GACCGCGTCA CCACCTTCGG CGCCGCGCGG
CTGTTGATCA TCGAGAATTC CGAACCGGCG CTGCATGCGG CGATGGAAAC CATTCTCGCC
AAGGAGCCCG CGGTGGCGGT CGAAGGCCGC ATGGCCGACG TGCGCGACCG CGATCGCATC
CACCAATTGC TGACCGCGTT CAAGCCGGAC ATCGTGTTCC ACGCCGCGGC GCTGAAGCAT
GTGCCGATCC TGGAGCGCGA CTGGGCGGAG GGCGTCAAGA CCAACATCTT CGGTTCGGTC
AACGTCGCCG ACGCCGCACT GGCGTCGGGC GCCGCCGCGA TGGTGATGAT TTCCACCGAC
AAGGCGATCC AGCCGGTGTC GATGCTGGGG CTGACCAAGC GGTTCGCCGA AATGTACTGT
CAGGCGCTGG ATCGCCAATT GATGACCGGC GCCGACGGCG GCAAGCCGCC GATGCGGCTG
ATCTCGGTGC GGTTCGGCAA CGTGTTGGCC TCGAACGGCT CGGTGGTGCC GAAGTTCAAG
GCGCAGATCG AGGCCGGCGG GCCGATCACG GTGACCCATC CGGACATGGT GCGCTACTTC
ATGACCATCC GCGAAGCCTG CGATCTGGTG ATCACGGCCG CGACTCATGC GCTCAGTCCG
GCGCGGTTCG ATGCCTCGGT CTATGTGCTG AACATGGGGC AGCCGGTGAA GATCGTCGAT
CTCGCCGAGC GGATGATCCG GCTGTCCGGG CTGCAGCCCG GCTACGACAT CGAGATCGTG
TTCACCGGGG TGCGCCCCGG CGAGCGGCTG AATGAAATCC TGTTCGCCGA GCAGGAGCCG
ATCAGCGAGA CCGGCATCGC CGGCATCGTC GCGGCCAAAC CGAACCAGCC GCCGATGGCG
TTGCTGCGGC AATGGCTGAC CCAACTCGAG CAGGGCGTCA GCAACGAAAC CTGCTCGGAG
ATCTCCGGGG TGTTGAAAGC CGCGGTGCCG GAGTTCGGCG CCGAGGCCGA GCTTCGCACC
GCGGCGCAGT GA
 
Protein sequence
MTRISAFTYR NWLIAIHDAG VTAFAVIASF FLRFQGENFT ERLPLLLTIL PYFVVFSFFV 
CYFFHLTTTK WRFISVPDLL NILRAASVLT VALLVLDYIF LAPNVFGTFF LGKTTIIIYW
FLEVFFLSGS RLAYRHFRYT RTRNQAKLKD AAATLLIGRA ADTEVFLRAV ESGAVKRQWP
VGILSPSSSD RGQLIRGIPV LGTIDDLANV VDDFAGRNKP IKLVVMTPSA FESEAQPESV
LMRARKLGLA VSRLPSLEES RDTPRLAPVA VEDLLLRPSV KIDYARLEAF VRGKSVVVTG
GGGSIGFEIC DRVTTFGAAR LLIIENSEPA LHAAMETILA KEPAVAVEGR MADVRDRDRI
HQLLTAFKPD IVFHAAALKH VPILERDWAE GVKTNIFGSV NVADAALASG AAAMVMISTD
KAIQPVSMLG LTKRFAEMYC QALDRQLMTG ADGGKPPMRL ISVRFGNVLA SNGSVVPKFK
AQIEAGGPIT VTHPDMVRYF MTIREACDLV ITAATHALSP ARFDASVYVL NMGQPVKIVD
LAERMIRLSG LQPGYDIEIV FTGVRPGERL NEILFAEQEP ISETGIAGIV AAKPNQPPMA
LLRQWLTQLE QGVSNETCSE ISGVLKAAVP EFGAEAELRT AAQ