Gene RPC_3454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3454 
Symbol 
ID3972122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp3829952 
End bp3831091 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content71% 
IMG OID637926565 
Producthypothetical protein 
Protein accessionYP_533313 
Protein GI90424943 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5653] Protein involved in cellulose biosynthesis (CelD) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.199214 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTCGG TTACGATTTG TTCACCAGAA CGGGCCGCAG CCGCGCCGTG GGACGATCTG 
GTGCGGCGCG CCTCGCCCAA CGTGTTCATG AATCCGGCGG CGCTGCGCGC GGCGCAGGAC
TGCGGCTTCG CCGACATCCG GGTGCTGCTG GCCTTCGACG AGGCCGCCTC GCCGCGGGCG
CTGGTCGGGC TGTGGGCGCT GCAGCGCCGC GTGGTCGCGC CGCTGCTGCC GGCGATCCTC
GAGGCCTTGC CGTTCTACTA TGCCTTCCTG TCGAGCCCGG TGGTCGATCC GGGGTTCGCC
GACGAGGTGA TGCCGGCGTT CCTCGCGGCG ATCGAACGCG ATCCCGCGCT GCCCCGGGTG
ATCACGCTGA AATCGCTCGA CGCCGAAGCG CCGAGCTATG CCGCGCTGCG CACCGCGTTG
AGCGGGAAGG GCGGCGCGCA GCTCGCGCTG AAACAATTCG CCCGGCCGTT CGCCACCAAG
GACCACGGCA TCAAGCGTTC CGGCTCGACC CGCAAGAAGC TGCGGCAGGA CTGGAACCGG
CTTTCCGCGC TCGGCGCCGT CGAGCTGGTC AACGACCGCT CGGCCGCTGG CGTCGCGGCG
GCATTCGAGA GCTTTCTGGC GCTGGAGGCG GGCGGCTGGA AGGGCGCGCG GGGCACCGCG
CTGACCTGCG ATCCGCGCCA TGCCCGCTTT ACCCGCGAAT TGATCCGGGC GCTGGCGGCG
CGCGGCGACG CCAGCGTGGC GCTGCTGCGG GTCGAGGGAC GGGCGATCGC GGCGCAAGTG
CTGATGTATT GTGGAACCAG CGCCTACACC TGGAAGACCG GCTACGACGC CGAATTCGCC
AAATTCTCGC CGGGCGCGCT GTTGATCGAC AAGCTCGCCG AAGCGCTGTT CGCCGACGGG
GTGATCGACA CCATCGATTC TTGCTCGGTG CAGGACAGCT TCATGGCGCA GCTGTGGAGC
GGCCGGCGCG CCATGGTCGA CCTAGTGGTC GACGTCGGGC CGGGCCGCTC GTTCGGATTC
GCCGTCGAGA CCGCGCGGCA GCGCGGCATC GCGCGGCTGC GAGAACTCCG CAACCGGCTG
CGGGCCTGGC GGGCAGTCCC GCCGGCGCCG AAGAAGCCGC CGGTGCCGCG CGAGGCGTGA
 
Protein sequence
MISVTICSPE RAAAAPWDDL VRRASPNVFM NPAALRAAQD CGFADIRVLL AFDEAASPRA 
LVGLWALQRR VVAPLLPAIL EALPFYYAFL SSPVVDPGFA DEVMPAFLAA IERDPALPRV
ITLKSLDAEA PSYAALRTAL SGKGGAQLAL KQFARPFATK DHGIKRSGST RKKLRQDWNR
LSALGAVELV NDRSAAGVAA AFESFLALEA GGWKGARGTA LTCDPRHARF TRELIRALAA
RGDASVALLR VEGRAIAAQV LMYCGTSAYT WKTGYDAEFA KFSPGALLID KLAEALFADG
VIDTIDSCSV QDSFMAQLWS GRRAMVDLVV DVGPGRSFGF AVETARQRGI ARLRELRNRL
RAWRAVPPAP KKPPVPREA