Gene RPC_4197 details


Gene Information

Locus tag: RPC_4197
Symbol:
ID: 3972554
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4664626
End bp: 4667166
Gene Length: 2541 bp
Protein Length: 846 aa
Translation table: 11
GC content: 59%
IMG OID: 637927299
Product: glycosyl transferase, group 1
Protein accession: YP_534040
Protein GI: 90425670
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3754] Lipopolysaccharide biosynthesis protein
TIGRFAM ID:
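
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming a terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4664626, 4667166)
print(length, protein_length(length))  # 2541 846
```

Both values agree with the Gene Length and Protein Length fields.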


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGTAACC AGTTTGCAAG CATCGAAGAG TTCTTCGATG AGAGCTATTA CGCGTTTTCC 
GGCGAAGCCA AAAAGAACGG AATCCGTCCA ATCGATCACT ATCTCCAGTT TGGGGAACAA
CTCGGTGCTG CTCCGTCGAC CAGATTCGAC CCCAAATACT ATTTGAAACA ATATCCCGAT
CTTGGTGGCT GGCAAGGGGG ACTGCTCAAC CACTATCTGC AGTACGGCAG AGCGGAGGGC
CGACAGGGCG TCGCAATGAC CCCGAGCATC GCGTGCCCGA CGGAAAAGAT CGATCCCGGC
CGCGCGACCA TCCTGCTGGC CGTTCACGAC GCGTCACGGT CCGGAGCGCC GATTCTGGCA
TGGAATCTGA TCAACGAGTT GCGCAAGCAA CATAACGTCG TGGTTCTGCT CAAAAGCGGC
GGCCCGATCG AACCGGCGCT CCGGGAGGCG GCGACTGAAC TCGTCACCAT CCCTGCCGAA
TTTCCCTATG GGTCCGGCGA GGACGCTCTC TTCGCTCAGA AGCTGACCGA AACCTACTCG
CCGCTCTACG CCATCGCCAA CAGCGTCGCG ACGCGCGAAC TGGCCATTCT GCTTGAGGCC
GCAGGCGTCC CGGTGATCGC ATTGGTGCAC GAATTTTCCA GCTACTTTCA ACCGATCGGG
ATTCTCAACC CGCTGTACGT TTCGGCGTCG AAACTCGTCT TTCCGGCGCC GATTGTCGCC
GACGCCAGTG TGCAGGACTA TCCAGGTCTC AAGCCCCGAC ATTTGGATAT CCTGCCTCAA
GGCCCGTCGA AAGTTCCCAG CTTTCAGCAC CCTACCGGCG GCGCCGAGCG TCGCAGTTCC
GTGCAACGAC TGGAGGACCT GGCGCTTGAA GATACCGCCG TGATTCTCGG GACTGGGCGA
ATCGAATACC GAAAAGGCGT CGATGCGTTC ATCGCAGCGG CCTCGCAGGT CCAACGCAGC
GCGACACGAA AATTCAAATT CGTCTGGATC GGCCATCTGC ATCCGTCCGA CGCCAGTTAT
TTCGGTTTTC TGCAAGAGCA GATCAAGCGA AGCGGCCTTG AGCAACACGT TCTGTTCGTC
GATGAGGTCG ATGAACTTGA ACCGTTCTAC AGCAAGGCCG ATGTGTTTTT TCTCAGTTCG
CGGCTGGACC CGCTTCCAAA CGTTGCCATC GACGCGTCGT TGAGGAAGAT TCCAGTGGTC
TGCTTCAAGA ACGCCAGCGG CTTCGCCGAG TTGCTCGAGT GCAGCGACAC TGCCAAAGAA
CTGGTCGTCC CCTACCTCGA CAGTTCGGCG GCCGGCCGCA TGATTTGCGA CCTGCTGGCA
GATGCCCGCC TGTTGACACG ACTTGGCGAG GATATTCAGG CTGTTGCCAA CGCCACCTTC
GACATGGGCC AATATGCCCG CAAGCTCGAC CAACTCGGAC GCACCTGTGC CGCCGACATT
GAGCAGATGG CGCTCGACCA GGCGACGATC CTAGAGTCCG GGATCTTCAA CGCCAGCCTT
CGTTTCGGAA GTTGCGCGGA AAAGTTCACG CCCGATCAGG CCGTCAACGA ATACCTCAAC
GCGTCGCGAC TTTGCCGGCC GCTGCACCGG GGCAAGGCCG GAATGCTGAT TCGCCGGCCG
GTCGAAGGCT TTCATCCTTT GATTTACGCC GCCGAGAATC CCGAGGTCGA TCGCCAAGGC
ATCGATCCAC TGGCGCATTT TCTTCGCAAT GGGCGACCGG AAGGGCGTTG GACCCATCGT
GTCATCCGCC CCGAACCGCG CCGCGAAAGC AATGCAGCCC GGCCGCGGAT CGCCATCCAC
GGCCATTTTT ACTACCCGGA TCTGCTTGAG AGCTTCCTAA AGTTGATCGC CGCGAATGCC
AGTTCGGTCG ATCTGTTCTT GACCACCAGC GGCCCGGAGC AGGCCGCGCA GATCCGAAAG
TCGCTGCGGG CCTTCGGCAT TCAAAATGCC GATGTCTGGT CGGTGCCGAA TCGCGGGCGC
GATATCGGGC CCTTTCTCAA GGAAATGCCC GACAAGCTCG GCTCCTACGA CATCGTCGGC
CATTTTCACG GCAAGCGAAG CAAGCACGTC GACTCCACGG TCGGCGACCA ATGGCGGGAT
TTTGCCTGGC AGCATCTGAT CGGCGACGCG TTTCCGATGA TCGACGTTAT CGCCGATGCA
TTCGCGGAGG ATGCCAAGCT CGGGCTGGTT TTTGCAGAGG ATCCCTATCT GAACGGATGG
GACGAGAACC GCGACCTGGC CGAACGGCTG GCGCAGCGCA TGAAGATCGA GGCCCCGCTT
CCCGAACACT TCGATTTTCC GATCGGGACG ATGTTCTGGG CGCGTGTCGC TGCGTTGCAG
CCGTTGTTTC AGTTGAACCT GGATTGGAAT GACTACCCGC ACGAGCCGCT GCCGATCGAC
GGCACGATTT TGCACGCGCT CGAGCGCATC GTTCCGTTCG CCGTCCAGAA ATCCGGCTTC
GAATACGCCA CAACCTATGT GCGTTCCAGC ATGCGCGACG ATGGCCTGGC CTTTATTCGC
CGCCCCGGCT TGCAAAGGTG A
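
The GC content field in the Gene Information section can in principle be recomputed from the gene sequence. The following is a minimal sketch that assumes the sequence is pasted as shown here, in space-separated blocks of ten bases. The helper name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = dna.count("G") + dna.count("C")
    return gc / len(dna)


# First line of the gene sequence only; pass the full sequence to
# compare against the GC content field of the record.
first_line = "TTGAGTAACC AGTTTGCAAG CATCGAAGAG TTCTTCGATG AGAGCTATTA CGCGTTTTCC"
print(f"{gc_content(first_line):.0%}")
```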
 
Protein sequence
MSNQFASIEE FFDESYYAFS GEAKKNGIRP IDHYLQFGEQ LGAAPSTRFD PKYYLKQYPD 
LGGWQGGLLN HYLQYGRAEG RQGVAMTPSI ACPTEKIDPG RATILLAVHD ASRSGAPILA
WNLINELRKQ HNVVVLLKSG GPIEPALREA ATELVTIPAE FPYGSGEDAL FAQKLTETYS
PLYAIANSVA TRELAILLEA AGVPVIALVH EFSSYFQPIG ILNPLYVSAS KLVFPAPIVA
DASVQDYPGL KPRHLDILPQ GPSKVPSFQH PTGGAERRSS VQRLEDLALE DTAVILGTGR
IEYRKGVDAF IAAASQVQRS ATRKFKFVWI GHLHPSDASY FGFLQEQIKR SGLEQHVLFV
DEVDELEPFY SKADVFFLSS RLDPLPNVAI DASLRKIPVV CFKNASGFAE LLECSDTAKE
LVVPYLDSSA AGRMICDLLA DARLLTRLGE DIQAVANATF DMGQYARKLD QLGRTCAADI
EQMALDQATI LESGIFNASL RFGSCAEKFT PDQAVNEYLN ASRLCRPLHR GKAGMLIRRP
VEGFHPLIYA AENPEVDRQG IDPLAHFLRN GRPEGRWTHR VIRPEPRRES NAARPRIAIH
GHFYYPDLLE SFLKLIAANA SSVDLFLTTS GPEQAAQIRK SLRAFGIQNA DVWSVPNRGR
DIGPFLKEMP DKLGSYDIVG HFHGKRSKHV DSTVGDQWRD FAWQHLIGDA FPMIDVIADA
FAEDAKLGLV FAEDPYLNGW DENRDLAERL AQRMKIEAPL PEHFDFPIGT MFWARVAALQ
PLFQLNLDWN DYPHEPLPID GTILHALERI VPFAVQKSGF EYATTYVRSS MRDDGLAFIR
RPGLQR
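
The gene sequence starts with TTG while the protein starts with M. Under translation table 11 (the NCBI bacterial, archaeal and plant plastid code), TTG is an accepted start codon and is read as methionine at the first position, while still coding for leucine elsewhere. The sketch below illustrates this with a hand-built codon table. The function name `translate_cds` is hypothetical, and the code is not the database's own translation routine.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... as in the NCBI genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS with table 11, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # any start codon initiates with Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGAGTAACC AGTTTGCAAG CATCGAAGAG"))  # MSNQFASIEE
```

The result matches the first ten residues of the protein sequence. Applied to the last 21 bases (`CGCCCCGGCT TGCAAAGGTG A`), the same function returns `RPGLQR`: the internal TTG is read as leucine and translation stops at TGA.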