Gene Information |
Locus tag | RPC_4197 |
Symbol | |
ID | 3972554 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 4664626 |
End bp | 4667166 |
Gene Length | 2541 bp |
Protein Length | 846 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637927299 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_534040 |
Protein GI | 90425670 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3754] Lipopolysaccharide biosynthesis protein |
TIGRFAM ID | |
|
|
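The coordinates, gene length, and protein length reported above are mutually consistent if the start and end positions are read as 1-based and inclusive. That convention is an assumption, since the record does not state it. A minimal Python sketch of the arithmetic:

```python
# Values copied from the Gene Information table.
start_bp = 4664626
end_bp = 4667166

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is a stop and encodes no residue.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under that assumption the sketch reproduces the listed 2541 bp and 846 aa, with no leftover bases.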
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGTAACC AGTTTGCAAG CATCGAAGAG TTCTTCGATG AGAGCTATTA CGCGTTTTCC GGCGAAGCCA AAAAGAACGG AATCCGTCCA ATCGATCACT ATCTCCAGTT TGGGGAACAA CTCGGTGCTG CTCCGTCGAC CAGATTCGAC CCCAAATACT ATTTGAAACA ATATCCCGAT CTTGGTGGCT GGCAAGGGGG ACTGCTCAAC CACTATCTGC AGTACGGCAG AGCGGAGGGC CGACAGGGCG TCGCAATGAC CCCGAGCATC GCGTGCCCGA CGGAAAAGAT CGATCCCGGC CGCGCGACCA TCCTGCTGGC CGTTCACGAC GCGTCACGGT CCGGAGCGCC GATTCTGGCA TGGAATCTGA TCAACGAGTT GCGCAAGCAA CATAACGTCG TGGTTCTGCT CAAAAGCGGC GGCCCGATCG AACCGGCGCT CCGGGAGGCG GCGACTGAAC TCGTCACCAT CCCTGCCGAA TTTCCCTATG GGTCCGGCGA GGACGCTCTC TTCGCTCAGA AGCTGACCGA AACCTACTCG CCGCTCTACG CCATCGCCAA CAGCGTCGCG ACGCGCGAAC TGGCCATTCT GCTTGAGGCC GCAGGCGTCC CGGTGATCGC ATTGGTGCAC GAATTTTCCA GCTACTTTCA ACCGATCGGG ATTCTCAACC CGCTGTACGT TTCGGCGTCG AAACTCGTCT TTCCGGCGCC GATTGTCGCC GACGCCAGTG TGCAGGACTA TCCAGGTCTC AAGCCCCGAC ATTTGGATAT CCTGCCTCAA GGCCCGTCGA AAGTTCCCAG CTTTCAGCAC CCTACCGGCG GCGCCGAGCG TCGCAGTTCC GTGCAACGAC TGGAGGACCT GGCGCTTGAA GATACCGCCG TGATTCTCGG GACTGGGCGA ATCGAATACC GAAAAGGCGT CGATGCGTTC ATCGCAGCGG CCTCGCAGGT CCAACGCAGC GCGACACGAA AATTCAAATT CGTCTGGATC GGCCATCTGC ATCCGTCCGA CGCCAGTTAT TTCGGTTTTC TGCAAGAGCA GATCAAGCGA AGCGGCCTTG AGCAACACGT TCTGTTCGTC GATGAGGTCG ATGAACTTGA ACCGTTCTAC AGCAAGGCCG ATGTGTTTTT TCTCAGTTCG CGGCTGGACC CGCTTCCAAA CGTTGCCATC GACGCGTCGT TGAGGAAGAT TCCAGTGGTC TGCTTCAAGA ACGCCAGCGG CTTCGCCGAG TTGCTCGAGT GCAGCGACAC TGCCAAAGAA CTGGTCGTCC CCTACCTCGA CAGTTCGGCG GCCGGCCGCA TGATTTGCGA CCTGCTGGCA GATGCCCGCC TGTTGACACG ACTTGGCGAG GATATTCAGG CTGTTGCCAA CGCCACCTTC GACATGGGCC AATATGCCCG CAAGCTCGAC CAACTCGGAC GCACCTGTGC CGCCGACATT GAGCAGATGG CGCTCGACCA GGCGACGATC CTAGAGTCCG GGATCTTCAA CGCCAGCCTT CGTTTCGGAA GTTGCGCGGA AAAGTTCACG CCCGATCAGG CCGTCAACGA ATACCTCAAC GCGTCGCGAC TTTGCCGGCC GCTGCACCGG GGCAAGGCCG GAATGCTGAT TCGCCGGCCG GTCGAAGGCT TTCATCCTTT GATTTACGCC GCCGAGAATC CCGAGGTCGA TCGCCAAGGC ATCGATCCAC TGGCGCATTT TCTTCGCAAT GGGCGACCGG AAGGGCGTTG GACCCATCGT GTCATCCGCC CCGAACCGCG CCGCGAAAGC AATGCAGCCC GGCCGCGGAT CGCCATCCAC 
GGCCATTTTT ACTACCCGGA TCTGCTTGAG AGCTTCCTAA AGTTGATCGC CGCGAATGCC AGTTCGGTCG ATCTGTTCTT GACCACCAGC GGCCCGGAGC AGGCCGCGCA GATCCGAAAG TCGCTGCGGG CCTTCGGCAT TCAAAATGCC GATGTCTGGT CGGTGCCGAA TCGCGGGCGC GATATCGGGC CCTTTCTCAA GGAAATGCCC GACAAGCTCG GCTCCTACGA CATCGTCGGC CATTTTCACG GCAAGCGAAG CAAGCACGTC GACTCCACGG TCGGCGACCA ATGGCGGGAT TTTGCCTGGC AGCATCTGAT CGGCGACGCG TTTCCGATGA TCGACGTTAT CGCCGATGCA TTCGCGGAGG ATGCCAAGCT CGGGCTGGTT TTTGCAGAGG ATCCCTATCT GAACGGATGG GACGAGAACC GCGACCTGGC CGAACGGCTG GCGCAGCGCA TGAAGATCGA GGCCCCGCTT CCCGAACACT TCGATTTTCC GATCGGGACG ATGTTCTGGG CGCGTGTCGC TGCGTTGCAG CCGTTGTTTC AGTTGAACCT GGATTGGAAT GACTACCCGC ACGAGCCGCT GCCGATCGAC GGCACGATTT TGCACGCGCT CGAGCGCATC GTTCCGTTCG CCGTCCAGAA ATCCGGCTTC GAATACGCCA CAACCTATGT GCGTTCCAGC ATGCGCGACG ATGGCCTGGC CTTTATTCGC CGCCCCGGCT TGCAAAGGTG A
|
Protein sequence | MSNQFASIEE FFDESYYAFS GEAKKNGIRP IDHYLQFGEQ LGAAPSTRFD PKYYLKQYPD LGGWQGGLLN HYLQYGRAEG RQGVAMTPSI ACPTEKIDPG RATILLAVHD ASRSGAPILA WNLINELRKQ HNVVVLLKSG GPIEPALREA ATELVTIPAE FPYGSGEDAL FAQKLTETYS PLYAIANSVA TRELAILLEA AGVPVIALVH EFSSYFQPIG ILNPLYVSAS KLVFPAPIVA DASVQDYPGL KPRHLDILPQ GPSKVPSFQH PTGGAERRSS VQRLEDLALE DTAVILGTGR IEYRKGVDAF IAAASQVQRS ATRKFKFVWI GHLHPSDASY FGFLQEQIKR SGLEQHVLFV DEVDELEPFY SKADVFFLSS RLDPLPNVAI DASLRKIPVV CFKNASGFAE LLECSDTAKE LVVPYLDSSA AGRMICDLLA DARLLTRLGE DIQAVANATF DMGQYARKLD QLGRTCAADI EQMALDQATI LESGIFNASL RFGSCAEKFT PDQAVNEYLN ASRLCRPLHR GKAGMLIRRP VEGFHPLIYA AENPEVDRQG IDPLAHFLRN GRPEGRWTHR VIRPEPRRES NAARPRIAIH GHFYYPDLLE SFLKLIAANA SSVDLFLTTS GPEQAAQIRK SLRAFGIQNA DVWSVPNRGR DIGPFLKEMP DKLGSYDIVG HFHGKRSKHV DSTVGDQWRD FAWQHLIGDA FPMIDVIADA FAEDAKLGLV FAEDPYLNGW DENRDLAERL AQRMKIEAPL PEHFDFPIGT MFWARVAALQ PLFQLNLDWN DYPHEPLPID GTILHALERI VPFAVQKSGF EYATTYVRSS MRDDGLAFIR RPGLQR
|
| |
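The gene sequence begins with TTG, while the protein sequence begins with M. This is expected under translation table 11, where TTG is one of the permitted initiation codons and an initiation codon is read as methionine. The sketch below illustrates this on the first ten codons and the last four codons of the record. The function name `translate_cds` and the `START_CODONS` set are illustrative and not part of the record, and the code is a simplified stand-in for a full translation tool.

```python
from itertools import product

# Standard codon-to-residue assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments as the standard code for elongation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Assumption: alternative initiation codons commonly listed for table 11.
START_CODONS = {"ATG", "TTG", "CTG", "GTG", "ATT", "ATC", "ATA"}


def translate_cds(dna, has_start=True):
    """Translate a CDS fragment; stop at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and has_start and codon in START_CODONS:
            residues.append("M")  # an initiation codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks and the final 12 bases of the gene sequence.
head = translate_cds("TTGAGTAACC AGTTTGCAAG CATCGAAGAG")
tail = translate_cds("TTGCAAAGGTGA", has_start=False)
print(head, tail)
```

The first fragment translates to MSNQFASIEE, matching the first block of the protein sequence, and the final codons give LQR followed by the TGA stop, matching the end of the protein.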