Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_1701 |
Symbol | |
ID | 3972528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 1840666 |
End bp | 1843938 |
Gene Length | 3273 bp |
Protein Length | 1090 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637924814 |
Product | transglutaminase-like |
Protein accession | YP_531579 |
Protein GI | 90423209 |
COG category | [E] Amino acid transport and metabolism [S] Function unknown |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases [COG4196] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.126756 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.484971 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCGATCT TCGTCGCCCT ACACCACGTC ACGCATTACA AATACGACCG CCCCATCGAT CTTGGCCCGC AAACCGTGCG GCTGCGGCCG GCGCCGCACA CCAGGACCCC GATCCTGAGC TATTCGCTCA AGGTCACGCC TTCGAATCAC TTCGTCAATT GGCAGCAAGA CCCGCAGGGC AATTGGCTGG CGCGCTTCGT GTTTCCGGAG AAGGCCAGCG AACTGAAGAT CGAGGTGGAT TTCTCCGCCG AGATGACGGT GATCAATCCG TTCGACTTCT TCGTCGAGCC TTATGCCGAA AGCTTCCCGT TCGCTTATAC CAATGATTTG CAGACCGAGC TTGCGCCCTA TCTCGCCACC GAGCCGAGCG GGCCGCTGTT CGAGGCCTAT CTCGCCGAGA TCGCCCGCGA GGCGCCGAGC ACGGTCAACT TCCTGGTCGA TCTCAACGCC CGGCTGCGCG ACCGCATCAA TTACATCATC CGCATGGAGC CGGGGATCCA GACCCCGGAG CATACGCTGC AGACCGGCGC CGGCTCGTGC CGTGATTCGG CCTGGCTGTT GATCCAGACG CTGCGCCGGC TCGGCCTCGC GGCGCGCTTC GTCTCCGGTT ATCTCATTCA GTTGCGGCCC GACATCGCCC CGATCGATGG CAAGGTCGAG GTCGAGAACG ATTTCACCGA TCTGCACGCC TGGGCCGAGG TCTATATCCC CGGCGCCGGC TGGATCGGCT TCGACGCCAC CTCGGGGATG CTGGCCGGCG AGGGCCATAT CCCGGTCGCC GCGACGCCGC ATTATCGCTC GGCGGCGCCG ATCTCCGGCA TGGCCGGCTT CGCCAATGTC GATTTCAACT TCGAGATGAG CGTGAAGCGG ATCCGCGAGG TGCCGCGGAT CACCAAGCCG TTCTCCGACG CAGCCTGGGC GCGGCTCGAT CAGGTCGGCG AACAGGTCGA CGCCGATCTG AAAGCCGACG ACGTCAGGTT GACCATGGGC GGTGAACCGA CCTTCGTCTC GATCGACGAT CTGGAATCGC CGGAATGGAA TATCGCCGCG GTCGGCCCGA CCAAGCGGGT GCTCGGCGAC GATCTGCTGC GCCGGCTACG CGAGCGCTTT GCGCCGGGCG GGCTGCTGCA TTTCGGCCAG GGCAAATGGT ATCCCGGCGA AAGCCTGCCG CGCTGGGCGT TCGGGCTGTA TTGGCGCAAG GACGGCGTGC CGGTCTGGAA AAACCCGCAG CTGATCGCGC CGGTCGAGGG CAAGCGTCCG GTCAAGATCG AGGAAGCGCA GCGCTTCGCC GTCGACACCG CCAAGCGGCT CGGCGTTCAT TCCGACTACA TCCTGCCGGC GTTCGAAGAC CCGGCGCACT GGCTGCAGAA AGAAGCGGGG CTGCCGCCGA ATGTCGATCC CAGCGATTCC AAGCTCGCCG ATCCGGAAGA GCGCGCGCGG ATGGCGCGGG TGTTCGATCA GGGCCTCAAC GTGCCGAAGG GCTACGTGCT GCCGATCCAG CGTTGGAACG CCGAGGCCGA TCGCTGGCGC AGCGAGCGCT GGAAATTCCG CCGCGGCAAT CTGTTTCTGA CGCCGGGCGA TTCGCCGCTC GGGCTGCGGC TGCCGATCTC GTCATTGCCG CATATTCCCG AGGACGAATT CCCCTACACC GTCGAGCAGG ATCCGCTGGA GCCGCGCGAT CCGTTGTCGG AGGCTGGCGA AGCATCGGCC GAGGCCGCGG CGAAGTCGTC CGCGGAAAAG CCGAAGAAGA AGCAAGTCCC GGTGCGCACC GCGATGTCGA TCGAGGTGCG CGACGGCGTG CTGTGCGCGT TCATGCCGCC GGTGGAGAAG CTGGAAGATT ATCTCGAACT GATCGCGGCG GTGGAGGCCA CCGCCGAAGA GATGCAGATG CAGGTCCACG TCGAGGGCTA TCCGCCGCCG TTCGATCCGC GCATCGAAGT GATCAAGGTG ACGCCGGATC CGGGCGTCAT CGAGGTCAAT ATTCATCCAG CGGCAAACTG GCGGCAGGCG GTCGAGACCA CTTTTGCGCT TTACGAGGAG GCGGCCAAGG TCCGGCTCGG CGCCAACCGC TTCCTGGTCG ACGGCCGCCA CACCGGCACC GGCGGCGGCA ACCACGTCGT GGTCGGCGGC GCGACGCCTT CCGACTCGCC GTTCCTGCGC CGGCCCGATC TGTTGAAGAG TTTTGTTTTG TATTGGCAGC GCCACCCGTC GCTGTCCTAC CTATTCTCCG GGATGTTCAT CGGCCCGACC AGCCAGGCGC CGCGGATCGA CGAGGCGCGG CACGACGGGC TCTACGAACT CGAGATCGCG GTCGCGCATG TGCCGCCGCC CGGCGTCACC GGACCGCTGT GGCTGGTCGA CCGGCTGTTC CGCAACATCC TGGTCGATAT CACCGGCAAC ACCCACCGCG CCGAGATCTG CATCGACAAG CTGTATTCGC CGGACAGCTC GACCGGCCGG CTCGGCCTGG TGGAATTCCG CGCGCTCGAA ATGCCGCCGG ACCCGCGGAT GAGCCTCGCC CAGCAATTGC TGATCCGGGC GCTGGTCGCC AAACTGTGGC GCGAGCCGCT GGATGGCAAG TTCGTGCGCT GGGGCACCAC GCTGCATGAT CGTTTCATGC TGCCTTACTA CTTGTGGGAG GACTTCAAGG GCGTGCTCGC CGAGTTGCGA GAGTCCGGCT ACGAGTTCTC GCCGGAATGG TTCGAAGCGC AGCTGGAATT CCGCTTTCCG GTGTTCGGCA GCGTCAGCCA TGGCGGCGTC TCGCTGGAAG TGCGCCAGGC TTTGGAGCCC TGGCACGTGA TGGGCGAGGA AGGCTCGGCC GGCGGCACCG TGCGCTATGT CGATTCCTCG GTGGAGCGGC TGCAGCTGAA GACCGAAGGT TTCGTCGAGG GCCGCCATGT GGTGACCTGC AACGGACGGC GGCTGCCGAT GACTTCGACC GGGCGCTCCG GTGAGGCGGT AGCGGCGGTG CGGTTCAAGG CCTGGCAGCC GGCCTCGGGG CTGCATCCGA CCATTCCGGT GCATACCCCG CTGGTGTTCG ACATCGTCGA CACTTGGAAC GGCCGCTCGC TCGGCGGCTG CGTCTATCAC GTCGCCCATC CGGGCGGACG CTCCTACGAG ACCAAGCCGG TCAACTCGTA TGAAGCCGAG GCGCGGCGGC TGGCGCGGTT TCAGGACCAC GGCCACACGC CCGGCAAGGT CGATCCGCCG CGCGAAGAAC GCAGCCTCGA ATTTCCGCTG ACCCTCGATC TGCGGACGCC GCTGCCGCAT TAG
|
Protein sequence | MSIFVALHHV THYKYDRPID LGPQTVRLRP APHTRTPILS YSLKVTPSNH FVNWQQDPQG NWLARFVFPE KASELKIEVD FSAEMTVINP FDFFVEPYAE SFPFAYTNDL QTELAPYLAT EPSGPLFEAY LAEIAREAPS TVNFLVDLNA RLRDRINYII RMEPGIQTPE HTLQTGAGSC RDSAWLLIQT LRRLGLAARF VSGYLIQLRP DIAPIDGKVE VENDFTDLHA WAEVYIPGAG WIGFDATSGM LAGEGHIPVA ATPHYRSAAP ISGMAGFANV DFNFEMSVKR IREVPRITKP FSDAAWARLD QVGEQVDADL KADDVRLTMG GEPTFVSIDD LESPEWNIAA VGPTKRVLGD DLLRRLRERF APGGLLHFGQ GKWYPGESLP RWAFGLYWRK DGVPVWKNPQ LIAPVEGKRP VKIEEAQRFA VDTAKRLGVH SDYILPAFED PAHWLQKEAG LPPNVDPSDS KLADPEERAR MARVFDQGLN VPKGYVLPIQ RWNAEADRWR SERWKFRRGN LFLTPGDSPL GLRLPISSLP HIPEDEFPYT VEQDPLEPRD PLSEAGEASA EAAAKSSAEK PKKKQVPVRT AMSIEVRDGV LCAFMPPVEK LEDYLELIAA VEATAEEMQM QVHVEGYPPP FDPRIEVIKV TPDPGVIEVN IHPAANWRQA VETTFALYEE AAKVRLGANR FLVDGRHTGT GGGNHVVVGG ATPSDSPFLR RPDLLKSFVL YWQRHPSLSY LFSGMFIGPT SQAPRIDEAR HDGLYELEIA VAHVPPPGVT GPLWLVDRLF RNILVDITGN THRAEICIDK LYSPDSSTGR LGLVEFRALE MPPDPRMSLA QQLLIRALVA KLWREPLDGK FVRWGTTLHD RFMLPYYLWE DFKGVLAELR ESGYEFSPEW FEAQLEFRFP VFGSVSHGGV SLEVRQALEP WHVMGEEGSA GGTVRYVDSS VERLQLKTEG FVEGRHVVTC NGRRLPMTST GRSGEAVAAV RFKAWQPASG LHPTIPVHTP LVFDIVDTWN GRSLGGCVYH VAHPGGRSYE TKPVNSYEAE ARRLARFQDH GHTPGKVDPP REERSLEFPL TLDLRTPLPH
|
| |