Gene RPC_1701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_1701 
Symbol 
ID3972528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp1840666 
End bp1843938 
Gene Length3273 bp 
Protein Length1090 aa 
Translation table11 
GC content66% 
IMG OID637924814 
Producttransglutaminase-like 
Protein accessionYP_531579 
Protein GI90423209 
COG category[E] Amino acid transport and metabolism
[S] Function unknown 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases
[COG4196] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.126756 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.484971 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCGATCT TCGTCGCCCT ACACCACGTC ACGCATTACA AATACGACCG CCCCATCGAT 
CTTGGCCCGC AAACCGTGCG GCTGCGGCCG GCGCCGCACA CCAGGACCCC GATCCTGAGC
TATTCGCTCA AGGTCACGCC TTCGAATCAC TTCGTCAATT GGCAGCAAGA CCCGCAGGGC
AATTGGCTGG CGCGCTTCGT GTTTCCGGAG AAGGCCAGCG AACTGAAGAT CGAGGTGGAT
TTCTCCGCCG AGATGACGGT GATCAATCCG TTCGACTTCT TCGTCGAGCC TTATGCCGAA
AGCTTCCCGT TCGCTTATAC CAATGATTTG CAGACCGAGC TTGCGCCCTA TCTCGCCACC
GAGCCGAGCG GGCCGCTGTT CGAGGCCTAT CTCGCCGAGA TCGCCCGCGA GGCGCCGAGC
ACGGTCAACT TCCTGGTCGA TCTCAACGCC CGGCTGCGCG ACCGCATCAA TTACATCATC
CGCATGGAGC CGGGGATCCA GACCCCGGAG CATACGCTGC AGACCGGCGC CGGCTCGTGC
CGTGATTCGG CCTGGCTGTT GATCCAGACG CTGCGCCGGC TCGGCCTCGC GGCGCGCTTC
GTCTCCGGTT ATCTCATTCA GTTGCGGCCC GACATCGCCC CGATCGATGG CAAGGTCGAG
GTCGAGAACG ATTTCACCGA TCTGCACGCC TGGGCCGAGG TCTATATCCC CGGCGCCGGC
TGGATCGGCT TCGACGCCAC CTCGGGGATG CTGGCCGGCG AGGGCCATAT CCCGGTCGCC
GCGACGCCGC ATTATCGCTC GGCGGCGCCG ATCTCCGGCA TGGCCGGCTT CGCCAATGTC
GATTTCAACT TCGAGATGAG CGTGAAGCGG ATCCGCGAGG TGCCGCGGAT CACCAAGCCG
TTCTCCGACG CAGCCTGGGC GCGGCTCGAT CAGGTCGGCG AACAGGTCGA CGCCGATCTG
AAAGCCGACG ACGTCAGGTT GACCATGGGC GGTGAACCGA CCTTCGTCTC GATCGACGAT
CTGGAATCGC CGGAATGGAA TATCGCCGCG GTCGGCCCGA CCAAGCGGGT GCTCGGCGAC
GATCTGCTGC GCCGGCTACG CGAGCGCTTT GCGCCGGGCG GGCTGCTGCA TTTCGGCCAG
GGCAAATGGT ATCCCGGCGA AAGCCTGCCG CGCTGGGCGT TCGGGCTGTA TTGGCGCAAG
GACGGCGTGC CGGTCTGGAA AAACCCGCAG CTGATCGCGC CGGTCGAGGG CAAGCGTCCG
GTCAAGATCG AGGAAGCGCA GCGCTTCGCC GTCGACACCG CCAAGCGGCT CGGCGTTCAT
TCCGACTACA TCCTGCCGGC GTTCGAAGAC CCGGCGCACT GGCTGCAGAA AGAAGCGGGG
CTGCCGCCGA ATGTCGATCC CAGCGATTCC AAGCTCGCCG ATCCGGAAGA GCGCGCGCGG
ATGGCGCGGG TGTTCGATCA GGGCCTCAAC GTGCCGAAGG GCTACGTGCT GCCGATCCAG
CGTTGGAACG CCGAGGCCGA TCGCTGGCGC AGCGAGCGCT GGAAATTCCG CCGCGGCAAT
CTGTTTCTGA CGCCGGGCGA TTCGCCGCTC GGGCTGCGGC TGCCGATCTC GTCATTGCCG
CATATTCCCG AGGACGAATT CCCCTACACC GTCGAGCAGG ATCCGCTGGA GCCGCGCGAT
CCGTTGTCGG AGGCTGGCGA AGCATCGGCC GAGGCCGCGG CGAAGTCGTC CGCGGAAAAG
CCGAAGAAGA AGCAAGTCCC GGTGCGCACC GCGATGTCGA TCGAGGTGCG CGACGGCGTG
CTGTGCGCGT TCATGCCGCC GGTGGAGAAG CTGGAAGATT ATCTCGAACT GATCGCGGCG
GTGGAGGCCA CCGCCGAAGA GATGCAGATG CAGGTCCACG TCGAGGGCTA TCCGCCGCCG
TTCGATCCGC GCATCGAAGT GATCAAGGTG ACGCCGGATC CGGGCGTCAT CGAGGTCAAT
ATTCATCCAG CGGCAAACTG GCGGCAGGCG GTCGAGACCA CTTTTGCGCT TTACGAGGAG
GCGGCCAAGG TCCGGCTCGG CGCCAACCGC TTCCTGGTCG ACGGCCGCCA CACCGGCACC
GGCGGCGGCA ACCACGTCGT GGTCGGCGGC GCGACGCCTT CCGACTCGCC GTTCCTGCGC
CGGCCCGATC TGTTGAAGAG TTTTGTTTTG TATTGGCAGC GCCACCCGTC GCTGTCCTAC
CTATTCTCCG GGATGTTCAT CGGCCCGACC AGCCAGGCGC CGCGGATCGA CGAGGCGCGG
CACGACGGGC TCTACGAACT CGAGATCGCG GTCGCGCATG TGCCGCCGCC CGGCGTCACC
GGACCGCTGT GGCTGGTCGA CCGGCTGTTC CGCAACATCC TGGTCGATAT CACCGGCAAC
ACCCACCGCG CCGAGATCTG CATCGACAAG CTGTATTCGC CGGACAGCTC GACCGGCCGG
CTCGGCCTGG TGGAATTCCG CGCGCTCGAA ATGCCGCCGG ACCCGCGGAT GAGCCTCGCC
CAGCAATTGC TGATCCGGGC GCTGGTCGCC AAACTGTGGC GCGAGCCGCT GGATGGCAAG
TTCGTGCGCT GGGGCACCAC GCTGCATGAT CGTTTCATGC TGCCTTACTA CTTGTGGGAG
GACTTCAAGG GCGTGCTCGC CGAGTTGCGA GAGTCCGGCT ACGAGTTCTC GCCGGAATGG
TTCGAAGCGC AGCTGGAATT CCGCTTTCCG GTGTTCGGCA GCGTCAGCCA TGGCGGCGTC
TCGCTGGAAG TGCGCCAGGC TTTGGAGCCC TGGCACGTGA TGGGCGAGGA AGGCTCGGCC
GGCGGCACCG TGCGCTATGT CGATTCCTCG GTGGAGCGGC TGCAGCTGAA GACCGAAGGT
TTCGTCGAGG GCCGCCATGT GGTGACCTGC AACGGACGGC GGCTGCCGAT GACTTCGACC
GGGCGCTCCG GTGAGGCGGT AGCGGCGGTG CGGTTCAAGG CCTGGCAGCC GGCCTCGGGG
CTGCATCCGA CCATTCCGGT GCATACCCCG CTGGTGTTCG ACATCGTCGA CACTTGGAAC
GGCCGCTCGC TCGGCGGCTG CGTCTATCAC GTCGCCCATC CGGGCGGACG CTCCTACGAG
ACCAAGCCGG TCAACTCGTA TGAAGCCGAG GCGCGGCGGC TGGCGCGGTT TCAGGACCAC
GGCCACACGC CCGGCAAGGT CGATCCGCCG CGCGAAGAAC GCAGCCTCGA ATTTCCGCTG
ACCCTCGATC TGCGGACGCC GCTGCCGCAT TAG
 
Protein sequence
MSIFVALHHV THYKYDRPID LGPQTVRLRP APHTRTPILS YSLKVTPSNH FVNWQQDPQG 
NWLARFVFPE KASELKIEVD FSAEMTVINP FDFFVEPYAE SFPFAYTNDL QTELAPYLAT
EPSGPLFEAY LAEIAREAPS TVNFLVDLNA RLRDRINYII RMEPGIQTPE HTLQTGAGSC
RDSAWLLIQT LRRLGLAARF VSGYLIQLRP DIAPIDGKVE VENDFTDLHA WAEVYIPGAG
WIGFDATSGM LAGEGHIPVA ATPHYRSAAP ISGMAGFANV DFNFEMSVKR IREVPRITKP
FSDAAWARLD QVGEQVDADL KADDVRLTMG GEPTFVSIDD LESPEWNIAA VGPTKRVLGD
DLLRRLRERF APGGLLHFGQ GKWYPGESLP RWAFGLYWRK DGVPVWKNPQ LIAPVEGKRP
VKIEEAQRFA VDTAKRLGVH SDYILPAFED PAHWLQKEAG LPPNVDPSDS KLADPEERAR
MARVFDQGLN VPKGYVLPIQ RWNAEADRWR SERWKFRRGN LFLTPGDSPL GLRLPISSLP
HIPEDEFPYT VEQDPLEPRD PLSEAGEASA EAAAKSSAEK PKKKQVPVRT AMSIEVRDGV
LCAFMPPVEK LEDYLELIAA VEATAEEMQM QVHVEGYPPP FDPRIEVIKV TPDPGVIEVN
IHPAANWRQA VETTFALYEE AAKVRLGANR FLVDGRHTGT GGGNHVVVGG ATPSDSPFLR
RPDLLKSFVL YWQRHPSLSY LFSGMFIGPT SQAPRIDEAR HDGLYELEIA VAHVPPPGVT
GPLWLVDRLF RNILVDITGN THRAEICIDK LYSPDSSTGR LGLVEFRALE MPPDPRMSLA
QQLLIRALVA KLWREPLDGK FVRWGTTLHD RFMLPYYLWE DFKGVLAELR ESGYEFSPEW
FEAQLEFRFP VFGSVSHGGV SLEVRQALEP WHVMGEEGSA GGTVRYVDSS VERLQLKTEG
FVEGRHVVTC NGRRLPMTST GRSGEAVAAV RFKAWQPASG LHPTIPVHTP LVFDIVDTWN
GRSLGGCVYH VAHPGGRSYE TKPVNSYEAE ARRLARFQDH GHTPGKVDPP REERSLEFPL
TLDLRTPLPH