Gene Information |
Locus tag | RPB_1712 |
Symbol | |
ID | 3908237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1947239 |
End bp | 1950583 |
Gene Length | 3345 bp |
Protein Length | 1114 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637883606 |
Product | transglutaminase-like |
Protein accession | YP_485331 |
Protein GI | 86748835 |
COG category | [E] Amino acid transport and metabolism; [S] Function unknown |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases; [COG4196] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
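The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends in one untranslated stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp
length_bp = gene_length(1947239, 1950583)
print(length_bp)                  # 3345, matches "Gene Length"
print(protein_length(length_bp))  # 1114, matches "Protein Length"
```

Under these assumptions, 3345 bp is 1115 codons, of which the last is the stop codon, giving the listed 1114 aa.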
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.347512 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGATCT TCGTCGCCCT ACATCACGTC ACCCACTATA AATATGATCG CCCGGTCGAC ATCGGCCCCC AGACCATTCG GCTCCGGCCT GCGCCGCACA CCCGAACGCC GATTCTGTCG TATTCGCTGA AGGTGACGCC GGCGAACCAC TTCATCAATT GGCAACAGGA CCCGCAGGGC AATTGGCTGG CGCGATTCGT GTTTCCGGAG AAGGCCAACG AGCTCAAGAT CGAGGTCGAT TTCACCGCGG CGATGACGGT GATCAATCCG TTCGATTTTT TCGTCGAAAG CTACGCCGAG ACCTTTCCGT TCGACTACAG CAATGACCTC ATGACCGAAC TCGCGCCGTA TCTGGCGACG ACGGAGCCGG GGCCGCTGTT CAAGGACTAT CTCGCGAGCA TCCCGCGCGA GGCGGAGAGC ACGGTCAATT TCCTGGTCGA TCTGAACGCG AAGCTGCGCG AGCGCATCCG CTACATCATC CGGATGGAGC CCGGCGTGCA GACGCCGGAG GAAACGCTCG CCGCGGGCGC GGGCTCGTGC CGCGATTCGG CGTGGCTGCT GATCCAGACG CTACGTCATA TCGGTCTCGC CGCACGCTTC GTCTCCGGCT ATCTCGTGCA GTTGCGCCCC GACATCGATC CGGTCGAAGG GCCGCGCGAG GTCGAGACCG ACTTCACCGA TCTGCACGCC TGGTGCGAGG TGTATCTGCC GGGCGCCGGC TGGATCGGCT TCGACGTCAC CTCGGGAATG CTCGCCGGCG AGGGCCACAT CCCGGTCGCC GCCACGCCGC ATTACCGGAC CGCGGCGCCG ATCTCGGGCG TGGTCGGCTT CGCCAATGTC GATTTCAATT TCGAGATGAG CGTCAAACGC ATCCGCGAGG CGCCGCGGAT CACCAGGCCG TTCTCGGACG AATCCTGGGC GCGGCTCGAT GCGCTGGGCG ACAAGGTCGA CGCCGATCTC GTCGCCGGCG ACGTCCGGCT GACAATGGGC GGCGAGCCGA CCTTCGTCTC CATTGACGAC ATGGAATCGC CGGAGTGGAA CGTCGCCGCG GTCGGCGGCG CCAAGCGCAT GCTCGCCGAC GATCTGATCC GCCGGCTGCG GACGCGGTTC GCGCCGGGCG GCCTGCTGCA TTTCGGCCAG GGCAAATGGT ATCCGGGCGA AAGCCTGCCG CGCTGGGCGT TCGGCCTGTA CTGGCGCAAG GACGGATTGC CGATCTGGAA GAACGCCGAT CTGATCGCGC CGGTGTTCGG CCAGCGTCCG GCGCGCGTGG AAGAAGCGGA GCAATTCGCG CTCGGCACCG CCAAGCGGCT CGGCATCGAC ACCGATTACG TGTTGCCTGC GTTCGAGGAC CCGAGCCATT GGCTGCAGAA GGAAGCGACC CTGCCGCCCA ACGTCGATCC GCTCGACAAC AAGCTCGCGG ACCCCGAAGA GCGCGCGCGG ATGGCGCGGA TCTTCGATTC CGGCCTGACC ACGCCGAAGG GATTCGTGCT GCCGATTCAG GCCTGGAACG CCGAAGTGCC GACGCCGAAG AAGCGCTGGC GCAGCGAGCG CTGGAAGCTG CGTCGCGGCA ATCTGTTTTT GGTCCCCGGC GATTCACCGA TGGGCCTGCG GCTGCCGATC GCGTCGCTGC CGCACATCCC CGAGGAGGAC TATCCGTTCG TCGTCGAGCG CGATCCGCTG GACGAACGCG ACGCGCTGCC GAGCTATGTG GCGCCGGCCT ATGCGCCGCC GCCCGCCGCG GAACTCGACC AGCTTCCGGT CTACGAGCAG GCGGCCGCAT CGCCCACGGC CCCACATCAA 
AATGTCGAGG AGCAGAAGCT GCGCAAGGGC GGCGTCCGCA CCGCGATGTC GATCGAAATC CGCGAGGGCG TGCTGTGCGC CTTCATGCCG CCCACCGAGA CCATCGAGGA CTATCTCGAA TTGGTCGCGG CGGTGGAAGC GACCGCCGAG GAGATGCAGC TCCAGGTCCA CGTCGAAGGC TATCCGCCGC CGTTCGATCC GCGTATCGAG GTCATCAAGG TGACGCCCGA TCCCGGCGTG ATCGAAGTCA ACATCCACCC CGCCCGGAAC TGGCGCGAGG CGGTGCAGAC CACATTCGGC CTGTATGAAG AGGCGGCCCG GGTGCGTCTC GGCGCCAACC GCTTCCTGAT CGACGGCCGC CACACCGGCA CCGGCGGCGG CAATCATGTC GTGATCGGCG GCGCCAAGCC CGCGGACTCG CCGTTCCTGC GACGGCCGGA TCTGCTGAAG AGCCTGGTGC TGTTCTGGCA GCGGCATCCG TCGTTGTCCT ATCTATTCTC CGGCATGTTC ATCGGCCCGA CCAGCCAGGC GCCGCGGATC GACGAGGCGC GGCACGATTC GCTCTACGAG CTGGAGATCG CGTTGGCGCA TGTGCCGCCG CCCGGCGTCA AAGGCCCGCT GTGGCTGGTC GACCGGCTGT TCCGGCACAT TCTGGTCGAC ATCACCGGCA ACACCCATCG CGCCGAATTG TGCATCGACA AGCTGTATTC GCCCGACAGC CCGACCGGGC GCCTCGGTCT GGTCGAGTTT CGCGCGCTGG AGATGCCGCC CGATCCGCGA ATGAGCCTGG CGCAGCAGCT TTTGATCCGC GCCCTGATCG CGATGCTGTG GAAGCAGCCG CTCGACGGAA AATTCGTGCG CTGGGGCACC ACGTTGCACG ACCGCTTCAT GCTGCCGCAT TTCCTGTGGG AGGATTTTCG CGACGTGCTG GCCGAGCTCG GCCGTGCCGG CTACGCCTTC GAGCCGGAAT GGTTCACCGC GCAGCTCGAA TTCCGCTTTC CCGTCTTCGG CAGCGTCTAT CACGGCGGCG TCACGCTGGA GCTGCGCCAG GCGCTGGAGC CGTGGCACGT GCTCGGCGAA GAAGGCAGCG CCGGCGGCAC CGTGCGATAT GTCGACAGTT CGGTGGAGCG GCTGCAGGTC AAGGCCGAGG GCTTCGTCGA GGGCCGCCAC GTCATCACCT GCAACGGCCG TCGGCTGCCG ATGACGCCGA CCGCGCGCTC CGGCGAGGCG GTGGCGGCGG TGCGGTTCAA GGCCTGGCAG CCGGCGTCGG GGCTGCACCC CACGATACCG GTGCATTCCC CGCTGGTGTT CGACATCGTC GATACCTGGA ACGGCCGCTC GCTCGGCGGC TGCGTCTATC ACGTCGCTCA CCCCGGCGGG CGGTCCTACG AGACCAAGCC GGTCAACTCC TACGAGGCCG AGGCGCGGCG GCTCGCGCGC TTCCAGGATC ACGGTCACAC CCCCGGGCGG ATCGATCCGC CGCAGGAAGA ACGTTCATTT GAATTCCCCC TGACCCTCGA CTTGCGCACG CCGCTGCTGC ATTGA
|
Protein sequence | MSIFVALHHV THYKYDRPVD IGPQTIRLRP APHTRTPILS YSLKVTPANH FINWQQDPQG NWLARFVFPE KANELKIEVD FTAAMTVINP FDFFVESYAE TFPFDYSNDL MTELAPYLAT TEPGPLFKDY LASIPREAES TVNFLVDLNA KLRERIRYII RMEPGVQTPE ETLAAGAGSC RDSAWLLIQT LRHIGLAARF VSGYLVQLRP DIDPVEGPRE VETDFTDLHA WCEVYLPGAG WIGFDVTSGM LAGEGHIPVA ATPHYRTAAP ISGVVGFANV DFNFEMSVKR IREAPRITRP FSDESWARLD ALGDKVDADL VAGDVRLTMG GEPTFVSIDD MESPEWNVAA VGGAKRMLAD DLIRRLRTRF APGGLLHFGQ GKWYPGESLP RWAFGLYWRK DGLPIWKNAD LIAPVFGQRP ARVEEAEQFA LGTAKRLGID TDYVLPAFED PSHWLQKEAT LPPNVDPLDN KLADPEERAR MARIFDSGLT TPKGFVLPIQ AWNAEVPTPK KRWRSERWKL RRGNLFLVPG DSPMGLRLPI ASLPHIPEED YPFVVERDPL DERDALPSYV APAYAPPPAA ELDQLPVYEQ AAASPTAPHQ NVEEQKLRKG GVRTAMSIEI REGVLCAFMP PTETIEDYLE LVAAVEATAE EMQLQVHVEG YPPPFDPRIE VIKVTPDPGV IEVNIHPARN WREAVQTTFG LYEEAARVRL GANRFLIDGR HTGTGGGNHV VIGGAKPADS PFLRRPDLLK SLVLFWQRHP SLSYLFSGMF IGPTSQAPRI DEARHDSLYE LEIALAHVPP PGVKGPLWLV DRLFRHILVD ITGNTHRAEL CIDKLYSPDS PTGRLGLVEF RALEMPPDPR MSLAQQLLIR ALIAMLWKQP LDGKFVRWGT TLHDRFMLPH FLWEDFRDVL AELGRAGYAF EPEWFTAQLE FRFPVFGSVY HGGVTLELRQ ALEPWHVLGE EGSAGGTVRY VDSSVERLQV KAEGFVEGRH VITCNGRRLP MTPTARSGEA VAAVRFKAWQ PASGLHPTIP VHSPLVFDIV DTWNGRSLGG CVYHVAHPGG RSYETKPVNS YEAEARRLAR FQDHGHTPGR IDPPQEERSF EFPLTLDLRT PLLH
|
| |
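The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field could be cleaned, its GC fraction computed, and the gene translated with translation table 11 follows. The helper names are hypothetical, and the `initiator` flag is an assumption of this sketch: it reads the first codon as Met, which is how an alternative start codon such as the GTG at the beginning of this gene is translated.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, in the conventional
# codon order TTT, TTC, TTA, TTG, TCT, ... GGG ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str, initiator: bool = True) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    If `initiator` is True (an assumption of this sketch), the first
    codon is read as Met regardless of its usual meaning.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if (i == 0 and initiator) else aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record
print(translate("GTGTCGATCT TCGTCGCCCT ACATCACGTC"))  # MSIFVALHHV
```

Passing the full "Gene sequence" field to `translate` and comparing the result with the cleaned "Protein sequence" field is one way to check the record, and `gc_fraction` on the same field can be compared with the listed GC content.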