Gene RPB_1712 details


Gene Information

Locus tag: RPB_1712
Symbol:
ID: 3908237
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1947239
End bp: 1950583
Gene Length: 3345 bp
Protein Length: 1114 aa
Translation table: 11
GC content: 66%
IMG OID: 637883606
Product: transglutaminase-like
Protein accession: YP_485331
Protein GI: 86748835
COG category: [E] Amino acid transport and metabolism
              [S] Function unknown
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
        [COG4196] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
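
The coordinate and length fields in this record can be cross-checked against one another. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record for RPB_1712.
length_bp = gene_length_bp(1947239, 1950583)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)  # 3345 1114
```

Both results agree with the Gene Length (3345 bp) and Protein Length (1114 aa) fields listed for this gene.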


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.347512
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGATCT TCGTCGCCCT ACATCACGTC ACCCACTATA AATATGATCG CCCGGTCGAC 
ATCGGCCCCC AGACCATTCG GCTCCGGCCT GCGCCGCACA CCCGAACGCC GATTCTGTCG
TATTCGCTGA AGGTGACGCC GGCGAACCAC TTCATCAATT GGCAACAGGA CCCGCAGGGC
AATTGGCTGG CGCGATTCGT GTTTCCGGAG AAGGCCAACG AGCTCAAGAT CGAGGTCGAT
TTCACCGCGG CGATGACGGT GATCAATCCG TTCGATTTTT TCGTCGAAAG CTACGCCGAG
ACCTTTCCGT TCGACTACAG CAATGACCTC ATGACCGAAC TCGCGCCGTA TCTGGCGACG
ACGGAGCCGG GGCCGCTGTT CAAGGACTAT CTCGCGAGCA TCCCGCGCGA GGCGGAGAGC
ACGGTCAATT TCCTGGTCGA TCTGAACGCG AAGCTGCGCG AGCGCATCCG CTACATCATC
CGGATGGAGC CCGGCGTGCA GACGCCGGAG GAAACGCTCG CCGCGGGCGC GGGCTCGTGC
CGCGATTCGG CGTGGCTGCT GATCCAGACG CTACGTCATA TCGGTCTCGC CGCACGCTTC
GTCTCCGGCT ATCTCGTGCA GTTGCGCCCC GACATCGATC CGGTCGAAGG GCCGCGCGAG
GTCGAGACCG ACTTCACCGA TCTGCACGCC TGGTGCGAGG TGTATCTGCC GGGCGCCGGC
TGGATCGGCT TCGACGTCAC CTCGGGAATG CTCGCCGGCG AGGGCCACAT CCCGGTCGCC
GCCACGCCGC ATTACCGGAC CGCGGCGCCG ATCTCGGGCG TGGTCGGCTT CGCCAATGTC
GATTTCAATT TCGAGATGAG CGTCAAACGC ATCCGCGAGG CGCCGCGGAT CACCAGGCCG
TTCTCGGACG AATCCTGGGC GCGGCTCGAT GCGCTGGGCG ACAAGGTCGA CGCCGATCTC
GTCGCCGGCG ACGTCCGGCT GACAATGGGC GGCGAGCCGA CCTTCGTCTC CATTGACGAC
ATGGAATCGC CGGAGTGGAA CGTCGCCGCG GTCGGCGGCG CCAAGCGCAT GCTCGCCGAC
GATCTGATCC GCCGGCTGCG GACGCGGTTC GCGCCGGGCG GCCTGCTGCA TTTCGGCCAG
GGCAAATGGT ATCCGGGCGA AAGCCTGCCG CGCTGGGCGT TCGGCCTGTA CTGGCGCAAG
GACGGATTGC CGATCTGGAA GAACGCCGAT CTGATCGCGC CGGTGTTCGG CCAGCGTCCG
GCGCGCGTGG AAGAAGCGGA GCAATTCGCG CTCGGCACCG CCAAGCGGCT CGGCATCGAC
ACCGATTACG TGTTGCCTGC GTTCGAGGAC CCGAGCCATT GGCTGCAGAA GGAAGCGACC
CTGCCGCCCA ACGTCGATCC GCTCGACAAC AAGCTCGCGG ACCCCGAAGA GCGCGCGCGG
ATGGCGCGGA TCTTCGATTC CGGCCTGACC ACGCCGAAGG GATTCGTGCT GCCGATTCAG
GCCTGGAACG CCGAAGTGCC GACGCCGAAG AAGCGCTGGC GCAGCGAGCG CTGGAAGCTG
CGTCGCGGCA ATCTGTTTTT GGTCCCCGGC GATTCACCGA TGGGCCTGCG GCTGCCGATC
GCGTCGCTGC CGCACATCCC CGAGGAGGAC TATCCGTTCG TCGTCGAGCG CGATCCGCTG
GACGAACGCG ACGCGCTGCC GAGCTATGTG GCGCCGGCCT ATGCGCCGCC GCCCGCCGCG
GAACTCGACC AGCTTCCGGT CTACGAGCAG GCGGCCGCAT CGCCCACGGC CCCACATCAA
AATGTCGAGG AGCAGAAGCT GCGCAAGGGC GGCGTCCGCA CCGCGATGTC GATCGAAATC
CGCGAGGGCG TGCTGTGCGC CTTCATGCCG CCCACCGAGA CCATCGAGGA CTATCTCGAA
TTGGTCGCGG CGGTGGAAGC GACCGCCGAG GAGATGCAGC TCCAGGTCCA CGTCGAAGGC
TATCCGCCGC CGTTCGATCC GCGTATCGAG GTCATCAAGG TGACGCCCGA TCCCGGCGTG
ATCGAAGTCA ACATCCACCC CGCCCGGAAC TGGCGCGAGG CGGTGCAGAC CACATTCGGC
CTGTATGAAG AGGCGGCCCG GGTGCGTCTC GGCGCCAACC GCTTCCTGAT CGACGGCCGC
CACACCGGCA CCGGCGGCGG CAATCATGTC GTGATCGGCG GCGCCAAGCC CGCGGACTCG
CCGTTCCTGC GACGGCCGGA TCTGCTGAAG AGCCTGGTGC TGTTCTGGCA GCGGCATCCG
TCGTTGTCCT ATCTATTCTC CGGCATGTTC ATCGGCCCGA CCAGCCAGGC GCCGCGGATC
GACGAGGCGC GGCACGATTC GCTCTACGAG CTGGAGATCG CGTTGGCGCA TGTGCCGCCG
CCCGGCGTCA AAGGCCCGCT GTGGCTGGTC GACCGGCTGT TCCGGCACAT TCTGGTCGAC
ATCACCGGCA ACACCCATCG CGCCGAATTG TGCATCGACA AGCTGTATTC GCCCGACAGC
CCGACCGGGC GCCTCGGTCT GGTCGAGTTT CGCGCGCTGG AGATGCCGCC CGATCCGCGA
ATGAGCCTGG CGCAGCAGCT TTTGATCCGC GCCCTGATCG CGATGCTGTG GAAGCAGCCG
CTCGACGGAA AATTCGTGCG CTGGGGCACC ACGTTGCACG ACCGCTTCAT GCTGCCGCAT
TTCCTGTGGG AGGATTTTCG CGACGTGCTG GCCGAGCTCG GCCGTGCCGG CTACGCCTTC
GAGCCGGAAT GGTTCACCGC GCAGCTCGAA TTCCGCTTTC CCGTCTTCGG CAGCGTCTAT
CACGGCGGCG TCACGCTGGA GCTGCGCCAG GCGCTGGAGC CGTGGCACGT GCTCGGCGAA
GAAGGCAGCG CCGGCGGCAC CGTGCGATAT GTCGACAGTT CGGTGGAGCG GCTGCAGGTC
AAGGCCGAGG GCTTCGTCGA GGGCCGCCAC GTCATCACCT GCAACGGCCG TCGGCTGCCG
ATGACGCCGA CCGCGCGCTC CGGCGAGGCG GTGGCGGCGG TGCGGTTCAA GGCCTGGCAG
CCGGCGTCGG GGCTGCACCC CACGATACCG GTGCATTCCC CGCTGGTGTT CGACATCGTC
GATACCTGGA ACGGCCGCTC GCTCGGCGGC TGCGTCTATC ACGTCGCTCA CCCCGGCGGG
CGGTCCTACG AGACCAAGCC GGTCAACTCC TACGAGGCCG AGGCGCGGCG GCTCGCGCGC
TTCCAGGATC ACGGTCACAC CCCCGGGCGG ATCGATCCGC CGCAGGAAGA ACGTTCATTT
GAATTCCCCC TGACCCTCGA CTTGCGCACG CCGCTGCTGC ATTGA
 
Protein sequence
MSIFVALHHV THYKYDRPVD IGPQTIRLRP APHTRTPILS YSLKVTPANH FINWQQDPQG 
NWLARFVFPE KANELKIEVD FTAAMTVINP FDFFVESYAE TFPFDYSNDL MTELAPYLAT
TEPGPLFKDY LASIPREAES TVNFLVDLNA KLRERIRYII RMEPGVQTPE ETLAAGAGSC
RDSAWLLIQT LRHIGLAARF VSGYLVQLRP DIDPVEGPRE VETDFTDLHA WCEVYLPGAG
WIGFDVTSGM LAGEGHIPVA ATPHYRTAAP ISGVVGFANV DFNFEMSVKR IREAPRITRP
FSDESWARLD ALGDKVDADL VAGDVRLTMG GEPTFVSIDD MESPEWNVAA VGGAKRMLAD
DLIRRLRTRF APGGLLHFGQ GKWYPGESLP RWAFGLYWRK DGLPIWKNAD LIAPVFGQRP
ARVEEAEQFA LGTAKRLGID TDYVLPAFED PSHWLQKEAT LPPNVDPLDN KLADPEERAR
MARIFDSGLT TPKGFVLPIQ AWNAEVPTPK KRWRSERWKL RRGNLFLVPG DSPMGLRLPI
ASLPHIPEED YPFVVERDPL DERDALPSYV APAYAPPPAA ELDQLPVYEQ AAASPTAPHQ
NVEEQKLRKG GVRTAMSIEI REGVLCAFMP PTETIEDYLE LVAAVEATAE EMQLQVHVEG
YPPPFDPRIE VIKVTPDPGV IEVNIHPARN WREAVQTTFG LYEEAARVRL GANRFLIDGR
HTGTGGGNHV VIGGAKPADS PFLRRPDLLK SLVLFWQRHP SLSYLFSGMF IGPTSQAPRI
DEARHDSLYE LEIALAHVPP PGVKGPLWLV DRLFRHILVD ITGNTHRAEL CIDKLYSPDS
PTGRLGLVEF RALEMPPDPR MSLAQQLLIR ALIAMLWKQP LDGKFVRWGT TLHDRFMLPH
FLWEDFRDVL AELGRAGYAF EPEWFTAQLE FRFPVFGSVY HGGVTLELRQ ALEPWHVLGE
EGSAGGTVRY VDSSVERLQV KAEGFVEGRH VITCNGRRLP MTPTARSGEA VAAVRFKAWQ
PASGLHPTIP VHSPLVFDIV DTWNGRSLGG CVYHVAHPGG RSYETKPVNS YEAEARRLAR
FQDHGHTPGR IDPPQEERSF EFPLTLDLRT PLLH
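
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino acid assignments of the standard genetic code (which translation table 11 shares) and the alternative start codons listed for NCBI table 11, any of which is read as methionine when it is the first codon. The name `translate_cds` is hypothetical; this is an illustration under those assumptions, not the pipeline that produced the record.

```python
BASES = "TCAG"
# Amino acids in TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons assumed for translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS; whitespace between the 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGTCGATCT TCGTCGCCCT ACATCACGTC"))  # MSIFVALHHV
```

The output matches the first ten residues of the protein sequence, including the GTG start codon being read as M.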