Gene RPD_3585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3585 
Symbol 
ID4024099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3994892 
End bp3998278 
Gene Length3387 bp 
Protein Length1128 aa 
Translation table11 
GC content67% 
IMG OID637963789 
Producttransglutaminase-like 
Protein accessionYP_570709 
Protein GI91978050 
COG category[E] Amino acid transport and metabolism
[S] Function unknown 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases
[COG4196] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.222679 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.726735 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTTAGCGC TTGATTGCGT GCTGCCCCGG TGTTCTGCTC AAAAAGAATG CTGGAGTTGT 
CCTGTGTCGA TCTTCGTCGC CCTTCATCAC GTCACGCATT ATAAATATGA CCGTCCGGTC
GATATCGGCC CCCAGACCAT CCGGCTTCGC CCCGCGCCGC ACACCCGCAC GCCGATTCTG
TCGTATTCGC TGAAGGTCAC GCCGGCGAAC CACTTCATCA ACTGGCAGCA GGACCCGCAG
GGCAACTGGC TGGCGCGGTT CGTGTTTCCG GAGAAGGCCA ACGAACTCAA GATCGAGGTC
GATTTCACCG CGGCGATGAC GGTGATCAAT CCGTTCGACT TCTTCGTCGA AAGTTATGCC
GAGACCTTCC CGTTCGCCTA CAGCAACGAT CTGATGGTCG AGCTTGCGCC GTATCTGGCG
ACGACCGAGC CGGGGCCGCT GTTGCGCGAC TATCTCGCGA GCATTCCGCG CGAGGCGGAC
AGCACCGTCA ATTTTCTGGT CGACCTCAAT GCGAAATTGC GCGAGCGCAT CCGCTACATC
ATCCGGATGG AGCCGGGGGT GCAGACGCCG GAGGAAACCC TCGCGGCCGG CGCCGGCTCG
TGCCGCGATT CGGCGTGGCT GCTGATCCAG ACGCTGCGCC ATATCGGCCT CGCGGCGCGC
TTCGTGTCCG GTTATCTGGT GCAGTTGCGG CCCGACATCG ATCCGGTCGA AGGCCCGCGC
GAGGTCGAGA ACGACTTCAC CGATCTGCAT GCGTGGTGCG AAGTCTATCT GCCGGGCGCC
GGCTGGATCG GCTTCGACGT CACTTCGGGG ATGCTGGCGG GCGAGGGCCA TATCCCGGTC
GCCGCCACGC CGCACTATCG GACCGCGGCG CCGATCTCCG GCGTGGTCGG CTTCGCCAAT
GTCGATTTCA ATTTCGAGAT GAGCGTCAAG CGCATCCACG AGGCGCCGCG CATCACCCGG
CCGTTTTCCG ACGAATCCTG GGCGCGGCTC GATGCGCTCG GCGACAAGGT CGACGCCGAT
CTCGCGGCCG CCGACGTGCG GCTGACGATG GGCGGTGAGC CGACCTTCGT GTCGATCGAC
GACATGGAAT CGCCGGAGTG GAATGTCGCC GCGGTCGGCG GCGCCAAGCG GATGCTCGCC
GACGATCTGA TCCGCCGGCT GCGCACGCGA TTCGCGCCGG GCGGGCTGCT GCATTTCGGC
CAGGGCAAAT GGTACCCGGG CGAAAGCCTG CCGCGCTGGG CGTTCGGCCT GTACTGGCGC
AAGGACGGCG TGCCGATCTG GAAGAATGCC GACTTGATCG CGCCGGTGGT CGGCCAGCGC
CCGGCGAAGG TGGAGGAAGC CGAGCAGTTC GCGATCGGCA CCGCCAAGCG GCTCGGCATC
GACACCGACT ACGTGCTGCC GGCGTTCGAG GACCCGAACC ACTGGCTGCA GAAGGAGGCG
GGACTGCCTC CGAATGTCGA TCCCTCCGAC AGCAAGCTCG CCGACCCCGA AGAGCGCGCC
CGGATGGCGC GGGTGTTCGA TTCCGGGCTG ACCACGCCGA AAGGATTCGT GCTGCCGATT
CAGGCTTGGA ACGCCGAGGT TCCGAAGCAG CACAAGCGCT GGCGCAGCGA GCGCTGGAAG
CTGCGCCGCG GCAATCTGTT TCTGATGCCG GGCGACTCGC CGATGGGCTT GCGGCTGCCG
ATCGCGTCGC TGCCGCATAT TCCGGAAGAG GATTATCCCT TCATCGTCGA GCGCGATCCG
CTCGATCCGC GCGGCGCGCT GCCGGCGTTC ACGCCGCCGC CCGCGGCCGA ACCGATGCCC
GTGTACGAGC AGACGCCGGC GCCGTCGGGC GCGGCGAACC AGGCCGTCGC GGAGCAGAAG
CTGCGCAGCG GCGGCGTCCG CACCGCGATG TCGATCGAGA TTCGCGAGGG CGTGCTGTGC
GCCTTCATGC CGCCCACCGA GACCATCGAG GACTATCTCG AGCTTGTCTC CGCGGTGGAA
GCGACCGCGG AGGAGATGCA GCTTCAGGTC CATGTCGAAG GCTACCCGCC GCCGTTCGAT
CCGCGCATCG AAGTCATCAA GGTGACGCCC GACCCCGGCG TGATCGAGGT CAACATCCAT
CCGGCGCGGA ATTGGCGCGA GGCGGTGACG ACCACTTTCG GCCTGTATGA AGAAGCCGCA
CAGGTGCGGC TCGGCGCCAA CCGCTTCCTG ATCGACGGCC GCCACACCGG CACCGGCGGC
GGCAATCATG TGGTGATCGG CGGAGCGAAG CCCGCGGATT CGCCATTCCT GCGCCGGCCG
GATCTGCTGA AGAGCCTGGT GCTGTTCTGG CAGCGGCATC CCGCTCTGTC CTATCTGTTC
TCCGGCATGT TCATCGGCCC GACCAGCCAG GCGCCGCGGA TCGACGAAGC GCGTCACGAC
TCGCTTTACG AACTGGAGAT CGCGCTGGCG CATGTGCCGC CGCCGGGAGT CAAAGGCCCG
TTGTGGCTGG TCGACCGGCT GTTCCGGCAC ATCCTGGTCG ACATCACCGG CAACACCCAC
CGCGCCGAGC TGTGCATCGA CAAGCTGTAT TCGCCGGACA GCCCGACCGG CCGCCTCGGC
CTCGTTGAAT TCCGCGCGCT CGAGATGCCG CCCGATCCGC GGATGTCGCT GGCGCAGCAA
TTGTTGATCC GCGCGCTGAC CGCGAAGCTG TGGCGCGAGC CGCTCGACGG CAAGTTCGTT
CGCTGGGGCA CTACGCTGCA CGATCGTTTC ATGCTGCCGC ATTTCCTCTG GGAGGATTTC
CGCGACGTGC TGGCCGAACT CGGCCGCGCC GGCTACGCGT TCGAGCCGGA ATGGTTCAGC
GCGCAGCTCG AATTCCGCTT TCCGGTGTTC GGCAGCGTCT ATCACGGCGG CGTCACGCTG
GAGCTGCGGC AGGCGCTGGA GCCGTGGCAC GTGCTCGGCG AAGAAGGCAG CGCCGGCGGC
ACGGTGCGCT ATGTCGATAG TTCGGTCGAG CGGTTGCAGG TCAAGGCCGA GGGCTTCGTC
GAGGGCCGCC ATGTCATCAC CTGCAACGGC CGCCGGCTGC CGATGACGCC GACCGCGCGC
TCCGGCGAGG CGGTGGCGGC GGTGCGATTC AAGGCCTGGC AGCCGGCCTC CGGTCTCCAC
CCCACGATAC CGGTGCACTC GCCGCTGGTG TTCGACATCG TCGACAGCTG GAACGGCCGC
TCGCTCGGCG GTTGCGTCTA TCATGTCGCC CATCCGGGCG GGCGCTCCTA CGAGACCAAG
CCGGTCAATT CCTACGAGGC CGAGGCGCGC CGGCTGGCGC GCTTCCAGGA CCACGGCCAC
ACGCCGGGGC GGATCGATCC GCCCCCTGAA GAACGCACAT TAGAATTCCC CCTGACCCTC
GACTTGCGCA CGCCGCTGCT GCATTGA
 
Protein sequence
MLALDCVLPR CSAQKECWSC PVSIFVALHH VTHYKYDRPV DIGPQTIRLR PAPHTRTPIL 
SYSLKVTPAN HFINWQQDPQ GNWLARFVFP EKANELKIEV DFTAAMTVIN PFDFFVESYA
ETFPFAYSND LMVELAPYLA TTEPGPLLRD YLASIPREAD STVNFLVDLN AKLRERIRYI
IRMEPGVQTP EETLAAGAGS CRDSAWLLIQ TLRHIGLAAR FVSGYLVQLR PDIDPVEGPR
EVENDFTDLH AWCEVYLPGA GWIGFDVTSG MLAGEGHIPV AATPHYRTAA PISGVVGFAN
VDFNFEMSVK RIHEAPRITR PFSDESWARL DALGDKVDAD LAAADVRLTM GGEPTFVSID
DMESPEWNVA AVGGAKRMLA DDLIRRLRTR FAPGGLLHFG QGKWYPGESL PRWAFGLYWR
KDGVPIWKNA DLIAPVVGQR PAKVEEAEQF AIGTAKRLGI DTDYVLPAFE DPNHWLQKEA
GLPPNVDPSD SKLADPEERA RMARVFDSGL TTPKGFVLPI QAWNAEVPKQ HKRWRSERWK
LRRGNLFLMP GDSPMGLRLP IASLPHIPEE DYPFIVERDP LDPRGALPAF TPPPAAEPMP
VYEQTPAPSG AANQAVAEQK LRSGGVRTAM SIEIREGVLC AFMPPTETIE DYLELVSAVE
ATAEEMQLQV HVEGYPPPFD PRIEVIKVTP DPGVIEVNIH PARNWREAVT TTFGLYEEAA
QVRLGANRFL IDGRHTGTGG GNHVVIGGAK PADSPFLRRP DLLKSLVLFW QRHPALSYLF
SGMFIGPTSQ APRIDEARHD SLYELEIALA HVPPPGVKGP LWLVDRLFRH ILVDITGNTH
RAELCIDKLY SPDSPTGRLG LVEFRALEMP PDPRMSLAQQ LLIRALTAKL WREPLDGKFV
RWGTTLHDRF MLPHFLWEDF RDVLAELGRA GYAFEPEWFS AQLEFRFPVF GSVYHGGVTL
ELRQALEPWH VLGEEGSAGG TVRYVDSSVE RLQVKAEGFV EGRHVITCNG RRLPMTPTAR
SGEAVAAVRF KAWQPASGLH PTIPVHSPLV FDIVDSWNGR SLGGCVYHVA HPGGRSYETK
PVNSYEAEAR RLARFQDHGH TPGRIDPPPE ERTLEFPLTL DLRTPLLH