Gene RPD_2551 details

Gene Information

Locus tag: RPD_2551
Symbol:
ID: 4023045
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2850786
End bp: 2855999
Gene Length: 5214 bp
Protein Length: 1737 aa
Translation table: 11
GC content: 68%
IMG OID: 637962747
Product: alpha-2-macroglobulin-like
Protein accession: YP_569682
Protein GI: 91977023
COG category: [R] General function prediction only
COG ID: [COG2373] Large extracellular alpha-helical protein
TIGRFAM ID:
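
The start, end, gene length and protein length fields above are related by simple arithmetic, and the following minimal Python sketch checks that they agree. It assumes 1-based inclusive coordinates and that the gene length includes a single terminal stop codon; neither convention is stated in the record, and the function name check_cds_lengths is hypothetical.

```python
def check_cds_lengths(start_bp: int, end_bp: int,
                      gene_len_bp: int, protein_len_aa: int) -> bool:
    """Return True if the coordinates and lengths of a CDS are consistent.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the gene length counts one terminal stop codon.
    """
    # Inclusive span of the gene on the replicon.
    span = end_bp - start_bp + 1
    # A complete, unspliced CDS should be a whole number of codons.
    codons, remainder = divmod(gene_len_bp, 3)
    # The stop codon is not translated, so the protein is one residue shorter.
    return span == gene_len_bp and remainder == 0 and codons - 1 == protein_len_aa


# Values copied from the record for RPD_2551.
print(check_cds_lengths(2850786, 2855999, 5214, 1737))
```

For this record the span is 2855999 - 2850786 + 1 = 5214 bp, which is 1738 codons, or 1737 residues once the stop codon is excluded.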


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.11715
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGGTT GGGTTCGCGC CATCACGCTT TGCGCCACGC TGGCGCTCGG GTTGGCCACG 
GCACAGGCGG CCGACAAGGC CTTCAAACGC GACGAACTGG CGGATTCCGC GATCAAGCTC
GAAGCCCAGA TCAAGAGCGA GGCGGGGGCG ATCAACAAGC CGGCGGCGAG CCTGCGCACC
GACGCCGACG CCGCCTTGCG GCGCGGGGAT TACCGCACCG GGCTGCAGAT CATGGGCCAG
ATCGCCACCG TCGATCCGGC CGACGGCAGC AATTGGCTGC GGCTCGCCAA GACCGTCTTC
CAGATCAAGG CGCCGACCAG TTCGGAACAG ACCTTCCTGC TGGAGCGCGC CTCGACCGCC
GCCTACATCG CCTATCAGCG CGCCGGCAAT GCCGGGGAGG AGGCCGAGGC GCTGGCGGTG
CTCGGCCGCG CGATGGCCGA CCGGCGGCTG TGGCGCCCGG CGCTCGACGC GCTGCGGCTT
TCGCTCGATC TGCGCGAGGT CGCCGAGGTG CGCGGCCGCT ACGAGAAGCT GCGCGAGGAC
CACGGCTTCA AGCTGCTCGA CTACACCGTG GATTCGGATT CGGCCTCGCC GCGGGCGTGC
TTCCAGTTCT CCGAAGACCT CGCCAAGCGC ACCGACTTCG CGCCCTATCT GGCGCTGGCC
GGCACCGACA AGCCGGCACT GACCTCGGAA GACAAGCAGC TCTGCGTCGA AGGCCTCAAG
CACGGCGAGC GCTACAACAT CAATCTGCGC GCCGGGTTGC CGTCGACCGT CAAGGAGACG
CTGCCGAAAT CGGCCGAGTT CAACATCTAT GTCCGCGACC GCAAGCCGCT GGTGCGCTTC
ACCGGCCGCG CCTATGTGCT GCCGCGCACC GGCCAGCGCG GCATTCCGCT GGTCAGCGTC
AACACCCCGA CGGTGTCGGT GCAGCTGTTC CGGATCGGCG ACCGCAATCT GATCAACACC
GTGGTCGACA GCGACTTCCA GCGCACGCTG AGCCGCTACC AGCTCGACGA TCTCGGCAGC
CAGCGTGGCG CCAAAGTGTG GTCGGGCGAA CTCGACACCG CGCCAGCGGC GCTCAATGCC
GACGTCACCA CGGCGTTTGC GGTGGATCAG GTGCTCGGCG ATCTGCAGCC CGGCGTCTAT
GTGATGACCG CCGCGCCGAA GGGGCCGATC GCCAGTTCGG ACGATGACGG CCAGCTCGCG
ACGCAATGGT TCATCGTGTC CGATCTCGGG CTCACCGCGT TCTCCGGCAA TGACGGCATC
CACGTCTTCG TCAATTCGCT GGCCTCGACT GACCCGGTCG GCAAGGCCGA AGTCCGGCTG
ATCGCGCGCA ACAACGAAAT CCTCGCCACC CGCAAGACCG ACGAGTCCGG CCACGTGCTG
TTCGAGGCCG GGCTGGCGCG CGGCGAGGGC GGGATGTCGC CAGCGCTGCT GACGGCGACC
GGCGACAAGG CCGACTATGC CTTCCTCAGC CTCAAATCCA ACGCCTTCGA CCTGTCCGAC
CGCGGCGTCA CCGGCCGCGC AGTGCCGGCC GGGGCCGACG CCTTCGTCTA TGCCGAGCGC
GGCGTTTATC GCGGCAGCGA GACCGTCTAT CTCACCGCGC TACTGCGCGA AGGCCAGGGC
AACGCCATCG TCGGTGGGCC GATGACGCTG GTGATCGAGC GGCCGGACGG CGTCGAATTC
CGCCGCGCCG TGCTGTCCGA CCAGGGCGCC GGCGGCCGAA GCCTCGCGGT GGCGCTGAAC
TCGGCGGTGC CGACCGGTAC CTGGCGGGTC CGCGCCTTCA CCGATCCGAA GGGCGCCAGC
ATCGGCGAAA CCACCTTCAT GGTCGAGGAC TACGTTCCGG ACCGGATCGA ATTCGATTTC
TCGTCCAAGG ACAAACAGAT CAAAGCTGAT GCTCCAGTGG AACTGAAGGT CGACGGCCGC
TTCCTGTATG GCGCGCCGGC GTCGGGGTTG GCGCTGGAAG GCGACCTGCT GGTCGCGCCG
GCCGCGGGCC GTCCCGGTTT TCCCGGCTAC CAGTTCGGCG TCGCCGATGA GGAGACCACC
AGCAACGAGC GCACCCCGCT GGAGAACCTG CCCGAGGCCG ACGACAATGG CAGCGCGACC
TTCCCGTTGG TTCTGCCGAA GCCGCCGTCG TCGACACGGC CGCAGGAGGC GCAGATCTTC
ATCCGGATGC GCGAGGCCGG CGGCCGTGCC GTCGAGCGCA AGCTGGTGCT GCCGGTCGCG
CCGGCCAGTC CGATGATCGG CGTCAAGCCG CTGTTCGCCG ACAAGAATGT CGCCGACGGC
GACGCCGCCA AGTTCGAGGT CGCCTTCGTC GACCCGGACG GCGCCGCGTT GACGCGCAGC
GGGCTACGCT ACGAACTGCT GAAGATCGAG TCGCATTATC AATGGTACCG GCAGAATTCC
TCATGGGATT TCGAGCCGGT GAAATCGACC AAGCGCGTGG CCGACGGCGA TCTTTCGGTC
ACGCCGGATA AGCCCGGCCA ATTGTCGTTC CAGCCCGAGA CCGGCCGCTA CCGGCTCGAC
GTCAAGACGG CGGACGCCGA TGGTCCGATC ACCTCGGTGC AGTTCGATGT CGGCTGGTAC
TCGGACGGTA GCGCCGACAC CCCCGATCTG TTGGAGACCT CGATCGACAA GCCGGAATAT
GCCTCCGGCG ACAGCATGAC GGTGGCAGTC AACGCCCGGT CCGCCGGCTT GTTGACGGTC
AACGTGCTCG GCGACCGGCT GCTGACGACA CAGTCGTTGG CGGTGAAGCA GGGCTCGTCG
CAGGTCAGGA TCCCGGTCGG CAAGGATTGG GGCTCCGGCG CCTATGTGGT GACGACGCTG
CGCCGACCGC TCGACGCCGC GGCGCAGCGG ATGCCGGGTC GCGCGATCGG CGTGCAATGG
GTCTCGATCG ACAAGAAGGC GCGTACGCTT CAGGTCGCGC TGTCGCCGCC GGCGCTGGTG
CGGCCGTCGA CTACGCTGAA GCTGCCGGTC AAGCTCGGCG GGCTCGCCCC CGGCGAGGAC
GCCAAGATCG TGGTCGCGGC AGTCGACGTC GGCATTCTGA ATCTCACCAA TTACAAGCCG
CCGGCGCCGG ACGATTATTA TCTCGGCCAG CGCCGCATGA CCTCGGAGAT CCGCGATCTT
TATGGCCAAC TGATCGACGG CATGCAGGGC ACCCGTGGCC AGATCCGCTC GGGCGGCGAT
GCCGCCGGCG CCGAGCTGCA GGGCAGCCCG CCGACCCAGA AACCGCTGGC GCTGTATTCC
GGGATCGTCA CCGTCGGCGC TGATGGTGCC GCGGAGATCA GCTTCGATAT TCCGGAATTC
GCCGGCACCG CGCGGGTGAT GGCGGTGGCG TGGACCGCCA CCAAGGTCGG TCGCGCCACT
ATCGACGTCA CCGTGCGTGA CCCTGTGGTG CTGACCACGA CGCTGCCGCG CTTCCTGCGC
AACGGCGATC GCGGCACCAT GGCGTTCGAC CTCGACAATG TCGAAGGCGC GCCCGGCGAC
TTCACCGTCA AGGTGACTGC CAACGGTCCG GTGAAATTGA ACGGTCCTGC GTCCACAACG
ATGAAGCTCG CCGCCAAGCA GCGCGGTTCG GCTCAGTTGT CGGTCGAGGC CGGCGGCGCC
GGCACCGCGA CGCTCGACGT CGCCATCAGC GGTCCGAACG GGCTGACGCT GGCACGGCAC
TACGTGCTCG ACGTCCGCCC CGCGAACCAG ACTCTGGCGC GGCGCGCGAT CCGCACGCTG
GCGAAGCAGG AGAGTCTGAC GCTGACGGCG GACATGTTCG CCGACCTGGT GCCGGGCACC
GGCGGGGTGT CGCTGTCGGT CAGCCAGTCG ACCGCGCTCG ATGCTGCCAC GATCCTCAAG
GCGCTCGATC GCTATCCGTT CGGCTGCTCC GAGCAGATCG CCAGCCGCGC GCTGCCGCTG
CTCTACGTCA ACGATCTCGC CGCCGGCGCG CATCTGGCGA TGGATACGAG CGCCGACGAG
CGGATCAGGA CCTCGATCGA CCGGCTACTG GCGCGGCAGG GCTCGAACGG CTCGTTCGGG
ATGTGGTCGT CGGGTGGCGA CGATCCCTGG CTCGACGCTT ACGTCACCGA CTTCCTGACC
CGGGCGCGCG AGAAGAACTT CGTCGTGCCC GACGTCGCGT TCCGCAGCGC CCTCGACCGC
ATCCGCAACG CTGTGGTGAA TGCCGAGGAG CCGGAGAAGG ACGGCGGCCG CAACCTCGCT
TACGGGCTCT ATGTGCTGGC CCGCAACGGC GCGGCCCCGA TCGGCGATCT GCGCTATCTC
GCCGACACCA AGCTCGACAA GCTGGCGACG CCGATCGCCA AGTCGCAGCT CGCTGCCGCG
CTTGCGCTGG TCGGCGACCG CACCCGCGCC GAGCGGGTCT ATGCCGCGGC GGCCGGCGAC
CTCGCGCCGA AGCCGGTGAT CCAGTTCGGC CGCGTCGACT ATGGCTCGGC GCTACGCGAC
GCTGCAGCGC TGGTGTCGCT CGCCAGCGAA GGCAATGCGC CGAAGGCGAC GCTGACGACC
GCCGTGCAGC GCGTCGAGGC GGCGAGGGGG CTCACGCCCT ATACCTCGAC CCAGGAGAAT
GCCTGGCTGG TGCTGGCGGC GCGCGCGCTC GCCAAGGAGA CCATGAGTCT CGACGTCAAT
GGCAGCGCAT TGAAGTCCGC GGTGTATCGC AACTACAAGG CCGAGGAGGT GCGCGGCCAA
CCGGTTCGGA TCGCCAACAC CGGCGACAGC CCGGTGCAAG CGGTGGTCAC CGTCAGCGGT
TCGCCGGTGA CGCCGGAGCC TGCCGCCAGC AACGGCTTCA AGATCGAGCG CAATTACTTC
ACGCTCGCCG GCGAGCCGGC CGACATCACT AAAGCGAAGC AGAACGACCG CTTCGCGGTG
GTGCTGACGG TCACCGAGGC GAAGCCGGAG TTCGCGCATG TGATGATCGC GGACTATCTG
CCGGCCGGAC TCGAGATCGA CAATCCGCAT CTGGTGTCGT CGGGCGACAG CGGCACGCTG
GACTGGATCG AAAACGGCCA GGAGCCGGTC AACACCGAGT TCCGCGACGA CCGCTTCACC
GCCGCCATCA ACCGCGGCAC CGAAGACAAA GCGGTGTTCA CCGTGGCCTA TGTGGTGCGC
GCGGTGTCGC CCGGCAAATA CGTGCTGCCG CAGGCCATTG TCGAGGACAT GTACAATCCC
TCGCGTTACG GCCGCACCGG CACCGGCGTC GTCGAGGTGC GCGCGGCGAA ATGA
 
Protein sequence
MIGWVRAITL CATLALGLAT AQAADKAFKR DELADSAIKL EAQIKSEAGA INKPAASLRT 
DADAALRRGD YRTGLQIMGQ IATVDPADGS NWLRLAKTVF QIKAPTSSEQ TFLLERASTA
AYIAYQRAGN AGEEAEALAV LGRAMADRRL WRPALDALRL SLDLREVAEV RGRYEKLRED
HGFKLLDYTV DSDSASPRAC FQFSEDLAKR TDFAPYLALA GTDKPALTSE DKQLCVEGLK
HGERYNINLR AGLPSTVKET LPKSAEFNIY VRDRKPLVRF TGRAYVLPRT GQRGIPLVSV
NTPTVSVQLF RIGDRNLINT VVDSDFQRTL SRYQLDDLGS QRGAKVWSGE LDTAPAALNA
DVTTAFAVDQ VLGDLQPGVY VMTAAPKGPI ASSDDDGQLA TQWFIVSDLG LTAFSGNDGI
HVFVNSLAST DPVGKAEVRL IARNNEILAT RKTDESGHVL FEAGLARGEG GMSPALLTAT
GDKADYAFLS LKSNAFDLSD RGVTGRAVPA GADAFVYAER GVYRGSETVY LTALLREGQG
NAIVGGPMTL VIERPDGVEF RRAVLSDQGA GGRSLAVALN SAVPTGTWRV RAFTDPKGAS
IGETTFMVED YVPDRIEFDF SSKDKQIKAD APVELKVDGR FLYGAPASGL ALEGDLLVAP
AAGRPGFPGY QFGVADEETT SNERTPLENL PEADDNGSAT FPLVLPKPPS STRPQEAQIF
IRMREAGGRA VERKLVLPVA PASPMIGVKP LFADKNVADG DAAKFEVAFV DPDGAALTRS
GLRYELLKIE SHYQWYRQNS SWDFEPVKST KRVADGDLSV TPDKPGQLSF QPETGRYRLD
VKTADADGPI TSVQFDVGWY SDGSADTPDL LETSIDKPEY ASGDSMTVAV NARSAGLLTV
NVLGDRLLTT QSLAVKQGSS QVRIPVGKDW GSGAYVVTTL RRPLDAAAQR MPGRAIGVQW
VSIDKKARTL QVALSPPALV RPSTTLKLPV KLGGLAPGED AKIVVAAVDV GILNLTNYKP
PAPDDYYLGQ RRMTSEIRDL YGQLIDGMQG TRGQIRSGGD AAGAELQGSP PTQKPLALYS
GIVTVGADGA AEISFDIPEF AGTARVMAVA WTATKVGRAT IDVTVRDPVV LTTTLPRFLR
NGDRGTMAFD LDNVEGAPGD FTVKVTANGP VKLNGPASTT MKLAAKQRGS AQLSVEAGGA
GTATLDVAIS GPNGLTLARH YVLDVRPANQ TLARRAIRTL AKQESLTLTA DMFADLVPGT
GGVSLSVSQS TALDAATILK ALDRYPFGCS EQIASRALPL LYVNDLAAGA HLAMDTSADE
RIRTSIDRLL ARQGSNGSFG MWSSGGDDPW LDAYVTDFLT RAREKNFVVP DVAFRSALDR
IRNAVVNAEE PEKDGGRNLA YGLYVLARNG AAPIGDLRYL ADTKLDKLAT PIAKSQLAAA
LALVGDRTRA ERVYAAAAGD LAPKPVIQFG RVDYGSALRD AAALVSLASE GNAPKATLTT
AVQRVEAARG LTPYTSTQEN AWLVLAARAL AKETMSLDVN GSALKSAVYR NYKAEEVRGQ
PVRIANTGDS PVQAVVTVSG SPVTPEPAAS NGFKIERNYF TLAGEPADIT KAKQNDRFAV
VLTVTEAKPE FAHVMIADYL PAGLEIDNPH LVSSGDSGTL DWIENGQEPV NTEFRDDRFT
AAINRGTEDK AVFTVAYVVR AVSPGKYVLP QAIVEDMYNP SRYGRTGTGV VEVRAAK