Gene RPB_2919 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2919 
Symbol 
ID3910715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3328832 
End bp3334045 
Gene Length5214 bp 
Protein Length1737 aa 
Translation table11 
GC content69% 
IMG OID637884822 
ProductAlpha-2-macroglobulin-like protein 
Protein accessionYP_486532 
Protein GI86750036 
COG category[R] General function prediction only 
COG ID[COG2373] Large extracellular alpha-helical protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGGTT TGGTTCGCGC CATCACGCTT TGCGCCACGC TGGCGCTCGG GCTGGCCACC 
GCGCAGGCCG CCGACAAGGC GTTCAAACGC GACGATCTGG CGGATTCGGC GATCAAGCTC
GAGGCCCAGA TCAAGAGCGA GGCCGGCGCC ATCGCCAAGC CGGCGGCGGG CCTGCGCACC
GACGCCGACG CCGCCTTCAA GCGCGGCGAC TACCGCACCG GGCTGCAGAT CATGGGCCAG
ATCGCCACCG TCGATCCGTC CGACGGCAGC AACTGGCTGC GACTCGCCAA GACCGTGTTC
CAGATCAAGG CGCCGACCAG CTCGGAACGG ACCTTCCTGC TGGAGCGCGC CTCGACCGCC
GCCTATCTCG CCTATCAGCG CGCCGGCAAT GCCGGCGAGG AGGCCGAGGC GCTGGCGGTG
CTCGGCCGCG CCATGGCCGA CCGGCGGCTG TGGCGGCCGG CGCTGGATGC GCTGCGGCTG
TCGCTCGACC TGCGCGAAGT CGCCGAGGTT CGCGGCCAAT ACGAGAAGCT GCGCGACGAT
CACGGTTTCC GGCTGCTCGA TTATACCGTC GACTCGGATT CGGCGTCGCC GCGGGCGTGC
TTCCAGTTCT CCGAGGACCT CGCCAAGCGC ACCGACTTCG CGCCCTATTT GGCGCTGGCC
GGCACCGACA AGCCGGCGCT GACCTCCGAA GACAAGCAGC TCTGCGTCGA AGGGCTCAAG
CACGGCGAGC GCTACAACAT CAATCTGCGC GCCGGACTGC CGTCGACCGT CAAGGAGACG
CTGCCGAAAT CGGCCGAGTT CAACATCTAT GTCCGCGACC GCAAGCCGCT GGTGCGCTTC
ACCGGCCGCG CCTATGTGCT GCCGCGCACC GGCCAGCGCG GCATTCCGCT CGTCAGCGTC
AACACGCCCA CGGTGTCGGC GCAGGTGTTC CGGATCGGCG ACCGCAATTT GATCAACACC
GTGGTCGACA GCGACTTCCA GCGCACGCTG AGCCGCTACC AGCTCGACGA GCTCGGCAGC
CAGCGCGGCG TCAAGGTGTG GTCGGGCGAA CTCGACACCG CGCCGGCGGC GCTCAATGCC
GACGTCACCA CCGCGTTCGC GGTGGATCAG GTGCTCGGCG ACCTGCAACC CGGCGTCTAT
GTGATGACCG CGTCGCCGAA AGGGCCGGTC GCCAATGCCG ACGACGACGG CCAGCTCGCG
ACGCAATGGT TCATCGTCTC CGATCTCGGC GTCACCGCGT TCTCCGGCAA TGACGGCATC
CACGTCTTCG TCAATTCGCT GGCCTCGACC GACCCGGTCG GCAAAGCCGA TGTCCGGCTG
ATCGCCCGCA ACAACGAGAT TCTGGCCACC CGCAAAACCG ACGCCTCCGG CCACGTGCTG
TTCGAGGCCG GGCTGGCGCG CGGCGAGGGC GGGATGTCGC CGGCACTGCT GACGGTCACC
GGCGAGAAGG CCGACTATGC CTTCCTCAGC CTCAAATCCA ACGCCTTCGA CCTGTCCGAC
CGCGGCGTCA CCGGCCGCGC GGTGCCGGCC GGCGCCGACG CCTTCGTCTA TGCCGAGCGC
GGCGTCTATC GCGGCAGCGA GACCGTGCAT CTCACCGCGC TGCTGCGCGA CGGCCAGGGC
AACGCGCTCG TCGGCGGCCC GATGACGCTG GTGATCGAGC GGCCGGACGG CGTCGAATTC
CGCCGCGCCG TGCTGTCCGA TCAGGGCGCC GGCGGCCGCA GCCTCGACGT CGCGCTCAAC
TCGGCGGTGC CGACCGGCAC CTGGCGGGTG CGCGCCTTCA CCGACCCGAA GGGCGCCAGC
ATCGGCGAGA CCACCTTCAT GGTCGAGGAC TACGTCCCCG ACCGGATCGA ATTCGATATC
TCCACCAAGG ACAAGCAGAT CAAAGCTGAT GCTCCTGTGG AACTGAAGGT CGACGGTCGC
TTCCTGTATG GCGCGCCGGC CTCCGGGCTG GCGCTGGAAG GCGACCTGCT GGTCGCGCCG
GCCGCGAGTC GTCCCGGTTT TCCCGGCTAC CAGTTCGGCG TCGCCGACGC GGAGACCACC
AGCAACGAGC GCGCCCCGCT GGAGAACCTG CCCGAAGCCG ACGACAACGG CAGCGCGACC
TTCCCGTTGG TCCTGCCGAA GCCGCCGTCG TCGACCCGGC CGCAGGAGGC GCAGATCTTC
ATCCGGATGC GCGAGGCCGG CGGCCGCGCC GTCGAGCGCA AGCTGGTGCT GCCGGTCGCG
CCGACTGCGG CGATGATCGG CGTCAAGCCG CTGTTCGCCG ACAAGAACGT CGCCGACGGC
GACGCCGCCA AGTTCGAGGT CGCCTTCGTC GATCCGGACG GCACCGCGCT GACGCGATCC
GGCCTGCGCT ACGAGCTGCT GAAAATTGAA TCGCATTATC AATGGTACCG GCAGAATTCC
TCATGGGATT TCGAGCCGGT GAAATCGACC AAGCGGGTCG CCGATGGCGA CCTTGCGGTC
GCGCCCGGCC AACCCGGACA ATTGTCGTTC CAGCCCGAAA GCGGCCGCTA CCGGCTCGAC
GTCAAGACCG CCGATGCCGA CGGCCCGGTC ACCTCGGTGC AGTTCGACGT CGGCTGGTAT
TCCGACGGCT CGGCCGATAC GCCCGATCTG CTGGAGACCT CCGTCGACAA ACCCGAATAC
GCCTCCGGCG ACACCATGAC GGTGACGGTC AATGCGCGGA CCGCAGGGCT GCTGACCGTC
AACGTGCTCG GCGACCGGCT GCTGACGACG CAGTCGGTGG CGGTGAAGCA GGGCACGTCG
CAGGTCAAGA TCCCGGTCGG CAAGGATTGG GGCACCGGCG CCTATGTGGT GACGACGCTG
CGCCGGCCGC TCGATGCCGC CGCCCAGCGG ATGCCCGGCC GCGCCATCGG CGTGCAATGG
GTGTCGATCG ACAGGAAGGC GCGGACGCTG CAGGTCGCGC TGTCGCCGCC GGCGCTGGTG
CGGCCGTCGA CCACGTTGAA ACTGCCGGTC AAGCTCGGCG GACTCGCGCC CGGCGAGGAC
GCCAAGATCG TGGTCGCCGC GGTCGACGTC GGCATTCTCA ACCTCACCAA CTACAAGCCG
CCGGCGCCGG ACGATTATTA TCTCGGCCAG CGCCGCATGA CCTCCGAGAT CCGCGATCTC
TATGGCCAAT TGATCGACGG CATGCAGGGC ACGCGCGGCC AGATTCGCTC CGGCGGCGAC
GGCGCCGGCG CCGAGCTGCA GGGCAGCCCG CCGACGCAGA AGCCGCTGGC GCTGTATTCG
GGGATCGTCA CCGTCGGAGC CGACGGCAGC GCCGAGATCA GCTTCGAGAT TCCTGAATTC
GCCGGCACCG CGCGGGTGAT GGCGGTGGCG TGGACCGCCA CCAAGGTCGG CCGCGCCAAT
GTTGACGTGA CGGTGCGCGA CCCGGTGGTG CTGACCACGA CGCTGCCGCG CTTCCTGCGC
AATGGCGATC GCGGCACCAT GGCGTTCGAC CTCGACAATG TCGAAGGCGC GCCCGGTGAT
TTCACCATCA AGGTGACCGC GACGGGGCCC GTGAAGTTCG CAGGGCCGGC GTCGACCACG
CTGAAGCTCG CCGCCAAACA GCGCGGCTCG GCCTCGCTCG CCGTCGAGGC CGGCGGCGCC
GGCACCGCAG CGCTCGACGT CGCCATCAGC GGCCCGAACG GGCTGACGCT GGCACGGCAC
TACGCGCTCG ACGTCCGCCC CGCCAACCAG ACGCTGGCGC GGCGTTCGAT CCGCACGCTC
GCCAAGCAGG AAAGCCTGAC GCTGACCTCG GACATGTTCG CCGATCTGGT GCCGGGCACC
GGCGGGGTGT CACTGTCGGT TAGTCTCTCG ACCGCGCTGG ACGCGGCGAC CATTCTCAAG
GCGCTCGATC GCTATCCGTT CGGCTGCTCC GAGCAGATCG CCAGCCGCGC GCTGCCGCTG
TTGTACGTCA ACGATCTCGC CGCCGGCGCG CATCTGGCGA TGGATGCGAG CGCCGACGAG
CGGATCAGGA CCTCGATCGA TCGGCTGCTG GCGCGGCAGG GCTCGAATGG TTCATTCGGG
CTGTGGTCGA CCGGCGGCGA CGATGCCTGG CTCGATGCTT ACGTCACCGA CTTCCTGACC
CGCGCCCGCG AGAAGAACTT CGTGGTGCCG GACGTGGCGT TCCGCAGCGC GCTCGACCGC
ATCCGCAACG CGGTGGTGAA CGCCGAGGAG CCGGAGAAGG ACGGCGGCCG CAATCTCGCT
TACGGGCTGT ATGTGCTGGC CCGCAACGGC GCGGCGCCGA TCGGCGATCT GCGCTATCTC
GCCGACACCA AGCTCGACAA GCTGGCGACG CCGATCGCCA AGGCGCAGCT CGCCGCCGCG
CTGGCGCTGG TTGGGGATCG CGCGCGTGCG GAGCGGGTCT ATGCGGTGGC GGCGGGCGAT
CTCGCGCCGA AGCCGGTGAT CCAGTTCGGC CGGGTCGACT ACGGCTCGGC ACTGCGCGAC
GCCGCGGCAT TGGTGTCGCT CGCCAGCGAG GGCAACGCGC CGAAAGCGAC GCTGACCACG
GCGGTGCAGC GCGTCGAAGC GGCGAGGGGC CTGACGCCCT ACACCTCGAC GCAGGAAAAT
GCCTGGCTGG TGCTGGCGGC GCGCGCGCTC GCCAAGGAGA CGATGAGCCT CGACATCAAC
GGCAGCGCGG CGAAGTCCGC GGTGTATCGC AGCTACAAGG CCGACGAGCT GCGCGGCCAG
CCGATCCGCA TCGCCAACAC CGGCGACGCG CCGGTGCAGG CGGTGGTCAC CGTCAGCGGT
TCGCCGGTGA CGCCGGAGCC GGCCGCCAGC AACGGCTTCA AGATCGAGCG CAACTACTTC
ACGCTCAGTG GCGAGCCGGC CGACATCACC AAGGCGAAGC AGAACGATCG CTTCGCGGTG
GTGCTGACGG TCACCGAGGC CAAGCCGGAG TACGGTCACA TCATGGTGGC GGACTATCTG
CCGGCCGGCC TCGAGATCGA CAATCCGCAT CTGGTGTCGT CCGGCGATAG CGGCACGCTC
GACTGGATCG AGAACGGCGA GGAGCCGGTC AACACCGAAT TCCGCGACGA CCGCTTCACC
GCCGCGATCG ACCGTGGCAC CGAGGACAAG GCGGTGTTCA CGGTGGCCTA TATCGTCCGC
GCGGTGTCGC CCGGCAAATA CGTGCTCCCG CAGGCCATCG TCGAGGACAT GTACAACCCC
TCGCGTTACG GCCGCACCGG CACCGGCAGC GTCGAGGTGA CTAAGGCGAA ATGA
 
Protein sequence
MIGLVRAITL CATLALGLAT AQAADKAFKR DDLADSAIKL EAQIKSEAGA IAKPAAGLRT 
DADAAFKRGD YRTGLQIMGQ IATVDPSDGS NWLRLAKTVF QIKAPTSSER TFLLERASTA
AYLAYQRAGN AGEEAEALAV LGRAMADRRL WRPALDALRL SLDLREVAEV RGQYEKLRDD
HGFRLLDYTV DSDSASPRAC FQFSEDLAKR TDFAPYLALA GTDKPALTSE DKQLCVEGLK
HGERYNINLR AGLPSTVKET LPKSAEFNIY VRDRKPLVRF TGRAYVLPRT GQRGIPLVSV
NTPTVSAQVF RIGDRNLINT VVDSDFQRTL SRYQLDELGS QRGVKVWSGE LDTAPAALNA
DVTTAFAVDQ VLGDLQPGVY VMTASPKGPV ANADDDGQLA TQWFIVSDLG VTAFSGNDGI
HVFVNSLAST DPVGKADVRL IARNNEILAT RKTDASGHVL FEAGLARGEG GMSPALLTVT
GEKADYAFLS LKSNAFDLSD RGVTGRAVPA GADAFVYAER GVYRGSETVH LTALLRDGQG
NALVGGPMTL VIERPDGVEF RRAVLSDQGA GGRSLDVALN SAVPTGTWRV RAFTDPKGAS
IGETTFMVED YVPDRIEFDI STKDKQIKAD APVELKVDGR FLYGAPASGL ALEGDLLVAP
AASRPGFPGY QFGVADAETT SNERAPLENL PEADDNGSAT FPLVLPKPPS STRPQEAQIF
IRMREAGGRA VERKLVLPVA PTAAMIGVKP LFADKNVADG DAAKFEVAFV DPDGTALTRS
GLRYELLKIE SHYQWYRQNS SWDFEPVKST KRVADGDLAV APGQPGQLSF QPESGRYRLD
VKTADADGPV TSVQFDVGWY SDGSADTPDL LETSVDKPEY ASGDTMTVTV NARTAGLLTV
NVLGDRLLTT QSVAVKQGTS QVKIPVGKDW GTGAYVVTTL RRPLDAAAQR MPGRAIGVQW
VSIDRKARTL QVALSPPALV RPSTTLKLPV KLGGLAPGED AKIVVAAVDV GILNLTNYKP
PAPDDYYLGQ RRMTSEIRDL YGQLIDGMQG TRGQIRSGGD GAGAELQGSP PTQKPLALYS
GIVTVGADGS AEISFEIPEF AGTARVMAVA WTATKVGRAN VDVTVRDPVV LTTTLPRFLR
NGDRGTMAFD LDNVEGAPGD FTIKVTATGP VKFAGPASTT LKLAAKQRGS ASLAVEAGGA
GTAALDVAIS GPNGLTLARH YALDVRPANQ TLARRSIRTL AKQESLTLTS DMFADLVPGT
GGVSLSVSLS TALDAATILK ALDRYPFGCS EQIASRALPL LYVNDLAAGA HLAMDASADE
RIRTSIDRLL ARQGSNGSFG LWSTGGDDAW LDAYVTDFLT RAREKNFVVP DVAFRSALDR
IRNAVVNAEE PEKDGGRNLA YGLYVLARNG AAPIGDLRYL ADTKLDKLAT PIAKAQLAAA
LALVGDRARA ERVYAVAAGD LAPKPVIQFG RVDYGSALRD AAALVSLASE GNAPKATLTT
AVQRVEAARG LTPYTSTQEN AWLVLAARAL AKETMSLDIN GSAAKSAVYR SYKADELRGQ
PIRIANTGDA PVQAVVTVSG SPVTPEPAAS NGFKIERNYF TLSGEPADIT KAKQNDRFAV
VLTVTEAKPE YGHIMVADYL PAGLEIDNPH LVSSGDSGTL DWIENGEEPV NTEFRDDRFT
AAIDRGTEDK AVFTVAYIVR AVSPGKYVLP QAIVEDMYNP SRYGRTGTGS VEVTKAK