Gene Information |
Locus tag | RPB_2919 |
Symbol | |
ID | 3910715 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3328832 |
End bp | 3334045 |
Gene Length | 5214 bp |
Protein Length | 1737 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637884822 |
Product | Alpha-2-macroglobulin-like protein |
Protein accession | YP_486532 |
Protein GI | 86750036 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
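The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS, excluding the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3328832, 3334045)
print(length)                     # 5214
print(protein_length_aa(length))  # 1737
```

Both results agree with the Gene Length and Protein Length fields listed for RPB_2919.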
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGGTT TGGTTCGCGC CATCACGCTT TGCGCCACGC TGGCGCTCGG GCTGGCCACC GCGCAGGCCG CCGACAAGGC GTTCAAACGC GACGATCTGG CGGATTCGGC GATCAAGCTC GAGGCCCAGA TCAAGAGCGA GGCCGGCGCC ATCGCCAAGC CGGCGGCGGG CCTGCGCACC GACGCCGACG CCGCCTTCAA GCGCGGCGAC TACCGCACCG GGCTGCAGAT CATGGGCCAG ATCGCCACCG TCGATCCGTC CGACGGCAGC AACTGGCTGC GACTCGCCAA GACCGTGTTC CAGATCAAGG CGCCGACCAG CTCGGAACGG ACCTTCCTGC TGGAGCGCGC CTCGACCGCC GCCTATCTCG CCTATCAGCG CGCCGGCAAT GCCGGCGAGG AGGCCGAGGC GCTGGCGGTG CTCGGCCGCG CCATGGCCGA CCGGCGGCTG TGGCGGCCGG CGCTGGATGC GCTGCGGCTG TCGCTCGACC TGCGCGAAGT CGCCGAGGTT CGCGGCCAAT ACGAGAAGCT GCGCGACGAT CACGGTTTCC GGCTGCTCGA TTATACCGTC GACTCGGATT CGGCGTCGCC GCGGGCGTGC TTCCAGTTCT CCGAGGACCT CGCCAAGCGC ACCGACTTCG CGCCCTATTT GGCGCTGGCC GGCACCGACA AGCCGGCGCT GACCTCCGAA GACAAGCAGC TCTGCGTCGA AGGGCTCAAG CACGGCGAGC GCTACAACAT CAATCTGCGC GCCGGACTGC CGTCGACCGT CAAGGAGACG CTGCCGAAAT CGGCCGAGTT CAACATCTAT GTCCGCGACC GCAAGCCGCT GGTGCGCTTC ACCGGCCGCG CCTATGTGCT GCCGCGCACC GGCCAGCGCG GCATTCCGCT CGTCAGCGTC AACACGCCCA CGGTGTCGGC GCAGGTGTTC CGGATCGGCG ACCGCAATTT GATCAACACC GTGGTCGACA GCGACTTCCA GCGCACGCTG AGCCGCTACC AGCTCGACGA GCTCGGCAGC CAGCGCGGCG TCAAGGTGTG GTCGGGCGAA CTCGACACCG CGCCGGCGGC GCTCAATGCC GACGTCACCA CCGCGTTCGC GGTGGATCAG GTGCTCGGCG ACCTGCAACC CGGCGTCTAT GTGATGACCG CGTCGCCGAA AGGGCCGGTC GCCAATGCCG ACGACGACGG CCAGCTCGCG ACGCAATGGT TCATCGTCTC CGATCTCGGC GTCACCGCGT TCTCCGGCAA TGACGGCATC CACGTCTTCG TCAATTCGCT GGCCTCGACC GACCCGGTCG GCAAAGCCGA TGTCCGGCTG ATCGCCCGCA ACAACGAGAT TCTGGCCACC CGCAAAACCG ACGCCTCCGG CCACGTGCTG TTCGAGGCCG GGCTGGCGCG CGGCGAGGGC GGGATGTCGC CGGCACTGCT GACGGTCACC GGCGAGAAGG CCGACTATGC CTTCCTCAGC CTCAAATCCA ACGCCTTCGA CCTGTCCGAC CGCGGCGTCA CCGGCCGCGC GGTGCCGGCC GGCGCCGACG CCTTCGTCTA TGCCGAGCGC GGCGTCTATC GCGGCAGCGA GACCGTGCAT CTCACCGCGC TGCTGCGCGA CGGCCAGGGC AACGCGCTCG TCGGCGGCCC GATGACGCTG GTGATCGAGC GGCCGGACGG CGTCGAATTC CGCCGCGCCG TGCTGTCCGA TCAGGGCGCC GGCGGCCGCA GCCTCGACGT CGCGCTCAAC TCGGCGGTGC CGACCGGCAC CTGGCGGGTG CGCGCCTTCA CCGACCCGAA GGGCGCCAGC 
ATCGGCGAGA CCACCTTCAT GGTCGAGGAC TACGTCCCCG ACCGGATCGA ATTCGATATC TCCACCAAGG ACAAGCAGAT CAAAGCTGAT GCTCCTGTGG AACTGAAGGT CGACGGTCGC TTCCTGTATG GCGCGCCGGC CTCCGGGCTG GCGCTGGAAG GCGACCTGCT GGTCGCGCCG GCCGCGAGTC GTCCCGGTTT TCCCGGCTAC CAGTTCGGCG TCGCCGACGC GGAGACCACC AGCAACGAGC GCGCCCCGCT GGAGAACCTG CCCGAAGCCG ACGACAACGG CAGCGCGACC TTCCCGTTGG TCCTGCCGAA GCCGCCGTCG TCGACCCGGC CGCAGGAGGC GCAGATCTTC ATCCGGATGC GCGAGGCCGG CGGCCGCGCC GTCGAGCGCA AGCTGGTGCT GCCGGTCGCG CCGACTGCGG CGATGATCGG CGTCAAGCCG CTGTTCGCCG ACAAGAACGT CGCCGACGGC GACGCCGCCA AGTTCGAGGT CGCCTTCGTC GATCCGGACG GCACCGCGCT GACGCGATCC GGCCTGCGCT ACGAGCTGCT GAAAATTGAA TCGCATTATC AATGGTACCG GCAGAATTCC TCATGGGATT TCGAGCCGGT GAAATCGACC AAGCGGGTCG CCGATGGCGA CCTTGCGGTC GCGCCCGGCC AACCCGGACA ATTGTCGTTC CAGCCCGAAA GCGGCCGCTA CCGGCTCGAC GTCAAGACCG CCGATGCCGA CGGCCCGGTC ACCTCGGTGC AGTTCGACGT CGGCTGGTAT TCCGACGGCT CGGCCGATAC GCCCGATCTG CTGGAGACCT CCGTCGACAA ACCCGAATAC GCCTCCGGCG ACACCATGAC GGTGACGGTC AATGCGCGGA CCGCAGGGCT GCTGACCGTC AACGTGCTCG GCGACCGGCT GCTGACGACG CAGTCGGTGG CGGTGAAGCA GGGCACGTCG CAGGTCAAGA TCCCGGTCGG CAAGGATTGG GGCACCGGCG CCTATGTGGT GACGACGCTG CGCCGGCCGC TCGATGCCGC CGCCCAGCGG ATGCCCGGCC GCGCCATCGG CGTGCAATGG GTGTCGATCG ACAGGAAGGC GCGGACGCTG CAGGTCGCGC TGTCGCCGCC GGCGCTGGTG CGGCCGTCGA CCACGTTGAA ACTGCCGGTC AAGCTCGGCG GACTCGCGCC CGGCGAGGAC GCCAAGATCG TGGTCGCCGC GGTCGACGTC GGCATTCTCA ACCTCACCAA CTACAAGCCG CCGGCGCCGG ACGATTATTA TCTCGGCCAG CGCCGCATGA CCTCCGAGAT CCGCGATCTC TATGGCCAAT TGATCGACGG CATGCAGGGC ACGCGCGGCC AGATTCGCTC CGGCGGCGAC GGCGCCGGCG CCGAGCTGCA GGGCAGCCCG CCGACGCAGA AGCCGCTGGC GCTGTATTCG GGGATCGTCA CCGTCGGAGC CGACGGCAGC GCCGAGATCA GCTTCGAGAT TCCTGAATTC GCCGGCACCG CGCGGGTGAT GGCGGTGGCG TGGACCGCCA CCAAGGTCGG CCGCGCCAAT GTTGACGTGA CGGTGCGCGA CCCGGTGGTG CTGACCACGA CGCTGCCGCG CTTCCTGCGC AATGGCGATC GCGGCACCAT GGCGTTCGAC CTCGACAATG TCGAAGGCGC GCCCGGTGAT TTCACCATCA AGGTGACCGC GACGGGGCCC GTGAAGTTCG CAGGGCCGGC GTCGACCACG CTGAAGCTCG CCGCCAAACA GCGCGGCTCG GCCTCGCTCG CCGTCGAGGC CGGCGGCGCC GGCACCGCAG 
CGCTCGACGT CGCCATCAGC GGCCCGAACG GGCTGACGCT GGCACGGCAC TACGCGCTCG ACGTCCGCCC CGCCAACCAG ACGCTGGCGC GGCGTTCGAT CCGCACGCTC GCCAAGCAGG AAAGCCTGAC GCTGACCTCG GACATGTTCG CCGATCTGGT GCCGGGCACC GGCGGGGTGT CACTGTCGGT TAGTCTCTCG ACCGCGCTGG ACGCGGCGAC CATTCTCAAG GCGCTCGATC GCTATCCGTT CGGCTGCTCC GAGCAGATCG CCAGCCGCGC GCTGCCGCTG TTGTACGTCA ACGATCTCGC CGCCGGCGCG CATCTGGCGA TGGATGCGAG CGCCGACGAG CGGATCAGGA CCTCGATCGA TCGGCTGCTG GCGCGGCAGG GCTCGAATGG TTCATTCGGG CTGTGGTCGA CCGGCGGCGA CGATGCCTGG CTCGATGCTT ACGTCACCGA CTTCCTGACC CGCGCCCGCG AGAAGAACTT CGTGGTGCCG GACGTGGCGT TCCGCAGCGC GCTCGACCGC ATCCGCAACG CGGTGGTGAA CGCCGAGGAG CCGGAGAAGG ACGGCGGCCG CAATCTCGCT TACGGGCTGT ATGTGCTGGC CCGCAACGGC GCGGCGCCGA TCGGCGATCT GCGCTATCTC GCCGACACCA AGCTCGACAA GCTGGCGACG CCGATCGCCA AGGCGCAGCT CGCCGCCGCG CTGGCGCTGG TTGGGGATCG CGCGCGTGCG GAGCGGGTCT ATGCGGTGGC GGCGGGCGAT CTCGCGCCGA AGCCGGTGAT CCAGTTCGGC CGGGTCGACT ACGGCTCGGC ACTGCGCGAC GCCGCGGCAT TGGTGTCGCT CGCCAGCGAG GGCAACGCGC CGAAAGCGAC GCTGACCACG GCGGTGCAGC GCGTCGAAGC GGCGAGGGGC CTGACGCCCT ACACCTCGAC GCAGGAAAAT GCCTGGCTGG TGCTGGCGGC GCGCGCGCTC GCCAAGGAGA CGATGAGCCT CGACATCAAC GGCAGCGCGG CGAAGTCCGC GGTGTATCGC AGCTACAAGG CCGACGAGCT GCGCGGCCAG CCGATCCGCA TCGCCAACAC CGGCGACGCG CCGGTGCAGG CGGTGGTCAC CGTCAGCGGT TCGCCGGTGA CGCCGGAGCC GGCCGCCAGC AACGGCTTCA AGATCGAGCG CAACTACTTC ACGCTCAGTG GCGAGCCGGC CGACATCACC AAGGCGAAGC AGAACGATCG CTTCGCGGTG GTGCTGACGG TCACCGAGGC CAAGCCGGAG TACGGTCACA TCATGGTGGC GGACTATCTG CCGGCCGGCC TCGAGATCGA CAATCCGCAT CTGGTGTCGT CCGGCGATAG CGGCACGCTC GACTGGATCG AGAACGGCGA GGAGCCGGTC AACACCGAAT TCCGCGACGA CCGCTTCACC GCCGCGATCG ACCGTGGCAC CGAGGACAAG GCGGTGTTCA CGGTGGCCTA TATCGTCCGC GCGGTGTCGC CCGGCAAATA CGTGCTCCCG CAGGCCATCG TCGAGGACAT GTACAACCCC TCGCGTTACG GCCGCACCGG CACCGGCAGC GTCGAGGTGA CTAAGGCGAA ATGA
|
Protein sequence | MIGLVRAITL CATLALGLAT AQAADKAFKR DDLADSAIKL EAQIKSEAGA IAKPAAGLRT DADAAFKRGD YRTGLQIMGQ IATVDPSDGS NWLRLAKTVF QIKAPTSSER TFLLERASTA AYLAYQRAGN AGEEAEALAV LGRAMADRRL WRPALDALRL SLDLREVAEV RGQYEKLRDD HGFRLLDYTV DSDSASPRAC FQFSEDLAKR TDFAPYLALA GTDKPALTSE DKQLCVEGLK HGERYNINLR AGLPSTVKET LPKSAEFNIY VRDRKPLVRF TGRAYVLPRT GQRGIPLVSV NTPTVSAQVF RIGDRNLINT VVDSDFQRTL SRYQLDELGS QRGVKVWSGE LDTAPAALNA DVTTAFAVDQ VLGDLQPGVY VMTASPKGPV ANADDDGQLA TQWFIVSDLG VTAFSGNDGI HVFVNSLAST DPVGKADVRL IARNNEILAT RKTDASGHVL FEAGLARGEG GMSPALLTVT GEKADYAFLS LKSNAFDLSD RGVTGRAVPA GADAFVYAER GVYRGSETVH LTALLRDGQG NALVGGPMTL VIERPDGVEF RRAVLSDQGA GGRSLDVALN SAVPTGTWRV RAFTDPKGAS IGETTFMVED YVPDRIEFDI STKDKQIKAD APVELKVDGR FLYGAPASGL ALEGDLLVAP AASRPGFPGY QFGVADAETT SNERAPLENL PEADDNGSAT FPLVLPKPPS STRPQEAQIF IRMREAGGRA VERKLVLPVA PTAAMIGVKP LFADKNVADG DAAKFEVAFV DPDGTALTRS GLRYELLKIE SHYQWYRQNS SWDFEPVKST KRVADGDLAV APGQPGQLSF QPESGRYRLD VKTADADGPV TSVQFDVGWY SDGSADTPDL LETSVDKPEY ASGDTMTVTV NARTAGLLTV NVLGDRLLTT QSVAVKQGTS QVKIPVGKDW GTGAYVVTTL RRPLDAAAQR MPGRAIGVQW VSIDRKARTL QVALSPPALV RPSTTLKLPV KLGGLAPGED AKIVVAAVDV GILNLTNYKP PAPDDYYLGQ RRMTSEIRDL YGQLIDGMQG TRGQIRSGGD GAGAELQGSP PTQKPLALYS GIVTVGADGS AEISFEIPEF AGTARVMAVA WTATKVGRAN VDVTVRDPVV LTTTLPRFLR NGDRGTMAFD LDNVEGAPGD FTIKVTATGP VKFAGPASTT LKLAAKQRGS ASLAVEAGGA GTAALDVAIS GPNGLTLARH YALDVRPANQ TLARRSIRTL AKQESLTLTS DMFADLVPGT GGVSLSVSLS TALDAATILK ALDRYPFGCS EQIASRALPL LYVNDLAAGA HLAMDASADE RIRTSIDRLL ARQGSNGSFG LWSTGGDDAW LDAYVTDFLT RAREKNFVVP DVAFRSALDR IRNAVVNAEE PEKDGGRNLA YGLYVLARNG AAPIGDLRYL ADTKLDKLAT PIAKAQLAAA LALVGDRARA ERVYAVAAGD LAPKPVIQFG RVDYGSALRD AAALVSLASE GNAPKATLTT AVQRVEAARG LTPYTSTQEN AWLVLAARAL AKETMSLDIN GSAAKSAVYR SYKADELRGQ PIRIANTGDA PVQAVVTVSG SPVTPEPAAS NGFKIERNYF TLSGEPADIT KAKQNDRFAV VLTVTEAKPE YGHIMVADYL PAGLEIDNPH LVSSGDSGTL DWIENGEEPV NTEFRDDRFT AAIDRGTEDK AVFTVAYIVR AVSPGKYVLP QAIVEDMYNP SRYGRTGTGS VEVTKAK
|
| |
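The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch using only the Python standard library, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (it differs in which codons may act as start codons), and it does not handle alternative start codons, which is acceptable here because the gene begins with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGATCGGTT TGGTTCGCGC CATCACGCTT"
print(translate(first_blocks))  # MIGLVRAITL

# Last blocks of the gene sequence, ending in the TGA stop codon.
last_blocks = "GTCGAGGTGA CTAAGGCGAA ATGA"
print(translate(last_blocks))   # VEVTKAK
```

The two printed fragments match the first ten and last seven residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.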