Gene Information |
Locus tag | RPB_2820 |
Symbol | |
ID | 3910613 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3212823 |
End bp | 3215360 |
Gene Length | 2538 bp |
Protein Length | 845 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637884720 |
Product | surface antigen (D15) |
Protein accession | YP_486433 |
Protein GI | 86749937 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4775] Outer membrane protein/protective antigen OMA87 |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type; [TIGR03303] outer membrane protein assembly complex, YaeT protein |
|
|
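The Gene Length and Protein Length values are consistent with the Start bp and End bp coordinates listed in this record. The check below is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS length counts the terminal stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record for RPB_2820.
length = gene_length(3212823, 3215360)
print(length, protein_length(length))
```

With the coordinates from this record, the sketch gives 2538 bp and 845 aa, matching the table. The strand ("-") does not change the length arithmetic, only the direction in which the sequence is read from the replicon.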
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.163282 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.985968 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGCTG GAATGCGATT GTTGCGGGGG GGCCTTCTTG CTGCCACCCT GGTGTTTTTC GCCGTACCGG TCGCCACGAC GGCGACCGCT GTATTGACGG CGTCGCCTGC GGCCGCGCAG TCTGCCTCTT CGATTCAGGT CGAGGGCAAC CGCCGGGTCG AGGCCGACAC CATTCGCTCC TATTTCAAGC CAGGCCCGAG CGGCCGGCTC GACCAGGGCA GCATCGACGA CGGCCTCAAG GCGCTGATCG AGACGGGGCT GTTCCAGGAC GTCCGGATCA ATCAGGGTGG CGGCGGTCGC CTCGTCGTCA GCGTCGTCGA AAACCCGGTG ATCGGCCGGC TCGCTTTCGA GGGCAACAAG AAGATCAAGG ACGAGCAGCT TTCGGCGGAA ATCCAGTCGA AGCCGCGTGG CACGCTGTCG CGTCCGATGA TCCAGTCCGA CGCGCTGCGG ATCGCTGAAA TCTACCGCCG GTCGGGCCGT TACGACGTTC GCGTCGATCC GCAGATCATC GAACAGCCGA ACAACCGCGT CGATCTGGTG TTCGTCGTCA ACGAAGGCGA CAAGACCGGC GTCAAGTCGA TCGAATTCAT CGGCAACAAG GCGTTCTCGT CCTATCGGCT GAAGGACGTC ATCAAGACCC GCGAATCCAA CCTGCTGAGC TTCCTCGGCT CGGGCGACGT CTACGATCCG GATCGGGTCG AGGCGGACCG CGATCTGATC CGGCGCTTTT ATCTGAAGAA CGGCTATGCC GACGTTCAGG TGGTGGCCGC GCTGACCGAA TACGATCCGG AGCGCAAGGG CTTCCTCGTC TCCTTCAAGA TCGAGGAAGG TCAGCAATAT CGCGTCGGCT CGGTGAGCTT CGAATCGACG ATTCCGAATT TCGACGCCAA TTCCCTGAGC AGCTATTCGC GGGTGAATGT CGGCTCGCTG TACAACGCCG AGGCGCTCGA GAAGTCCGTC GAGGAAATGC AGATCGAGAT GTCGCGGCGC GGCTATGCAT TCGCGACGGT GCGTCCGCGT GGCGATCGTA ATTTCGAATC CCATACCGTC TCGATCGTGT TCTCGATCGA GGAGGGCGCT CGGGTCTACA TCGAGCGGAT CAACGTCGTC GGCAACACCC GGACCCGCGA CTACGTCATC CGGCGCGAGT TCGATATCGC GGAAGGCGAT GCCTACAACC GCGCGCTGGT CGACCGGGCC GAGCGCCGGC TGAAGAACCT CGACTTCTTC AAGTCCGTGA AGATCTCGAC CGAACCCGGC TCGTCGAGCG ACCGCGTCAT CCTGGTGGTC AATCTCGAAG AGAAATCGAC CGGCGACTTC TCGGTCTCCG GCGGCTATTC GACCAGCAAC GGCGCGATGG GCGAAGTCAG CGTCTCGGAG CGCAACTTCC TCGGCCGCGG CCTGTTCGCC AAGGCGACCG TGCAATACGG CCAGTATGCG CGCGGCTACT CGCTGTCGCT CGTCGAGCCC TATCTGCTCG ACTACCGCGT CGCGCTCGGC CTCGACCTGT ATCAGCGCGA GCAGCTCGCC AACAGCTACA TCTCGTACGG CACCAAGACG CTCGGCATCA GCCCGCGGCT CGGCTTCGCC CTGCGCGAAG ACCTGACCCT GCAGCTGCGC TATTCGCTGT ACCGGCAGGA AATCACGCTG CCGTCGTACC TGAACAATTG TAACAACAAT CTCGGCTCGG CGAACTACTT CCCGACGCCT CAGTTCATCG CGGCCGGCAA TCCGAACAAC ACCGGCTACG GCGTGCTCGG CTGCTACGGC GACGGCGAAG CCTCGCTTCC GGTCCGCATC 
GGCCTGTCCA ACGGCGCCTA CTGGACCTCC TCGGTCGGCT ACACCCTGAC CTACAACACG CTGGACAACA CCCGGAACCC GACCAACGGT CTGCTGGTCG ACTTCCGTCA GGACTTCGCC GGCGTCGGCG GCGACGTGAA GTTCCTGAAG TCGGCGTTCG ACGCCAAGTA CTACACCCCG CTGGTGTCGG ACATCGTCGG CATCGTCCAC CTGCAGGCCG GCAATCTCAG CACCTATGGC GGCAACCAGC TGCGCATGCT CGACCACTTC CAGATGGGTC CGAACCTGGT CCGCGGCTTC GCGCCGAACG GTATCGGTCC GCGCGACATC GGCCAGTACG CCTTCTACGG CTACGGCGGC GACGCGCTCG GCGGCACCAA CTACTGGGGC GCATCGGTCG AGTTGCAGAT GCCGTTCTGG TTCCTGCCGA AGGAAGTCGG GCTCAAGGGC GCCGTCTATG CCGACGCCGG CTCGCTGTTC GACTACAAGG GCCCGACGTC GTGGACGCTC ACCAACGAAG TCAACGCGCC CGGTTGTACG CCGGCGAGCC AGACCTCGAT CGGGACCTGC GCCGGCCTGA ATTACGACGA CACCAATCTG GTCCGCACCT CGGTGGGTGT CGGCCTGATC TGGGCCTCGC CGTTCGGTCC GCTGCGGTTC GACTACGCTG TCCCGATCAC CAAGGGTAAG TACGACCGCG TCCAGGAATT CAAATTCGGC GGCGGGACTT CGTTCTAA
|
Protein sequence | MNAGMRLLRG GLLAATLVFF AVPVATTATA VLTASPAAAQ SASSIQVEGN RRVEADTIRS YFKPGPSGRL DQGSIDDGLK ALIETGLFQD VRINQGGGGR LVVSVVENPV IGRLAFEGNK KIKDEQLSAE IQSKPRGTLS RPMIQSDALR IAEIYRRSGR YDVRVDPQII EQPNNRVDLV FVVNEGDKTG VKSIEFIGNK AFSSYRLKDV IKTRESNLLS FLGSGDVYDP DRVEADRDLI RRFYLKNGYA DVQVVAALTE YDPERKGFLV SFKIEEGQQY RVGSVSFEST IPNFDANSLS SYSRVNVGSL YNAEALEKSV EEMQIEMSRR GYAFATVRPR GDRNFESHTV SIVFSIEEGA RVYIERINVV GNTRTRDYVI RREFDIAEGD AYNRALVDRA ERRLKNLDFF KSVKISTEPG SSSDRVILVV NLEEKSTGDF SVSGGYSTSN GAMGEVSVSE RNFLGRGLFA KATVQYGQYA RGYSLSLVEP YLLDYRVALG LDLYQREQLA NSYISYGTKT LGISPRLGFA LREDLTLQLR YSLYRQEITL PSYLNNCNNN LGSANYFPTP QFIAAGNPNN TGYGVLGCYG DGEASLPVRI GLSNGAYWTS SVGYTLTYNT LDNTRNPTNG LLVDFRQDFA GVGGDVKFLK SAFDAKYYTP LVSDIVGIVH LQAGNLSTYG GNQLRMLDHF QMGPNLVRGF APNGIGPRDI GQYAFYGYGG DALGGTNYWG ASVELQMPFW FLPKEVGLKG AVYADAGSLF DYKGPTSWTL TNEVNAPGCT PASQTSIGTC AGLNYDDTNL VRTSVGVGLI WASPFGPLRF DYAVPITKGK YDRVQEFKFG GGTSF
|
| |
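The sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch of how such a display string could be normalized, its GC fraction computed, and the CDS translated. It assumes the standard codon assignments, which translation table 11 shares with the standard code for elongation. Alternative start codons are not handled here, and the helper names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def normalize(display_seq: str) -> str:
    """Remove the spacing and line breaks used for display and upper-case the result."""
    return "".join(display_seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = normalize(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = normalize(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three display blocks of the gene sequence in this record.
print(translate("ATGAATGCTG GAATGCGATT GTTGCGGGGG"))
```

Applied to the first three blocks of the gene sequence, the sketch returns `MNAGMRLLRG`, which matches the first block of the protein sequence. Passing the complete gene sequence to `gc_fraction` would give a value to compare against the GC content row.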