Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3836 |
Symbol | |
ID | 3911639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4380201 |
End bp | 4382687 |
Gene Length | 2487 bp |
Protein Length | 828 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637885736 |
Product | hypothetical protein |
Protein accession | YP_487440 |
Protein GI | 86750944 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.298031 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTGCTCG ATGAAGCCAC AATGCTGGCG GAAGAAGAAG CGTTTCTATG GCCGACGACG AAACCTGACT ACCGTGCTGG CCTCTCTTTG TCGGGAGGCG GCATCCGCGC CGCCACGGTG GCCCTCGGCG TGCTCGAGGG ATTGGCGTCA CGCGGTCTCT TGCAACGAAT TCACTACCTG TCCACCGTGT CCGGCGGCGG CTATATCGGA TCGGCATTGT CCTGGTTTTG GTGCGAACGC CGCGTGGTTG CTGAAGCGGC GCTGCAGAAG TCGAGAGAGC GCACCGTTCA TCGTTTCGGA GCCGACACCG CCAGCTTCCC GTTTCAGGAG GAGCGGGCGA ACGCGTCCCC GGTTGCGGAG GCGGCGGCCC TCAATCTCAA ATTTCTTCGT CAGCACGGCT CGTATCTCAC CTCGGGAGAC GGCATCGGCT TCGCCGGCCT GATCATGGCC GTGCTGCGTA CGGTGTTGCT GAGCCTCGCT GTCTGGATGC CGCTGCTGAT TGCGATCTTC CTGTCCTTCG AGGTCCTGGA TAGTTTTCTG TCGGGCGCAA GCTCGGATGG CGAGTTGGCG GCGAAATGCG AGACCGCTGT CGGAACTACC GTGTTCGCCT GCCGCCCTTC GTTCATTGCT CTCCTGTCAT TGGCCGGCGC GGTCGGCGTC GCAATCTTTA TTGGAACGAT CCTTTTCGCG TTTCTCGGTC ACCTGGCGTC AATCAGGGCG TCCGGCAAGC GAGGCCGTTG GATCGCACTC TCAGCTTCGG TCGGCATCGG CTTGGCCGCT CATGTGATCT GGAAATACAA TAGTTCGGCA ACGTTACAGC CGCTCTTGGG CGCCCAGCTT CTGCTGGAAT TATTCATGAT CGCGGCCGCG ATCAGTGTGG CAATTTCCCA GATGTCGCTG CCCGAAAACT GGAGCTATTC GCTGCGACGA CGTTTCGAAA AAGCCTCGAG CAAGGGTCTC CCGATCGCGA TCACGGCAGT GTCTATCGGC CTTCTACCCC TAATCGTCGC CACCTTGAAG CTGACAGATC CGTCCAAACT CGGCGCTTTC GAGCCGGTTT GGGGAACCGT CACGCTGCTG AGTGGAGTCG GCACCGCACT TTACGGCTAC TATCTCAAGG CCAAGAGTCT CTTGCCGGGC GTCGCCGGCA AATTCTTCGC GATAGCTGGG TCGCTGCTCT TCCTTTCGGG ACTGCTTATT CTCTCCTTTG CTACCGCCCG GCAATTGTTT CTTCTCAATA CAAACTGGGC TCTGACGGGA GGCGCGGGTC TCTTCATGCT GTCGATCGCC ATCGGCGTCG CCGGCAGCCT CAATGCCACC GGCCTCCACC GGTTCTATCG CGATCGGCTG ATGGAAACCT TCATGCCGAT GACCGACGCT ATCAGCCAGG GCACAGCGCG GCAAAGCGAC GTTGCCGATA CGCTCACCGT CGTCGATGTG GTGCGCAGCG CCGAGGAGCG CGGCGACCGG CCCTATCACC TGCTCAACGC CCATGCGATC CTGGTCAACG AGCCGGACGA TCCGAAGCTG GCGCTCCGCG GCGGCGACAA TTTTCTGATA TCGCCGGCAA TCATCGGCTC CTCGGCTACC GGCTGGATGC GCAGTCGCGA CTACCTTCGG CTGCAGGGTC CGCTGACACT GGCCTCGGCA ATGGCGGCCT CCGGCGCGGC CACCAATGCG AATGCCGGCT ATATCGGAAC CGGCGTGACG CGCGACCGTT TCCTCTCGGC GGTGATGTCC ATCCTCAACA TCAGGCTGGG ACTGTGGGTC GGGAACCCGC GATGGCTCGC CGCTAAATCT CTGTTCGGCC TGCAAGTCCT GAAGGCTCCG ACCTATTTCC AGCCCGGACT CACCGCCGGC ATTCTCGGAT TCGGTCACCA CAGAAAGGCG AAATTCCTCG AACTCTCCGA CGGTGGCCAC TTCGAGAATC TCGGCCTGTA CGAACTGGTG CGGCGGCGGC TCGACCTGAT CATCGTGGTC GACGCTGAAC AGGACAAGGA CATCAACTTG TCGGCTCTGG TGTCGTCGCA CAATCGCATC AAGGAAGACT TTGGCGTCGC TCTGAAGTTC GCCCCATCCG ACAAGGGCAA GGGACCGGAA CTCTTTCTCG GCGAAGAGGC CAAGAACAGA TATCCGCGCG GCCTACCTCT CGCCAAATCG CCGTTCATGG TCGCGCGAAT CGAATATCCC GCGACGAAGA GTGGCGAGCC CAACAAGACC GGCGTGCTGA TCTATTTGAA ATCGACCATC GTCGAGGGGC TGGATTTCGC CACCCTCGGC TATCGCGCGC TCAACGCCGA CTTTCCGCAC CAGACAACCG CAGATCAGTT TTTCGATCCC GATCAGTTCC AGGCGTACCG CAACCTCGGC CTCAGGAGCT GCGAGATCAT GGCGACCGCG CTCGACCTCG AAGCAAACTT CGACAAGCCA ACCGAACTGC TGAAGAAGTA CGACGACTGG AAGCCGGGCG CGTCAGCAGA CAGTTGA
|
Protein sequence | MLLDEATMLA EEEAFLWPTT KPDYRAGLSL SGGGIRAATV ALGVLEGLAS RGLLQRIHYL STVSGGGYIG SALSWFWCER RVVAEAALQK SRERTVHRFG ADTASFPFQE ERANASPVAE AAALNLKFLR QHGSYLTSGD GIGFAGLIMA VLRTVLLSLA VWMPLLIAIF LSFEVLDSFL SGASSDGELA AKCETAVGTT VFACRPSFIA LLSLAGAVGV AIFIGTILFA FLGHLASIRA SGKRGRWIAL SASVGIGLAA HVIWKYNSSA TLQPLLGAQL LLELFMIAAA ISVAISQMSL PENWSYSLRR RFEKASSKGL PIAITAVSIG LLPLIVATLK LTDPSKLGAF EPVWGTVTLL SGVGTALYGY YLKAKSLLPG VAGKFFAIAG SLLFLSGLLI LSFATARQLF LLNTNWALTG GAGLFMLSIA IGVAGSLNAT GLHRFYRDRL METFMPMTDA ISQGTARQSD VADTLTVVDV VRSAEERGDR PYHLLNAHAI LVNEPDDPKL ALRGGDNFLI SPAIIGSSAT GWMRSRDYLR LQGPLTLASA MAASGAATNA NAGYIGTGVT RDRFLSAVMS ILNIRLGLWV GNPRWLAAKS LFGLQVLKAP TYFQPGLTAG ILGFGHHRKA KFLELSDGGH FENLGLYELV RRRLDLIIVV DAEQDKDINL SALVSSHNRI KEDFGVALKF APSDKGKGPE LFLGEEAKNR YPRGLPLAKS PFMVARIEYP ATKSGEPNKT GVLIYLKSTI VEGLDFATLG YRALNADFPH QTTADQFFDP DQFQAYRNLG LRSCEIMATA LDLEANFDKP TELLKKYDDW KPGASADS
|
| |