Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2501 |
Symbol | |
ID | 3910290 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 2863041 |
End bp | 2864807 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637884400 |
Product | hypothetical protein |
Protein accession | YP_486117 |
Protein GI | 86749621 |
COG category | [S] Function unknown |
COG ID | [COG1376] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.732185 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGTGA ACGAGTTTCG CGGTTGCGAC ACTCATCTGA GATCAATCAA AAGATTCGAG GACCACCGCG TGCGCCTGGA ACAACCGACG ATGCCGCGCG CAAGCGCTCC GCGAACCGGA CCGATTTTCA AAGCCGCATG GCGCGCCGGG ATTCTGACGG CGGCGGGCGT CTTCGCACTG TCCTCCGAGG CTCATGCCGC GAACTGGTGG TGGCCCGAGA ACGACGACGC GGTCTACATC CCGGCGCAGC CGGCGCCCAA GCGGTATCAA CGGAAGCAGC CGCATCTCGA TACGCGGCAG CAGAAGCTGA TCGAGAAGCC GACCGCCAAG CCGCAGGGGC CGCTGGTGAT CGCGGTGTCG ATCGAGCAGC AAAAGATGCG CGTCTACGAC GCCAACGGCT TCTTCGCCGA GGCGCCGATC TCCACCGGGA TGCGCGGCCA CGCCACGCCG ATGGGCGTGT TCAGCGTCAT CCAGAAAAAC AAATGGCACC GCTCCAACAT CTACAGCGGC GCGCCGATGC CCTACATGCA GCGCCTGACA TGGTCGGGCA TCGCGCTCCA TGCCGGCGTG GTGCCGGGCT ACCCGGCTTC GCATGGCTGC ATCCGGATGC CGACATCGTT TGCGACGAAG ATGTGGGGCT GGACCCGGAT GGGGGCCCGC GTGATCGTCA CGCCCGGCGA CATCACGCCG ACCCACTTCA CCCACCCATT GCTGACAGCC AAACGGCCCG CGCCGGCGGA CGCGCCGATG GCCTCCGATC CGCAGAAGCC GGGCACGGCG CCGAAATCCG ACAAGGCCGC GACGGCCGAA CCGGCCGCCG CCCCCGCCGC CGGACTGCAT CCGGAACTGC GTGCCGGCAT CCTGGGCGAC GCCCCCCGCC CGCCGGTCCA GACCGCCGAC GCCAGCGCGG CGGACCCTGG CGCCGCGCTC GTGCTGTCCG ACTCGCCGGC GCGAGAAAAC GCCTCGGCAG AGCCAGCCCC CGCGACGAGC CCGGCCGACG CCCACGCCGA AGCGACAGTC GACTCGATCG CCAAGGACGA GGCCGCCGAG CCCACCGCCG CCGTTGCCAA GGACATCGCC GAAGCGACCG AAGCCCGGCC GGACGACCTC GATCCCGCGA CCACCGCCAC CGTCGTCATC GCTCCCGGAT CCCCTCCCCA AGCGACCGAA AAAGCGCCTT CTTCCAATGA GAAGGCGCCA TCCGTTGAGT CCGCGGACAA GCTCGACGGC AAGCCGGATG CCGCCAGCCA GAGCAGCGCC CCGGGCAAGG ATCAGTCGCG CCCGGGCGAT CCGGCTGCGC CGCCGGCGGC AGAATTGCCG GCAGCCAAGC GCGGCGGCGG CCAGATTGCG ATGTTCGTCA GCGCCAAGGA CCAGAAGCTC TACGTCCGTC AGAACATGAC ACCGTTGTTC GACGTTCCGG TGGTGATCGC GGCAGGCGAG CGGCCGCTGG GGACGCATGT GTTTACCGCC GAGCTGGCGA AGGATGACGG CGTTCGCTGG ACGGTGGTGT CGCTGCCGGC ACCGCAACGG GTCAATGATG CGGGCAATTC CCGCCGATCC AAGAAGACCT TGCCGACACC TGTGAAAGCA TCGGTGGAAA CAGAAGGCCC CGCCGCAGCG CTCGATCGCC TGACAATTCC CGCCGAAGCC ATGGCGCGGA TCGCCGACGC GATCACCACC GGAACATCGT TCATCGTCTC CGATCAGGGC ATCACTGCGA GCGGCGAAAC CGGCCGGGGG ACCGATTTCA TCATCAGCCT GCGGTAG
|
Protein sequence | MRVNEFRGCD THLRSIKRFE DHRVRLEQPT MPRASAPRTG PIFKAAWRAG ILTAAGVFAL SSEAHAANWW WPENDDAVYI PAQPAPKRYQ RKQPHLDTRQ QKLIEKPTAK PQGPLVIAVS IEQQKMRVYD ANGFFAEAPI STGMRGHATP MGVFSVIQKN KWHRSNIYSG APMPYMQRLT WSGIALHAGV VPGYPASHGC IRMPTSFATK MWGWTRMGAR VIVTPGDITP THFTHPLLTA KRPAPADAPM ASDPQKPGTA PKSDKAATAE PAAAPAAGLH PELRAGILGD APRPPVQTAD ASAADPGAAL VLSDSPAREN ASAEPAPATS PADAHAEATV DSIAKDEAAE PTAAVAKDIA EATEARPDDL DPATTATVVI APGSPPQATE KAPSSNEKAP SVESADKLDG KPDAASQSSA PGKDQSRPGD PAAPPAAELP AAKRGGGQIA MFVSAKDQKL YVRQNMTPLF DVPVVIAAGE RPLGTHVFTA ELAKDDGVRW TVVSLPAPQR VNDAGNSRRS KKTLPTPVKA SVETEGPAAA LDRLTIPAEA MARIADAITT GTSFIVSDQG ITASGETGRG TDFIISLR
|
| |