Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1909 |
Symbol | |
ID | 3907988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2182023 |
End bp | 2183207 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637883803 |
Product | hypothetical protein |
Protein accession | YP_485528 |
Protein GI | 86749032 |
COG category | [S] Function unknown |
COG ID | [COG3878] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0057636 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAGC TGGATGAACG ATCCGATACT GAATTGCTGC AGGCGTGGCT GGACAGGGTC CGCGTAACCG GCGCCGTCGA GCATGTCGGA GCGTTCAATC GTCTCCAAGC TGAACTGGCA AGGATCGAGC GGGAACTGAT CGCGCGCGGT CGCGGCATTG AACAGCTTCG CTCCTTGCTG ACATACACCG ACCCGGAGAT CACCCGGAGT GCCGAGCTGG CGCTCGCCAG GGCGGAGGCA TCCGCAAGCG CGGCGCGCGG CGGATGCCGT GCGTCCAGCG GGTGCGACGA CGCGCACTCA GCGGCCGCGG GCCGACCGGC AGGCGCCGGC ATCCTCGATG ATCATCCGAT ATTCTGGCCG GCGCGACATC CGCCCCCAGC GGCGATGGAG CTCTCCGAGA TCGCGCAGCG CCTCATCGCC GCGTTGCCGC TGGAGGCCGC GGCGCTGCTG CGCCAGCTTC GTCCCGCGAT CGGGCTGTGG CCGCAGGCGG CGCGCGCCGA CGCGCCGATC GACGGTTCCC GGCTCGGCGG CATGCCCTGC GCACCGCCGG ATTGGCGCTG GCCGGTCGCC GCGACCGAGC CGATGCTGTT TCTCGGCCAG ATCAACTGCG CCGACCTGCG CGGTCTGCCG GGCGCGGAGG CGCTGCCGTC GCAAGGCCTG CTGTCCTGCT TCGGCGACCA CGACACGGTG ATGGGGTGTC TGTTCACCGG GCAGGGCGGC GCCTTGTACC ATTGGCCGGA CACAGAGCAT CTGATCCCGG CGCAGCCGCC GCTGGAGATG CTCACGGTGT TTCCTCGCGC CGAATTGTCG TTCCGGCCGA TGTGGGACCT GCCGGACCCG GACAGCAGCG TCGTCACGGC GATCCTGCCG GATCGATCGC GCCAGGCGCT CTACAAAAGC CTGCACAGCG ACTTGCGGCG GCACGGCCTT CCCGCCGTGC TGGACTATCC GTGCAATCGC AGCAAGCTGC TCGGCTGGCC GGATCTGCTG CAGGGTGAGA GTTTCGAGTT CACCCTCGAT CAGCCGTTCG ATCAGTACCG ATTGCTGCTG CAGCTCGACG GCTACACCAA CGGCAGCGAG ATCGCCGACT GGGGGCCGGG CGGCTTCCTC TACTACTTCC TGTCCAAGCA AGACCTCGCC GGACGGCGGT TCGAGACGGC GGAACTGGCA ATCCAGTTCA CGTGA
|
Protein sequence | MNELDERSDT ELLQAWLDRV RVTGAVEHVG AFNRLQAELA RIERELIARG RGIEQLRSLL TYTDPEITRS AELALARAEA SASAARGGCR ASSGCDDAHS AAAGRPAGAG ILDDHPIFWP ARHPPPAAME LSEIAQRLIA ALPLEAAALL RQLRPAIGLW PQAARADAPI DGSRLGGMPC APPDWRWPVA ATEPMLFLGQ INCADLRGLP GAEALPSQGL LSCFGDHDTV MGCLFTGQGG ALYHWPDTEH LIPAQPPLEM LTVFPRAELS FRPMWDLPDP DSSVVTAILP DRSRQALYKS LHSDLRRHGL PAVLDYPCNR SKLLGWPDLL QGESFEFTLD QPFDQYRLLL QLDGYTNGSE IADWGPGGFL YYFLSKQDLA GRRFETAELA IQFT
|
| |