Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_5175 |
Symbol | |
ID | 6412875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 5579317 |
End bp | 5580315 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642715065 |
Product | hypothetical protein |
Protein accession | YP_001994138 |
Protein GI | 192293533 |
COG category | [S] Function unknown |
COG ID | [COG3181] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.573585 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTTGA TCCGACGCAT TGTTTTGTCT CGCCGGGCCG TCCTGACCAC AGCCGCCGCA GCCTTGGCGG CGGCGAGAAT TCCGTCGTCC GCGCGAGCCC AGGGCCTGTA CCCGACCCGG CCGGTGCGGA TCGTGCTGCC GTTCGCGGCC GGCGGTGTCG CCGACATCAC AGCGCGCCTG ATCGCGGATC AGCTCGGCAC CAAGCTCGGC CAGCGCTTCT ATGTCGAGAA CCAGCCGGGC GCTGGCGGCA TCGCCGCGGC GCGCACCGTG ATCTCATCAC CACCGGACGG CACGACGCTG GCACTGTTGT CGAACGGCAC CGCGATCAGC GTGTCGCTGT TCAAGAAGCT GCCGTTCGAT CCGGTGAAGG ATTTCGCGCC GATTTCCAGC CTCGGCACCT TCGACTTCCT GTTCGCCGTC CGCGCCGAGT CCAAGTTCAA GACGCTGGAA GAGGTGATCA AGGCGGCGAA GCAGAAGCCG GGCGCGCTCA ATGTCGGCAC CATCAACACC GGCAGCACGC AGAACCTCGC CGCGGCGTTG TTCAAGACCG CAGCCGGCGT CGACTTCGTG ATCGTTCCGT TTCGCGGAAC GCCGGAGGTG CTGGTGGCTC TGCTGCAGGA CAGCGTCGAC CTGACGATCG ACAGCTATTC GGCGCTGAAA GGCAACCTCG CCGACGGCAA GATCCGGGCG CTGGCGGCGA CGGGGCCGCT GCGCTCGAAG ATCACGCCCG AGATTCCTAC GCTGCGCGAG AGCGGCATCG AGGCCAGCAT CGAATCCTGG AACGGATTGT TCGCGCCGGC CGGTACGCCG CCCGCGGTGA TCGGCGCGCT GAACACGGCG CTGCAGGAGA TCCTCGCCGA TCCGGCACTC AAGAAGAAGA TGCTTGAACT CGGCATCGAC GCCAGACCGT CGACACCGGA TCAATTGGCG GCACGGCTGC GCGCCGACAT CGAGAAATGG CGCGCCGTGA TCGAGCAGTC CGGCATCGAG CGGCAATAG
|
Protein sequence | MSLIRRIVLS RRAVLTTAAA ALAAARIPSS ARAQGLYPTR PVRIVLPFAA GGVADITARL IADQLGTKLG QRFYVENQPG AGGIAAARTV ISSPPDGTTL ALLSNGTAIS VSLFKKLPFD PVKDFAPISS LGTFDFLFAV RAESKFKTLE EVIKAAKQKP GALNVGTINT GSTQNLAAAL FKTAAGVDFV IVPFRGTPEV LVALLQDSVD LTIDSYSALK GNLADGKIRA LAATGPLRSK ITPEIPTLRE SGIEASIESW NGLFAPAGTP PAVIGALNTA LQEILADPAL KKKMLELGID ARPSTPDQLA ARLRADIEKW RAVIEQSGIE RQ
|
| |