Gene Information |
Locus tag | Rpal_4117 |
Symbol | |
ID | 6411801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 4412605 |
End bp | 4414185 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642713999 |
Product | Protein of unknown function DUF1800 |
Protein accession | YP_001993088 |
Protein GI | 192292483 |
COG category | [S] Function unknown |
COG ID | [COG5267] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
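The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Rpal_4117.
length = gene_length_bp(4412605, 4414185)
print(length)                     # 1581
print(protein_length_aa(length))  # 526
```

Both results agree with the Gene Length (1581 bp) and Protein Length (526 aa) rows of the record.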
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.970573 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAACCA ACACGGCCGG ACGGTGGAGC GCACGAGGAT CGGCGATACT GGCGATCGGC ATCGCCGCAA TCGTCACCGG CGGATCGGCC GGGGCCGCGG AGATCTCGGC GCACGATCTG GCGCTGATCG ATCGGCTGAC CTGGGGCATC AACGGATCCA GCGTGGCGCA ATTCCAGAAA CTCGGCGCCG CGCGCTGGGT GGACCAACAG CTGCACCCCA CGGCGGACAG TGCGCTGCCG CAGCTGGTCG TTGCGCAGAT CGACGCGATG CCCGACGCCG CCGGCCTGAC GCCGGCGGCG ATCAATGCGT TTCAGGCGCA GGGCAAGGAT GCCGATCAAC TCACTGACCC GGAGGCGCGC AAGACAGCCA AGCAGGCGTA CCAGCAGGCG CTGAACGACC GCGCCAAGCA GGCGGCAACC CGGTCGATCC TGCGCGCGCT CTATGCGCCC GATCAGCTCC GCGAGCGGAT GAGCTGGTTC TGGCTGAACC ATTTCAACGT CCACCAGAGC AAGGCCGAGC TGCGCCTCCT GGTCGGCGAC TATGAGGATC GCGCGATCCG CGCGCATGCG CTCGGCAAGT TCGGCGATCT GTTACGCGCC ACGCTGCGGC ACCCGGCGAT GCTGCGCTAT CTCGACAATG CCGGCAACGC CAACGGCCAT CTCAACGAGA ACTACGCCCG CGAGATCATG GAGCTGCACA CCATGGGCGT CGGCAGTGGC TACACCCAGG CCGATGTGGA GTCGCTCGCC AAGATCCTCA CCGGCGTCGG CATCGACCTG AAGCCCGAGG ACCCGAAGCT GAAGCCTGCG CTGGCTCCGC AGCTCGTCCG CGACGGCGCG TTTGAGTTCA ACCCGGCGCG GCACGATTAC TCCGACAAAA CCTTCCTCGG CCACACCATC CGCGGCAGCG GCTTTGCCGA AGTCGACGAG GCGCTCGACC TGATCGTGCA CAATCCGGCG ACCGCGCAGC ACGTCTCGCG CAAGATCGCG ACCTACTTCG TCTCGGACGA GCCGCCGCAA CCGCTGATCG ACAAGATGGC GAAAACCTTC ACCGCCTCCG ACGGTGATAT CGCGCAGGTG CTGGCCACGA TGATCGCCGC GCCGGAGTTC GATGCGTCGC TGAAGACGGC GGAACGCTTC AAGGATCCGG TTGGCTACGT CTATTCGGCG GTGCGGCTCG CTTACGACGA CAAGGTCGTG CTCAACACCG TGCCGATCCA GCGTTGGCTC GGCCGGCTCG GCGAAGGGCT GTATCAGCGC CAGACGCCGG ACGGCTATCC ACTGACGGCG AGCGCCTGGA ACGGCCCCGG CCAGATGATG CTGCGGTTCG AGATTGCGCG TCAGATCGGT TCCGGTTCGG CCGGGTTGTT CAAGCCGGAG CAGGCCGACG CCAAGGATCG GCCCGCATTT CCGCTGCTGC AGAACGCGCT GTATTTCGGC GGGCTCAGCC GGACGCTGAG CTCGACCACG CGCGGCGCGC TCGATCAGGC GATCTCACCG CAGGATTGGA ATACGCTGTT TCTGTCCTCG CCCGAATTCA TGGTTCGTCA ACGCGCGGAG GCACCGCATG AACCGTCGTG A
|
Protein sequence | MRTNTAGRWS ARGSAILAIG IAAIVTGGSA GAAEISAHDL ALIDRLTWGI NGSSVAQFQK LGAARWVDQQ LHPTADSALP QLVVAQIDAM PDAAGLTPAA INAFQAQGKD ADQLTDPEAR KTAKQAYQQA LNDRAKQAAT RSILRALYAP DQLRERMSWF WLNHFNVHQS KAELRLLVGD YEDRAIRAHA LGKFGDLLRA TLRHPAMLRY LDNAGNANGH LNENYAREIM ELHTMGVGSG YTQADVESLA KILTGVGIDL KPEDPKLKPA LAPQLVRDGA FEFNPARHDY SDKTFLGHTI RGSGFAEVDE ALDLIVHNPA TAQHVSRKIA TYFVSDEPPQ PLIDKMAKTF TASDGDIAQV LATMIAAPEF DASLKTAERF KDPVGYVYSA VRLAYDDKVV LNTVPIQRWL GRLGEGLYQR QTPDGYPLTA SAWNGPGQMM LRFEIARQIG SGSAGLFKPE QADAKDRPAF PLLQNALYFG GLSRTLSSTT RGALDQAISP QDWNTLFLSS PEFMVRQRAE APHEPS
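The gene and protein sequences above are shown in space-separated blocks of ten characters. The sketch below illustrates one way to strip that grouping, compute a GC fraction, and translate the coding sequence. It is an illustration under stated assumptions, not the method used to produce this record: the helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in which codons may act as start codons, which this sketch does not model).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 also uses for elongation).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGCGAACCA ACACGGCCGG ACGGTGGAGC"
print(translate(prefix))  # MRTNTAGRWS
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence rows to be checked in the same way.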