Gene Information |
Locus tag | Rpal_3174 |
Symbol | |
ID | 6410844 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 3419185 |
End bp | 3420174 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642713052 |
Product | protein of unknown function DUF815 |
Protein accession | YP_001992153 |
Protein GI | 192291548 |
COG category | [R] General function prediction only |
COG ID | [COG2607] Predicted ATPase (AAA+ superfamily) |
TIGRFAM ID | |
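The length fields in this record are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TAG). The function names `gene_length` and `protein_length` are illustrative only and do not come from the database.

```python
# Values copied from the Start bp and End bp fields of this record.
START_BP = 3419185
END_BP = 3420174


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 990, matches the Gene Length field
print(protein_length(length_bp))  # 329, matches the Protein Length field
```

The same check would not apply directly to a spliced gene or a pseudogene, which this record states the locus is not.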
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.535414 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCAAAA AGACAAAATC CCGCCCCGCC AAAGCCGCCC CGAAGTCCGC CGCCCGCAAG CCCGCCCGCG CACCGGCGGC AAAGCGCCGC CCGCCGGCGA CGCCTCCCGG CGCATCCCTC GAGGGCGCAC TGCTGGAACG GATCGCCCAC GCGCTGGAGG GCATTTCCGC CCACCTGGCG GGCACTTCCG CCGCTCCGGC CGATGCCGCG CTGAATTCGG CCGACGCATT CATCTGGCAG CCCGAAGGGC GGCTCGCGCC GGTTCCAAAG GTCAGCCGGG TCGATCTGTC GCTGCTGCAG GGCGTCGACC GGATGCGCGA CACGCTGATC GAAAATACCG AGCGGTTCGC GACCGGCCTG CCCGCCAACA ACGCATTGTT GTGGGGCGCG CGGGGGATGG GCAAATCGTC GCTGGTCAAA GCCGCTCACG CGCATGTCAA CGCGCGGCCC GACGTCGCCG GCCGGCTGAA GCTGATCGAG ATTCACCGCG AAGACATCGA GAGCCTGCCG GCGCTGATGA CGCTGCTGCG CGCTTCCGAC CTCCGATTCA TCGTGTTCTG CGACGATCTG TCGTTCGACG GCAACGACGC CTCGTACAAA TCGCTCAAGG CCGTGCTCGA AGGCGGCATT GAGGGCCGCC CCGACAACGT AATTCTTTAC GCGACCTCGA ACCGGCGGCA TCTGCTGCCG CGCGACATGA TGGAGAACGA GCGCTCGACC GCGATCAATC CCGGCGAAGC GGTCGAAGAG AAGGTGTCGC TGTCCGATCG GTTCGGACTG TGGCTCGGCT TCCACAAATG CAGCCAGGAT GAATTCCTGG TGATGGTGCG CGGTTACTGC GCACACTACG ACATCGCGAT CGACGACGAA CAGCTCGAGC GCGAAGCTCT GGAATGGTCG ACCACGCGCG GCTCGCGCTC CGGCCGCGTC GCCTGGCAGT TCGTGCAGGA TCTCGCCGGC CGCCTCAAGG TGCGACTCGG AACCAAGTAG
Protein sequence | MAKKTKSRPA KAAPKSAARK PARAPAAKRR PPATPPGASL EGALLERIAH ALEGISAHLA GTSAAPADAA LNSADAFIWQ PEGRLAPVPK VSRVDLSLLQ GVDRMRDTLI ENTERFATGL PANNALLWGA RGMGKSSLVK AAHAHVNARP DVAGRLKLIE IHREDIESLP ALMTLLRASD LRFIVFCDDL SFDGNDASYK SLKAVLEGGI EGRPDNVILY ATSNRRHLLP RDMMENERST AINPGEAVEE KVSLSDRFGL WLGFHKCSQD EFLVMVRGYC AHYDIAIDDE QLEREALEWS TTRGSRSGRV AWQFVQDLAG RLKVRLGTK