Gene Information |
Locus tag | Rpal_0503 |
Symbol | |
ID | 6408152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 545847 |
End bp | 547208 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642710415 |
Product | protein of unknown function DUF21 |
Protein accession | YP_001989538 |
Protein GI | 192288933 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
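The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon; the record does not state either convention, but both fit the reported values. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 545847
END_BP = 547208


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under these assumptions the result agrees with the Gene Length (1362 bp) and Protein Length (453 aa) rows.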
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTCGA CTCTGAGCAA CGTTCTGATC GCGGTCCTGC TGCTGATCGC CAACGCCTTC TACGTCGCGG CGGAGTTCGC GCTGGTCCGC AGCCGCGGCT TCCGCATCAA GGCAATGGTC GAGAAGCAGC GGTTCGGCGC CGAACTGGTG CAGCACATCC TCGGCAACGT CGAAGCGTAT CTGGCCTGCT GCCAACTCGG CATCACCATG GCGTCGCTCG GGCTCGGCTG GGTCGGCGAG CCGACTGTCT CGGCGCTCCT CGCGCCGGTG CTGCAGCCGA TGGGCCTGTC GGAATCGGCA CAGCACTTCA TCGCGTTTCT CGGCGGCTTC CTGTTCTTCT CGTCACTGCA CATCGTAATC GGCGAGCAGG TGCCGAAAAC GCTGGCGATC CGCCAGCCGG AGCCGGTGTC GCAGTGGATC GCCTATCCGC TGCACATCTC GTTCATCTTG CTGTATCCGC TGAACTGGCT GCTGAACCAA GCCTCGCGCG GGGTGTTGAA GCTGCTCGGC GTCGAAGAAA ACTCCGAGCA CGAGATCCTC ACCGACGTCG AAATCGAAGG GCTGGTCGGC GAATCCGCCG AGCACGGCAA GATCGAAAGC GGCGAGGCCG AGTACATCCA GAACGTGTTC CGGTTCGGCG AACTGGTGGT GTCGGATGTG ATGGTGCACC GGACCGCGAT GGTTACCGTC AATGCCGACC AGCCCAGCGA ACAATTGGTG AAGGAAGTTC TGGCGACTGA GTACACCCGC GTGCCGCTGT GGCGCGACAA GCCGGAGAAC ATCGTCGGCG TGCTGCACGC CAAGGATTTG CTGCGCGCGC TACGCGCCGC TGACGGCGAC GCGTCCAAAC TCGACATCGG CAAGATCGCG TTGCAGCCGT GGTTCGTTCC GGAGATGCGT CCGGTGTCGG AACAGCTCAA AGCGTTCCGC ACCCGCAAGA CTCACTTCGC GCTGGTCGTC GACGAATACG GCGAAGTCGA AGGCATGGTG ACGCTCGAGG ACATCCTCGA GGAAATCGTC GGCGACATCT CCGACGAGCA CGACGTGGTG GTGGCCGGCG TTCGGACTCA GCCGGACGGC TCGGTGGTGG TCGACGGCTC GGTGCCGATC CGCGATCTCA ACCGCGCAAT GGACTGGAAC CTGCCCGACG AGGAAGCCAC CACAGTGGCG GGCCTGGTGA TTCACGAGGC GCGGTCGATT CCGGACCGCG GCCAGAATTT CACCTTCCAC GGCTTCCGCT TCCGCGTTCT GCGCCGCGAG CGCAACCGCA TCACCGCGCT GCGGATCTCG CCGGTGCCGC GCGACGGCGA GGCTGATCTG ATCGAAAAGA AGAGCCGGAA GACGCGGGCC GGATCCGTCT AA
|
Protein sequence | MNSTLSNVLI AVLLLIANAF YVAAEFALVR SRGFRIKAMV EKQRFGAELV QHILGNVEAY LACCQLGITM ASLGLGWVGE PTVSALLAPV LQPMGLSESA QHFIAFLGGF LFFSSLHIVI GEQVPKTLAI RQPEPVSQWI AYPLHISFIL LYPLNWLLNQ ASRGVLKLLG VEENSEHEIL TDVEIEGLVG ESAEHGKIES GEAEYIQNVF RFGELVVSDV MVHRTAMVTV NADQPSEQLV KEVLATEYTR VPLWRDKPEN IVGVLHAKDL LRALRAADGD ASKLDIGKIA LQPWFVPEMR PVSEQLKAFR TRKTHFALVV DEYGEVEGMV TLEDILEEIV GDISDEHDVV VAGVRTQPDG SVVVDGSVPI RDLNRAMDWN LPDEEATTVA GLVIHEARSI PDRGQNFTFH GFRFRVLRRE RNRITALRIS PVPRDGEADL IEKKSRKTRA GSV
|
| |
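The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and comparing the result with the listed protein. The following is a minimal sketch, not the method used by the database. It assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG even though the gene is on the minus strand) and that the space-separated blocks of ten are display formatting only. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in permitted start codons, so a standard codon map is used. `clean`, `translate`, and `gc_fraction` are hypothetical helper names.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11 shares it).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand CDS, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three display blocks of the gene sequence in this record.
    first_blocks = "ATGAACTCGA CTCTGAGCAA CGTTCTGATC"
    print(translate(first_blocks))
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence checks the two fields against each other, and `gc_fraction` on the full gene sequence can be compared with the reported GC content of 64%.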