Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4116 |
Symbol | |
ID | 6411800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 4411413 |
End bp | 4412618 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642713998 |
Product | protein of unknown function DUF1501 |
Protein accession | YP_001993087 |
Protein GI | 192292482 |
COG category | [S] Function unknown |
COG ID | [COG4102] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCGTC GTGATCTGCT GAAAGCCGTC GCGGCCCTCG CACCGGCCGC ACTGACGACC ACGATCGCCG GTCGCGTCTG GGCCACGCCG GCCACCGACG CCAAGCTTCT GGTGGTGTTC CTGCGCGGCG CTTACGACGC GGCCAACGTG CTGGTGCCGG TGTCGAGCAG CTTCTACTAC GAGTCACGGC CGAACCTCGC GATTGCCAAG CCGGACGTCG GCAACCCCAA TGCGGCGGTC GCGCTCGACG CCGATTGGGG CCTGCACCCG GCGCTGCGCG ACAGCTTGGC GCCGCTATGG ACGAGCCGCG AGATCGCATT CGTGCCGTTC GCCGGCACCA GCGACGACAC CACCCGCAGT CACTTCGAGA CGCAGGACAC GATCGAACTC GGGCAATCGA CCAAAGGCTC GCGCGACTAT CGCTCCGGCT TCATGAGCCG GCTTGCGGCG GAATTGACGC GGGTGAAACC GATCGCCTTC ACCGAGCAGC TGCCGCTGAT TTTCCGCGGC CAGGCGGAGA TTCCGAATAT TGCACTCGGC AATGTCGGCA AGCCCGGCGT CGATGACCGC CAGGCCGAGC TGATCAAGCA GATGTACGCC AAGACCAAAC TCGCATCCGC GGTGGCAGAA GGCTTTCGGG TGCGCGACGA GGTGGTGAAA TCGATTGCCG ACGAAATGAC TGCAGCGAAC CGCGGTGCGG TGTCGCCGCG CGGCTTCGAG CTGTCGGCGC GCCGGATCGG CCGGCTGATG CGCGAGCAGT TCAACCTCGG CTTCGTCGAT GTCGGCGGCT GGGACACCCA CGTCAATCAG GGCGCGGCGA CTGGTTATCT CGCTGACCGG CTCGGCGAGC TCGGCCGCGC GCTCGCCGGA TTCCGCGAAG AGATCGGGCC GGCGGCGTGG CGCGACAGCG TGGTGGTGGT GATCTCCGAA TTCGGCCGGA CGTTCCGCGA GAACGGCGAC CGCGGCACCG ATCACGGCCA TGGCAGCGTT TACTGGGTGC TCGGCGGCGG GCTCAACGGC GGCCGCATTG CCGGCGAGCA GATCACGGTG GCGCAGCCGT CGCTATTCGA GAACCGTGAT TATCCGGTGC TGACCGATTA TCGCGCGCTG TTCGCGGGTC TGGTGCAGCG GATGTACGGG CTCGATGCTG CGGCGCTGCA ACGGATCTTC GCCGACGTGC GTCCGGCTGA TCTCGGCCTG GTGTGA
|
Protein sequence | MNRRDLLKAV AALAPAALTT TIAGRVWATP ATDAKLLVVF LRGAYDAANV LVPVSSSFYY ESRPNLAIAK PDVGNPNAAV ALDADWGLHP ALRDSLAPLW TSREIAFVPF AGTSDDTTRS HFETQDTIEL GQSTKGSRDY RSGFMSRLAA ELTRVKPIAF TEQLPLIFRG QAEIPNIALG NVGKPGVDDR QAELIKQMYA KTKLASAVAE GFRVRDEVVK SIADEMTAAN RGAVSPRGFE LSARRIGRLM REQFNLGFVD VGGWDTHVNQ GAATGYLADR LGELGRALAG FREEIGPAAW RDSVVVVISE FGRTFRENGD RGTDHGHGSV YWVLGGGLNG GRIAGEQITV AQPSLFENRD YPVLTDYRAL FAGLVQRMYG LDAAALQRIF ADVRPADLGL V
|
| |