Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_3832 |
Symbol | |
ID | 6411510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4117739 |
End bp | 4118857 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642713713 |
Product | protein of unknown function UPF0118 |
Protein accession | YP_001992806 |
Protein GI | 192292201 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTCGCT TGCTGCATGA AGGTTATTTG GCGGCCGGTC GCCGCAATGA GGGGCACGGC GTGAACGATC TGAGTAATAC GCATCTGAGG GACATCAGCC GAACGCTTCG AGTCAGTGCA ATCGCTTTAG TTCTGATGAT CGTATTGTGG GTGCTGCGAG ATATTTTGCT GCTTGGTTTT GCGGCGGCCC TCATTGCCTG CGTGTTGCGC GGCGCAGCTA ACGTTCTGCA TCGAAGAACC GGATTGAGCG ATGGTTTGTC GCTGTTGATC GTCGTGATGA CGATCGTTCT GGCGCTCGGC GCGCTGCTCT TCTGGCGTGG AACCGCAATC GCCAACGAAG TCGCGCAGAT GTATGATCAA TTAACCGCGC AGATGCAGAG TTTGTGGCAG CAGATGTCCG GTAGTGGTTG GCCGGCGCTG CTCGCGAAGC AGCTACGGAA TCTCTCGGAA TCGGCACGAA AGAATCTAAC TGGATATGTC CCGGGCGTTG CCAGTTCGGT GCTTGGTATC GGCGGCAGCG TTGTTGTGGT GCTGGCCACC GCGCTATTTC TGGCTATCTC GCCGCGGAGC TACATGGACG GCGCACTGCG GCTGCTGCCG GTGCAATGGC GGCCGCGCGG TCGCCACGTG ATGCTCGAGA CCGGCAGCAC GTTGCAGTTA TGGTTTCTGG GCCAACTTGC CGACATGCTG ATTGTGGCGT TGCTGATCGG CGTCGGGCTG TATCTTCTCG GCGTTCCGAT GGCGCCAACC TTGGCGCTGC TGGCGGGGTT GTTGAACTTT GTGCCCTATG TGGGCGCGTT AGCCGGAGCG GTTCCGGCGG TGCTGGTGGC GCTGGCACAA TCGCCGAGCT TGGCGCTATG GGTCGCATTG CTGTTCATCT GCGTACAGAC GCTGGAGGGC AATGTTGTCG CCCCGCTAAT CCAGCGCCGG ACGGTCTCCT TGCTACCGGC GTTAACGATC CTTTCGCAGA CAATTCTGGG GACGTTATTT GGCGTGGTAG GGCTCGTCAT CGCAACGCCG CTGACCGCCG CGCTAATGAC CGCGGTGCGG ATGATCTATA TCGAAGACCT ATTGGAGCGC GATTGCGGCG CAGAAGATGC CAAATGCACC CGGCCTTGA
|
Protein sequence | MGRLLHEGYL AAGRRNEGHG VNDLSNTHLR DISRTLRVSA IALVLMIVLW VLRDILLLGF AAALIACVLR GAANVLHRRT GLSDGLSLLI VVMTIVLALG ALLFWRGTAI ANEVAQMYDQ LTAQMQSLWQ QMSGSGWPAL LAKQLRNLSE SARKNLTGYV PGVASSVLGI GGSVVVVLAT ALFLAISPRS YMDGALRLLP VQWRPRGRHV MLETGSTLQL WFLGQLADML IVALLIGVGL YLLGVPMAPT LALLAGLLNF VPYVGALAGA VPAVLVALAQ SPSLALWVAL LFICVQTLEG NVVAPLIQRR TVSLLPALTI LSQTILGTLF GVVGLVIATP LTAALMTAVR MIYIEDLLER DCGAEDAKCT RP
|
| |