Gene Information |
Locus tag | Rpal_4038 |
Symbol | |
ID | 6411721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 4330799 |
End bp | 4331707 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 642713920 |
Product | outer membrane assembly lipoprotein YfiO |
Protein accession | YP_001993009 |
Protein GI | 192292404 |
COG category | [R] General function prediction only |
COG ID | [COG4105] DNA uptake lipoprotein |
TIGRFAM ID | [TIGR03302] outer membrane assembly lipoprotein YfiO |
|
|
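The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions on the replicon (the record does not state this, but it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Sketch: cross-check the length fields of the record.
# Assumption: Start bp / End bp are 1-based, inclusive coordinates.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Protein length for an unspliced CDS whose last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


START_BP = 4330799  # "Start bp" field
END_BP = 4331707    # "End bp" field

length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints: 909 302
```

The two printed values match the Gene Length (909 bp) and Protein Length (302 aa) fields.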
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.890375 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGCAC AGCGGATCGC GCTCGGACTG CGGAACGGAG AGACTTCTCG CAGGGGCGAT CGCCTGCGGC GCGGGCTGCC GATGATCTTC AGCCTGCTGG CGCTGTCGCT TCCGCTCGGC GGCTGCGGCA CCGGCGCGTT GTGGGACAAG TTCCTGGCCA AAGACGACAA GATGGTCGAT GAGCCGGCGG ACAAGCTCTA CAACGAGGGC TTGTATCTGA TGAATCAGGA CAAGGACACC AAGGGCGCCG CCAAGAAGTT CGAGGAAGTC GACCGTCAGC ACCCGTATTC GGATTGGGCG CGCAAGTCGC TGCTGATGTC GGCCTATGCT TATTATCAGG CCGGCGACTA CGACAGCTGC ATCGGCTCGG CGACCCGCTA CGTCACGCTG CATCCCGGCA GCCCCGACGC CGCCTATGCG CAGTATCTGA TCGCGGCGTC GAATTACGAC CAGATCCCGG ACATCTCGCG CGACCAAGGC CGCACCGAAA AGGCGATCGC CGCGCTCGAA GAGGTGATCC GCAAATATCC GACCTCCGAA TATGCCAACA GCGCCAAGCA GAAGCTGGAA GGCGCCCGCG ATCAGCTCGC CGGCAAGGAG ATGGATATCG GCCGCTATTA CATGAGCAAG CGCGACTACG CCGCGGCGAT CAATCGGTTC AAGACCGTGG TGACGCGCTA TCAGACCACC CGCCACGTCG AAGAGGCGCT GGCACGTCTG ACCGAGGCCT ATATGGCGAT CGGCATCGTC GGCGAGGCGC AGACCGCCGC CGCCGTGCTT GGCCACAACT TTCCTGACAG CCGCTGGTAC AAAGACGCCT ATAATCTTGT AAAGTCGGGT GGTTTAGAAC CCGCCGAGAA CAAGGGCTCC TGGATCAGCA AGTCCTTCAA AAAGCTCGGT CTCGGATAG
|
Protein sequence | MSAQRIALGL RNGETSRRGD RLRRGLPMIF SLLALSLPLG GCGTGALWDK FLAKDDKMVD EPADKLYNEG LYLMNQDKDT KGAAKKFEEV DRQHPYSDWA RKSLLMSAYA YYQAGDYDSC IGSATRYVTL HPGSPDAAYA QYLIAASNYD QIPDISRDQG RTEKAIAALE EVIRKYPTSE YANSAKQKLE GARDQLAGKE MDIGRYYMSK RDYAAAINRF KTVVTRYQTT RHVEEALARL TEAYMAIGIV GEAQTAAAVL GHNFPDSRWY KDAYNLVKSG GLEPAENKGS WISKSFKKLG LG
|
| |
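The gene sequence above is listed in coding orientation (it begins with ATG and ends with TAG), so it can be translated directly to compare against the protein sequence and the GC content field. The following is a minimal sketch, not the database's own pipeline. It assumes that the spaces in the sequence fields are display formatting only. The codon-to-amino-acid mapping of translation table 11 is the same as the standard code (the tables differ only in which codons may act as starts), so the standard mapping is used here. All function names are hypothetical.

```python
from itertools import product

# Standard codon mapping in NCBI order (bases T, C, A, G); translation
# table 11 shares this mapping and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing (blocks of 10) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGTCGGCAC AGCGGATCGC GCTCGGACTG"))  # prints: MSAQRIALGL
```

Passing the full Gene sequence field to `translate` and `gc_fraction` gives values that can be compared with the Protein sequence and GC content fields of the record.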