Gene Information |
Locus tag | Rpal_4388 |
Symbol | |
ID | 6412072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4716682 |
End bp | 4717842 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 642714270 |
Product | hypothetical protein |
Protein accession | YP_001993359 |
Protein GI | 192292754 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
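The length fields in this record follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it matches the stated gene length) and that the last codon is a stop codon that adds no residue, the arithmetic can be checked as follows. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4716682
end_bp = 4717842

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# One codon (3 bp) per residue; the terminal stop codon is subtracted
# because it does not code for an amino acid.
protein_length_aa = gene_length_bp // 3 - 1

print("Gene length (bp):", gene_length_bp)
print("Protein length (aa):", protein_length_aa)
```

Under these assumptions the results agree with the Gene Length and Protein Length rows of the record.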
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.315553 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGCCGAGA CGTTCCCCAC CACGCCGTTC GTGCATCGCC GCGGCGGTGC CCTCGCCACG ACGACCGCCG ACCCCGGCGT ACTGCCCGGG GTCGCTGCGA TCGACAAGGC AGCTTGGTGC GATCTGGCGA CCCGCGTGAT CGAGCCGAAC GGCTACTACC TGCCGGAATG GGTGATGGCG GCGAACGGCG ACGACGCGTC GCCGCGCGCG CTGACCGCGC ACGATTCCGC CAGCCGTCTG ATCGGACTCC TGCCGGTGAT CTCGTGCTGG CGCGCGTTCC GCCTGCCGCT GCCGGCGCTG GTATCGGCCG ATCCGTTCCG CTCGCTGGAT ACGCCGCTGC TTGACCGCGA TGCAGCCAAT GACGCCGCCG CCAAGATCAT CGCGCAGGCA CGCGCCGCAG GCGCCCGCGC CTTGGTACTG CGCGACGTCG CCCGCGAGGG CGAAGCCGTG GCTGCGTTCA CACGCGTGCT CGACGCCGAA GGCCTCAACC CGCGCCTGAT CAACGGCTGG ACCCGCGCCG GCCTCGACGC CACCCGCGAC GGCGAAACCC TGCTGCGCCA GGATCTCGAT ACGAAGAAGC TGAAGAACCT GCGCCGCCTC GAGCGTCGTC TCGGCGAGCA CGGCGAGGTG CGCTTCACCG TCGCTGATAC CGCGGATGAG GCAGCGCGCG CGTTCGACGT GTTTCTGGCG CTGGAAGACA GCGGCTGGAA GGGCCGCCGC GGCAGCTCAC TGAAGCGGCA GCCGGAGCTT GCCGCGCGGC TGCGCAGCGC CGCGGTCGCG CTCGCCTCGC GCGGCCAATG CGAGGTGATC ACCCTGTCTG CCGGTGTGAC GCCGGTCGCA GCCGGAATCG TGCTGCGCCA CGCCGACCGC GCTTACTTCT TCAAGCTCGG CATCGACGAG AGCTTTGCGC GCTGCTCGCC CGGCGTGTTG CTGACAATGG CGCTGACCCG GCATCTGTGC GCCGACCCGG AGATCCGCTT CGCCGACTCC ACCGCCAGCG CCCAGCATCC AATGATCGAC CCGCTGTGGC GCGGCCGGTT CGCGGTCGGC GACCTGGTGC TGCCGCTGCG CAAGCGCGAT CCGCTGTTCG CACCGATCGT CGCGGCTCTG TCGGCGCGCG ACCGGCTGCG GCACCTCGCC AAGCGGCTGT TGAAGCGCTG A
Protein sequence | MAETFPTTPF VHRRGGALAT TTADPGVLPG VAAIDKAAWC DLATRVIEPN GYYLPEWVMA ANGDDASPRA LTAHDSASRL IGLLPVISCW RAFRLPLPAL VSADPFRSLD TPLLDRDAAN DAAAKIIAQA RAAGARALVL RDVAREGEAV AAFTRVLDAE GLNPRLINGW TRAGLDATRD GETLLRQDLD TKKLKNLRRL ERRLGEHGEV RFTVADTADE AARAFDVFLA LEDSGWKGRR GSSLKRQPEL AARLRSAAVA LASRGQCEVI TLSAGVTPVA AGIVLRHADR AYFFKLGIDE SFARCSPGVL LTMALTRHLC ADPEIRFADS TASAQHPMID PLWRGRFAVG DLVLPLRKRD PLFAPIVAAL SARDRLRHLA KRLLKR
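The sequences above are printed in space-separated blocks of 10, so they need to be joined before any calculation. The sketch below shows one way to do that and to compute a GC percentage like the one reported in the record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene, so its result is not expected to equal the 71% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence, as printed in the record.
first_blocks = "TTGGCCGAGA CGTTCCCCAC"

joined = clean_sequence(first_blocks)
print(joined, len(joined))
print(gc_percent(first_blocks))
```

The gene begins with TTG, yet the protein begins with M. This is consistent with translation table 11, in which TTG is an accepted alternative start codon and an initiating codon is translated as methionine.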