Gene Information |
Locus tag | Rpal_2067 |
Symbol | |
ID | 6409727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 2238927 |
End bp | 2239997 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642711953 |
Product | polysaccharide deacetylase |
Protein accession | YP_001991065 |
Protein GI | 192290460 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0726] Predicted xylanase/chitin deacetylase |
TIGRFAM ID | |
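
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the source record: the function names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the gene length includes a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2238927, 2239997)
print(length)                  # 1071
print(protein_length(length))  # 356
```

Under these assumptions the computed values agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields.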
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTAAAT TTGCGGTGCT GGCAGCCGGC TGTAGCACGC TGGCTGTTCT GGTCGGGCTC GGCGCCGGCC GCGCGTGGTT CACGCTCACC CCCTCCAAGA TCGACGCCGC CCGCGCTCCG TCCGCTTCGG AAGAAATCAC CACCGGCACG ATCGCTGCGC GCTGGCCGAC GCCCGGCGGT GCGCCTCCGC GCGCCGCGCA GAGCCCCGCT CCGAACATCC AGCCGGCCCG CGCCACAGCC GACCCCGCGC TCAACGCCAA GCCTGCTGCG CTGCCGGCCC CGGCGCCGGT GCGGCAGACC TGCAGCAATC CGAACGCGCT CGGCATTTCG CGTACCGTCG AGATCGACAC ATCGGGCGGC CCTGGCCTCG GCATGTCGCA ATATCGCGAC TACGACTTCC TGCAGCCCGG CGAAGTCGCA CTGACCTTCG ACGACGGCCC TTGGCCGGTG AATACTCCGG CGGTTCTGGC GGCGCTGGAG GCCCAATGCG TCAAGGCGGT GTTCTTCCCG ATCGGCAAGC ACGCGAGCTG GCACCCGGCG ATCCTCAAGC AGGTGATCGC CGCCGGCCAC ACCGTCGGTT CGCACACCTG GTCGCACGTC AATCTCGCCA GCAAGCCGTT CGCCGAGGCC AAGAACGAAA TCGAGAAGGG CATCAGCGGC GTAGCGCAAT CGGCAGGTCA GCCGACCGCG CCGTTCTTCC GCTTTCCGCA GCTGCGCCAG ACGCCGGAGC TGAAGGCGTA TCTCGGCGAA CGCAACATCG CGACCTTCTC GATCGACGTC GACTCCGAAG ACTTCCGGAT TCACAAGCCT GATCAGCTCA TCAACGGCGT GATGGCCAAG CTGAAGAAGG CCGGCAAAGG CATCCTGCTG ATGCACGACT TCCAGAAGTC GACCGCGCAG GCGCTGCCCG AGCTGCTCGC GCAGCTGAAG GCCGGCGGCT ACAAGATCGT TTTCATCACC GCCAAGGATC GGGTTACCAC GCTGCCGGAA TACGACGCCC AGGTCGCGCC GGCGCAGCCC ACCGCCTCGA CCGCAAAGCC GATCGCCAAT GTGATACGCA CCGTCGACTA G
|
Protein sequence | MRKFAVLAAG CSTLAVLVGL GAGRAWFTLT PSKIDAARAP SASEEITTGT IAARWPTPGG APPRAAQSPA PNIQPARATA DPALNAKPAA LPAPAPVRQT CSNPNALGIS RTVEIDTSGG PGLGMSQYRD YDFLQPGEVA LTFDDGPWPV NTPAVLAALE AQCVKAVFFP IGKHASWHPA ILKQVIAAGH TVGSHTWSHV NLASKPFAEA KNEIEKGISG VAQSAGQPTA PFFRFPQLRQ TPELKAYLGE RNIATFSIDV DSEDFRIHKP DQLINGVMAK LKKAGKGILL MHDFQKSTAQ ALPELLAQLK AGGYKIVFIT AKDRVTTLPE YDAQVAPAQP TASTAKPIAN VIRTVD
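
The sequences above are shown in space-separated blocks of ten characters, so the spacing has to be removed before any downstream use. The following is a minimal sketch of that cleanup plus a GC-percentage calculation; the helper names are hypothetical and not part of the source record, and the example uses only the first two blocks of the gene sequence.

```python
def clean_sequence(displayed: str) -> str:
    """Remove the display spacing (and any line breaks) and upper-case the sequence."""
    return "".join(displayed.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First two displayed blocks of the gene sequence only, for illustration.
fragment = clean_sequence("ATGCGTAAAT TTGCGGTGCT")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 45.0
```

Applying `gc_percent` to the full cleaned gene sequence gives a value that can be compared with the reported GC content field. The short fragment here is not expected to match that figure.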
|
| |