Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4933 |
Symbol | |
ID | 6412624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5313845 |
End bp | 5314924 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 642714815 |
Product | Extensin family protein |
Protein accession | YP_001993897 |
Protein GI | 192293292 |
COG category | [S] Function unknown |
COG ID | [COG3921] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.895839 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGCGC TGGTTGCAGG CGGCACCTCG GCCGCGAACG CGCGTGAGCA CGTGCCGCTG CCGAAGCCGC GTCCGGCCGA GGCGCCGCAG GCGAATGCGC GCGAGGCCGA GCCCGGCGAG GATGAGCCAA CGCCTGCGGA AGCTGCCGCC CCCGACAGCG CGCCCGCCGC CAAAGCCGAT GCGGCGCAGG CTCCCAAGCC ACCGTCCGAA TGCCGGCTGG CCCTGACCGA GCAGATCGCG ATCGCGCCGA GCATTCCGGA TATCACCGGG CCGGGCGCCT GCGGCGGATC CGATCTGGTC CGGCTCGAAG CGGTGGTGCT GCCGGACGGC CGCCGTGTCT CGATGTCGCC GGCCGCGACG CTGCGCTGCG GCATGGCGCG GGCGATCGCT GATTGGGTGC GCGCCGACAT CGCGCCGCTG GCCGTCTCGC TCGGCAGCCG GGTCTCGGAT CTGGACAATT TCGACTCTTA TGAATGCCGC GGCCGCAACC GGGTGCGCGG CGCTAAGCTC AGCGAGCACG GCCGTGCCAA TGCGCTCGAC CTCCGCGGCA TCAAGCTCGC CGACGGGCGG ATGATTTCGC TGACTGACCG CGAGGCGCCG CGCGCGCCCA GGGAAGCCGT GATGCAATCG GTGTGCGCGC GCTTCACGAC CGTGCTCGGT CCAGGCTCCG ACGGCTATCA CGAGGACCAC ATCCACCTCG ATCTCGCCGA GCGCCGCGGC GGCTACCGGA TGTGCCAATG GGCCTTATAC GAGGGGCTCC CGAATATTGC GCCGGTGATG CCGCTGCCGC GCCCGGCCGA AGCGCCGCCG CGCGAAGTCG CGGCCGACGA CGAGCGCGCG CCGCAGCAGG CAGCCCCGTC CCAGTCCGAG GCAGCCGAGC AGGCCCCGAC CGAGGAGGCC GAACGCGAAC AGGCCGAGAC CCCACCGCCT CCGCCGCCGA AGCCGGCCAA GCGCGCCAAG TCCAAGGCGG CCGCCGCGAA GCCGGCCGCG AGCAAGCCGA TCGATCTGAA GCCGCAAGCG GCGCCGGCCG CAACGCCGGC GGCTCGCGGC AAGCCGGCGC CGACACGACC ACCGGTCTGA
|
Protein sequence | MIALVAGGTS AANAREHVPL PKPRPAEAPQ ANAREAEPGE DEPTPAEAAA PDSAPAAKAD AAQAPKPPSE CRLALTEQIA IAPSIPDITG PGACGGSDLV RLEAVVLPDG RRVSMSPAAT LRCGMARAIA DWVRADIAPL AVSLGSRVSD LDNFDSYECR GRNRVRGAKL SEHGRANALD LRGIKLADGR MISLTDREAP RAPREAVMQS VCARFTTVLG PGSDGYHEDH IHLDLAERRG GYRMCQWALY EGLPNIAPVM PLPRPAEAPP REVAADDERA PQQAAPSQSE AAEQAPTEEA EREQAETPPP PPPKPAKRAK SKAAAAKPAA SKPIDLKPQA APAATPAARG KPAPTRPPV
|
| |