Gene Information |
Locus tag | Rpal_5044 |
Symbol | |
ID | 6412738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 5425625 |
End bp | 5426539 |
Gene Length | 915 bp |
Protein Length | 304 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642714929 |
Product | thioesterase superfamily protein |
Protein accession | YP_001994008 |
Protein GI | 192293403 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2050] Uncharacterized protein, possibly involved in aromatic compounds catabolism |
TIGRFAM ID | [TIGR00369] uncharacterized domain 1 |
| |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGACC AATCGATCCC TGATCCGGTA ATGTCGTTCG ACGACAAAGC GCGGATGATT CAGCACCGCC GTTCGATCCA CGGCGCCATC ATCGGCCTGC AGCTCGACCG CTACGCACCC GCGGAAGCGT GGAGCAGCCT GCCCTATCAC CCGGTATTCG TCGGCGACGT TTCGACCGGC GTGATCCATG GCGGCGTCGT CACCGCGATG CTCGACGAGA GCTGCGGCAT GGCGGTGCAG CTCGCGCTGC CCGGCACCAC CGCGATCGCC ACGCTCGACC TGCGGATCGA TTACTTGCGA CCGGCGACAC CCGGACAGGT GATGCGCGCG CACGCGCATT GCTATCACCT CACCCGTTCG ATCGCGTTCG TCCGCGCCAC CGCGTATCAG GACGCCGAGG ACGTTCCGAT CGCCACCGCC ACCGCGATGT TCATGGTCGG CGCCAACCGC ACCGATATGC TGCGGCAGAC GCCGAAGGTG ACGATGGACT CCGCGCCCGA GCTGGTCGCC CCCGAAGATC CGGACGGCGG GCCGCTTGCG ATCAGCCCGT ATCCACGCTT CCTCGGCATC CGCGTCGATG GCGATGCCCA GGCGATGATG CCATATCATC CGAAGCTGGT CGGCAATCCG ATCCTGCCGG CGCTGCATGG CGGCGTGATC GGCGCCTTCC TGGAGACCGC CGCGATCGTC AGCGTGCGCC GAGAGATCGG CCTCGCCACC GCGCCGAAAC CGATCGGGCT CACCGTGAAC TATCTGCGTT CGGGCCGGCC GCTCGACACC TTCGCCAAGG TCTCGATCGT CAAGCAGGGC CGCCGGGTGG TTGCGTTCGA AGCGCAGGCC TATCAGCGCG ATCCGGCCGA GCCGATCGCG TCGTGCTACG GCCACTTCAA GCTGCGCTCC GGCCCGGCGG AGTAA
|
Protein sequence | MSDQSIPDPV MSFDDKARMI QHRRSIHGAI IGLQLDRYAP AEAWSSLPYH PVFVGDVSTG VIHGGVVTAM LDESCGMAVQ LALPGTTAIA TLDLRIDYLR PATPGQVMRA HAHCYHLTRS IAFVRATAYQ DAEDVPIATA TAMFMVGANR TDMLRQTPKV TMDSAPELVA PEDPDGGPLA ISPYPRFLGI RVDGDAQAMM PYHPKLVGNP ILPALHGGVI GAFLETAAIV SVRREIGLAT APKPIGLTVN YLRSGRPLDT FAKVSIVKQG RRVVAFEAQA YQRDPAEPIA SCYGHFKLRS GPAE
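The length fields in the record can be cross-checked against the coordinates with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC content in percent; whitespace between sequence groups is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record for Rpal_5044.
length_bp = gene_length(5425625, 5426539)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the coordinates give 915 bp and 304 aa, matching the Gene Length and Protein Length fields. `gc_percent` can be applied to the Gene sequence field as pasted, since it strips the spaces between the 10-base groups.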