Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_2203 |
Symbol | |
ID | 6409863 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 2388328 |
End bp | 2389377 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 642712087 |
Product | hypothetical protein |
Protein accession | YP_001991199 |
Protein GI | 192290594 |
COG category | [C] Energy production and conversion |
COG ID | [COG4313] Protein involved in meta-pathway of phenol degradation |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAAG TGAATTCGGT GGCGCTGCCT GCGCTGTTGC TCGGGGCGGT CGTCTGTCTC GGCGGGACGG CCGCTCGTTC GGACGAAGCG GGCGTGAGCA TGTGGCTGCC CGGCACATTC GGATCGATGG CTGCGGTGCC GACCGCTCCG GGATGGTCCG CGGCCGGCGT CTACTACCAT ACGTCGCTCA GCGCGGGTCG CGAGGTCGCG ACCGCACGGG AAGTCCAGAT CGGGCGGTTC ACCCCGAACT TGAATGTCGG GCTTTCGGCC GATCTCAATG CGACCGGCGA TCTCTTCCTG CTGGCCCCGA CCTACACCTT TGCCACGCCG GTGCTGGGTG GGCAGGCTTC GGTGGGATTC ACGAGCGTGA TGGGGCGCGC GAGTGCCGGA CTGAATGCGA CGCTGTCGGC AACGATCCCG CCGTTCAACG TGATCCGTTC GGATTCGATC AACGATTCAG TGTCGGGCGT CGGTGACCTG TATCCGGTCG CCAAGCTGAA GTGGAATCAA GGCGTCAATA ATTGGATGAT CTATGCGACC GGCGATATTC CCGTCGGTGC GTATAACCGC AGCCGCATCA TCAATCTCGG CATCGGCCAC GGCGCGATCG ATCTCGGCGG CGGCTATACG TACTTCAATC CGAAGGCGGG GTCGGAGTTC TCCGTGGTGA TGGGCTTCAC CGAGAATTTC AAGAATACGT CGACGGACTA CAAGAACGGT CTCGATTTCC ACCTCGATTG GGCCGTGTCG CAATTCGTAT CGAAGCAGTT TTTCGTCGGC GCGGTCGGCT ATGCGTACAA TCAGCTCACC GGCGACAGCG GAACGGGTGC GAAACTCGGC CCGTTCAAAT CGCGTGTCGA AGCGGTCGGT CCCCAGATTG GCTACCTGTT CCCGGTCGGC GACATGCAGG GCGTCCTGAA TTTGAAGGGC TATTGGGAGT TCGACGCGCA GAACCGGGCG AAGGGTTGGA ACACCTGGTT GACATTCGCC GTCTCCGCCC CGCCGCCGCC GCCGCCGCCA CCGGCCGGTG CGAGCCTTCC AACCAAATAG
|
Protein sequence | MNKVNSVALP ALLLGAVVCL GGTAARSDEA GVSMWLPGTF GSMAAVPTAP GWSAAGVYYH TSLSAGREVA TAREVQIGRF TPNLNVGLSA DLNATGDLFL LAPTYTFATP VLGGQASVGF TSVMGRASAG LNATLSATIP PFNVIRSDSI NDSVSGVGDL YPVAKLKWNQ GVNNWMIYAT GDIPVGAYNR SRIINLGIGH GAIDLGGGYT YFNPKAGSEF SVVMGFTENF KNTSTDYKNG LDFHLDWAVS QFVSKQFFVG AVGYAYNQLT GDSGTGAKLG PFKSRVEAVG PQIGYLFPVG DMQGVLNLKG YWEFDAQNRA KGWNTWLTFA VSAPPPPPPP PAGASLPTK
|
| |