Gene Information |
Locus tag | RoseRS_3394 |
Symbol | |
ID | 5210371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 4263285 |
End bp | 4264859 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640596990 |
Product | HEAT repeat-containing PBS lyase |
Protein accession | YP_001277703 |
Protein GI | 148657498 |
COG category | [C] Energy production and conversion |
COG ID | [COG1413] FOG: HEAT repeat |
TIGRFAM ID | |
|
|
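The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The record does not state either assumption, but both are consistent with its values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for RoseRS_3394.
length_bp = gene_length(4263285, 4264859)
print(length_bp)                  # 1575
print(protein_length(length_bp))  # 524
```

Both results agree with the Gene Length and Protein Length fields.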
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.232744 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.348803 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGTC AACGAGATGT GCAAGCGCTA ATCGAAGTAC TGGGACATGC AAAAGATGAG CAAGTGCGCC AGGCTGCGCT TGAAGCGCTG ATCGAGATTG CGCTCTCCCT GGCTGCCGAA AAGGAAGCCA GAGAAATCGA AGATTTTTTT CAGCGATTGC TTCAGGCAGG CACGCCAGGC GAGGCATCCC CCCAAACGCC GCCTGAGGAC GAAAACGCGG CGCGATACCA GGCCGCCATC CAGGCGCTGG GAGAAATCGG GGCAACTGCC ATTGCGCAAC TCCTCAGCGC ACTGAAGAAT CCACAAAAGC GTCGTTACGC CGCCCAAATC CTGGAGCAAA TCGGCTGGCA GCCTGGGCTG GATGAAAACG GCGCTCTGTA TTGGATCGCA AAAGGCGAAT GGGATAGATG CGCCGCGCTC GGTGGGATTG CCATAGCGCC GCTGTTACTC GTCCTTCAGG AACAGGACAA CGAAACGCGC CGGGCTGCTG CCCATGTCTT GGGACAAATC GGCGATGCCC GCGCCGTAGA GCCGCTCCTT GATTTACTCG TGGATCAGGA CCCAAAGGTG CGCCGGGCAG CCATCGAAGC CATTGGCCGG ATTGGCGATC CCCACACCGT TGAGGCGCTT GAACTGGCGC TCCAGGACAG GGAGGGGAGC GTACGCCTGG CGGCGACGAG AGCCTTGGGG CAAATTGGCA GCCCGCGCGC GGTGGAAGCG CTGATCGCTA CCTTGCAGGG CAGGTTCGCG CATAGACTTC TTCTGGAAGA ACTGGACAAG ATGAAATCGG AAGATGCTCA CCAGATTGCG GCGCTTCTCA GCTCTTGGGA TGAAAACAGC GCGCCTGTGC ATCTTCTTCT CCAACAAGCG CTGAAAGCAG CCAGCCATCC GTTTGTCGTC TATTTGCTTA CCACAATCCT GACCGAATGG AAGAACGAGC GTGCTTCTGC TATAGAGGCG CTGGGGCGAA TTGGCGATTC CCGCGCCGTG GAACCGCTCA TCGCTGCGCT CAAGGACGAG GACGTGAATG TGCGCTGGCC CGCCGCCCGT GCGTTGGGAG AAATCAAAGA CACCCGCGCC ATCAAGCCGC TCATCGCCGC GCTCAAGGAT TGGCATAGCA ATGTGCGCAA AGCCGCCGCC AAAGCGCTGG TCAAAATCGG CACACCAGCC GTGGAGCTGC TCATCGCCGC GCTCAGGGAC GAGGACGAGA GGGTGCGCCA GGCCGCCGCC GAAGCGCTGG ATCATCTGGG CTGGAAACCT GCCCGAGACG AAAATGCAGC GTGGTATTGG GCAACCAAGG GTAAGTGGAA CGCGTGCGTG GATATCGGCG CACCTGCTGT GCAGCCACTT ATTACAAGCC TGCGCACAAA TGACCCCTAC ATGCGTAGAG ACGTGGCGCA GGCATTGGTA AGACTGTATC AAAAGGCCGG TATTTCTCAA GAGGACCGAG AGCAAATTTT GGCAGTTCGT GACCGCTTGA TACGTATTCA TGATGACCAA CCCCATATAG ATCGATCTAG GGATTGCTCT AGACCACATA CTGATACAGA GCTGGGCGTG GAATTCCCCT TATGA
|
Protein sequence | MKSQRDVQAL IEVLGHAKDE QVRQAALEAL IEIALSLAAE KEAREIEDFF QRLLQAGTPG EASPQTPPED ENAARYQAAI QALGEIGATA IAQLLSALKN PQKRRYAAQI LEQIGWQPGL DENGALYWIA KGEWDRCAAL GGIAIAPLLL VLQEQDNETR RAAAHVLGQI GDARAVEPLL DLLVDQDPKV RRAAIEAIGR IGDPHTVEAL ELALQDREGS VRLAATRALG QIGSPRAVEA LIATLQGRFA HRLLLEELDK MKSEDAHQIA ALLSSWDENS APVHLLLQQA LKAASHPFVV YLLTTILTEW KNERASAIEA LGRIGDSRAV EPLIAALKDE DVNVRWPAAR ALGEIKDTRA IKPLIAALKD WHSNVRKAAA KALVKIGTPA VELLIAALRD EDERVRQAAA EALDHLGWKP ARDENAAWYW ATKGKWNACV DIGAPAVQPL ITSLRTNDPY MRRDVAQALV RLYQKAGISQ EDREQILAVR DRLIRIHDDQ PHIDRSRDCS RPHTDTELGV EFPL
|
| |
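The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The sketch below strips the whitespace, computes the GC fraction, and translates the reading frame. The helper names are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ only in which codons may act as start codons). Alternative start codons are not handled here; this gene begins with ATG, so that does not affect it.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate frame 1 up to, and not including, the first stop codon."""
    seq = clean_sequence(raw)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
prefix = "ATGAAAAGTC AACGAGATGT GCAAGCGCTA"
print(translate(prefix))  # MKSQRDVQAL
```

The translated prefix matches the first block of the protein sequence. Passing the full Gene sequence field to `gc_fraction` and `translate` allows the same comparison against the GC content and Protein sequence fields.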