Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3789 |
Symbol | |
ID | 5210771 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 4739925 |
End bp | 4741202 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640597385 |
Product | TPR repeat-containing protein |
Protein accession | YP_001278093 |
Protein GI | 148657888 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.302525 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACAGACG AAAGCACAAG TGCATTTCAC CGTCGTGTTG CCGAAGAACT GGCTGCACGC TATGGCGACG ACGAACGCGC ACCCGAAGGC TGGCGCGAGA TACAAACCTG GCACTGGGAA CAGGCGGGTG TCTATGCTGC CGCCGTCGAA ACGGCAATGG CGGTTGCCGA GACGCGTATT GCGCGTCTCG ATTTCAGCGG TGCGCGCCGC TGGACCGAGC GGGTACTGGC GTTGATCGAG CGTCTCGATG TTGCGGAGCA GCGCTCCTAC GAACTGCGCG CCTGCGCCCT GGCGCTTGCG GTGCTCGAAT TTGGCGGTCA GTATCGCGAA GGTCTCGAAT ACGCCCGTCG CATGCTGCGG GCGGCGCAAC GTCTCAAGAA TGCCGAGGCG GAAGCGCGCG CGCTGCTGGG GATCGGTCGC ATGCATCGGG AATTGGGGCA ACTGACGCAG GCAGAGGCGG CGCTGAACCT TGCACGTGAT CGCGCTGCCC GCGACGATCT GAGCGATCTG GAAGCCGAAG CCCGCTTGCA TCTGGCGAAG GTGCGTCAGT TGCAGGGGCG GCATCTGGAA GCGTTGCAGG AACTGCAACT GGCGCGCGAG GAGCACGAAA CCGGCGACGA CAAACTCAAA CTGGCGCGGG TGCTCACCAG TATTGGTGAT GTCTATCGGG TGTTGGGATC GAGTCGTGAG GCGTTCACGT TCTACACCCG CGCTCTGGCG CTGGAGCAGG GGCGCGGCAG CCTGATCGGG CAGGCGATAC TGAAGGATAA ACTCGCCCTG ACGTTACTCG ATCAAAACAG AGCTGCCGAT GCGCTGGCGT CGGCGGAAGA GAGTCTCGAA CTACGTCGGC GGATCAACGA TATTGTCGGT CAGGCGCGCT CGTATACCGT TATTGGCGCG GTGCTGAGCC GGCTTGGGCG TCACGAACAG GCGATGGAGT CGTACAAGCG CGCCTGTGAA TTGCAGGAAC TGACGCAGAA TCCGCGCGGT CAGTGCATTG CGCTCATCCA TCTTGGCGAT GCCGCACGCG CACTGCGTCA ACCGCAAATT GCAATTGCAC ATTACGAACG TGCGCTTGTC CTGGCGCGTC GTGACCGCGA CTGGATTGGC ATTACGCGCG CGCTTGAACG TCTCGGCGAC CTGCATGCCG CTCAGGAAGA TCGCACGCAG GCGCTGGCAC GCTGGAACGA GGCGCTCCAC ATCCGCGAGA CTCTGCGCCA CGTTGATGAA GCGGCAGCGC TGCGCATGCG CATTCGTTCA CTACAGACGA CGGCATAA
|
Protein sequence | MTDESTSAFH RRVAEELAAR YGDDERAPEG WREIQTWHWE QAGVYAAAVE TAMAVAETRI ARLDFSGARR WTERVLALIE RLDVAEQRSY ELRACALALA VLEFGGQYRE GLEYARRMLR AAQRLKNAEA EARALLGIGR MHRELGQLTQ AEAALNLARD RAARDDLSDL EAEARLHLAK VRQLQGRHLE ALQELQLARE EHETGDDKLK LARVLTSIGD VYRVLGSSRE AFTFYTRALA LEQGRGSLIG QAILKDKLAL TLLDQNRAAD ALASAEESLE LRRRINDIVG QARSYTVIGA VLSRLGRHEQ AMESYKRACE LQELTQNPRG QCIALIHLGD AARALRQPQI AIAHYERALV LARRDRDWIG ITRALERLGD LHAAQEDRTQ ALARWNEALH IRETLRHVDE AAALRMRIRS LQTTA
|
| |