Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmar_0475 |
Symbol | |
ID | 8567109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | + |
Start bp | 523098 |
End bp | 524675 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | |
Product | Alpha-N-arabinofuranosidase |
Protein accession | YP_003289765 |
Protein GI | 268316046 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.726573 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGGAC TGTTTTCGAA AGGGCTTTTG CTGCTGATCG TGCTGGCCTG CACGGGCAGC GTCTTGCAGG CACAACCGAC GCACCGGCTC GTCATTCACG CGGATCAGGG TCGGGTGCAG ATCAGCCGGC ACATCTACGG GCATTTTATC GAACACCTGG GCTACGGCAT CTATGGCGGT TTCTGGCAGC GGGACGCGCA GGGGCGCTGG CACCTGCGGC AGGACATTGT CGACGCGCTC CGGCGCATTC GCATCCCGAA CCTGCGCTGG CCGGGCGGCT GCTTTGCCGA TCTGTACCAC TGGAAAGACG GCATCGGTCC GCAGGAGCAG CGCCGTCCGA TCCTGAATGC CTTCTGGGGA CAGGTGGTCG AGGATAACAG CTTTGGCACG CACGAATTCA TGGCGCTGGT CGAAGCACTG GGCGCCGAGC CCTACATCGC CGGGAACGTG GGCAGCGGCA CGCCCCGCGA AATGGCCGAA TGGGTCGAAT ACCTGACGGC CGAAGACGGT CCCATGGCCC GCCTGCGCAA ACAGAACGGC CGGGAACAGC CCTGGCGTGT GCCCTTCTGG GGCGTGGGCA ATGAAAGCTG GGGCTGCGGC GGCAACATGG ATCCTGAGTA CTATGCGGAC CTCTACCGCC GCTTTGCCAC CTACCTGTTC AACTATGGCG GCAATCGGCT CTACAAAATT GCCGCCGGGC CGGCCGATGC CGACACGACC TGGACCAGCG TGCTCATGCG CGACGTCTTC CGTCGCAATC CCGGGCTGAT GCAGGGCATT TCGGTGCACT ATTACACCTG GATTTCGAAA ACGGGCCGCT GGAACGATAA AGAACCGGCC ACCGGCTTTG ACGAGTGGGG CTGGTTCAAA GGCTTGCAGA AAGCGCTCTT TCTGGACGAA GTGCTGCGCC GGCACGAAGC CGTCATGGAT CGCTACGATC CCGAAAAGCG GGTGGGGCTG ATCGTGGACG AGTGGGGCAT GTGGCATGCG CCCGAGCCCG GCTCGAACCC GGCCTTTCTG GTGCAGCAGA ACACGCTGCG CGACGCGCTC GTGGCGGCCG TTTCGCTGAA CATCTTCAAC CACCACGCCG AACGCGTCAG GATGGCCAAT CTGGCGCAGA CGATCAACGT GCTGCAGGCA CTGATCCTCA CGCGAGAAGG CGAGGAAACG ATCGTGCTGA CGCCCACCTA TCACGTGTTC GACCTCTACA AAGAACATCA GGACGCGACG CTATTGCCTG TCGAGCTGGA GGCCGGGGAA TACCGCTACG GCGACGAGGC CATTCCGGCG CTGAATGCTT CGGCTTCGCG CAATGCGGAA GGGGCCGTGC ATCTGACGAT TGCCAATCTG GACCCGCACC AGGAGCGGGT GGTGCAGGCC CGACTGGAGG GCATCCAACC GGCCCGCGTG GAAGGACGTG TGCTGACCGC CGAGGCAATG GATGCGCACA ACACGTTCGA AGCGCCCGAC CGGGTGCGAC CGGTCGCCTT CACCGCCTAC CGTGCCGTGG GCGATGGCGT CTATGAGCTA CGCCTTCCGG CCAGGTCGGT CGTGGCGCTC ACTTTCTATC CGCGCTGA
|
Protein sequence | MRGLFSKGLL LLIVLACTGS VLQAQPTHRL VIHADQGRVQ ISRHIYGHFI EHLGYGIYGG FWQRDAQGRW HLRQDIVDAL RRIRIPNLRW PGGCFADLYH WKDGIGPQEQ RRPILNAFWG QVVEDNSFGT HEFMALVEAL GAEPYIAGNV GSGTPREMAE WVEYLTAEDG PMARLRKQNG REQPWRVPFW GVGNESWGCG GNMDPEYYAD LYRRFATYLF NYGGNRLYKI AAGPADADTT WTSVLMRDVF RRNPGLMQGI SVHYYTWISK TGRWNDKEPA TGFDEWGWFK GLQKALFLDE VLRRHEAVMD RYDPEKRVGL IVDEWGMWHA PEPGSNPAFL VQQNTLRDAL VAAVSLNIFN HHAERVRMAN LAQTINVLQA LILTREGEET IVLTPTYHVF DLYKEHQDAT LLPVELEAGE YRYGDEAIPA LNASASRNAE GAVHLTIANL DPHQERVVQA RLEGIQPARV EGRVLTAEAM DAHNTFEAPD RVRPVAFTAY RAVGDGVYEL RLPARSVVAL TFYPR
|
| |