Gene Information |
Locus tag | Pars_1514 |
Symbol | |
ID | 5055202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1372013 |
End bp | 1373413 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640469056 |
Product | methyltransferase small |
Protein accession | YP_001153722 |
Protein GI | 145591720 |
COG category | [R] General function prediction only |
COG ID | [COG4123] Predicted O-methyltransferase |
TIGRFAM ID | |
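The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only a consistency check, and it assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the gene is a stop codon that is not counted in Protein Length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1372013
end_bp = 1373413

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1401 0 466
```

Under those assumptions the result agrees with the Gene Length (1401 bp) and Protein Length (466 aa) fields.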
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.271049 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGGAGGA GGGGTTTGGT TGCCGATAGG GGGAGGGTGG CGACGCCGCC TGATTTGGCT TTTTACATGG TGGAGAAGCT TTTTAGGGGG GCGCCGCCGG GTGGCGGTAG CAGGGTGTTG GATGCCGGAT GTGGCCTGGG GGTGTTCATA GACGCGGTGT TGAGGTGGTG TAGGGGGCGT TGCGCCGAGC TTCCTGAGGT GGTGGGGGTG GAGGTGGACC CGGCGCTTGC CGAGGCGGCG AGGCGGAGGT TTGCCGGGGA GCGGGTGAGG ATTGTGCGGG GTGACTTCTT GCTGATGTCG GCGGGGGAGC TCGGCGGCTT GTTCGACTAT GTGATCGGCA ACCCGCCCTA CGTCTCTTAC GAATACATCG ACCCGCCGAA GAGGGAGCTG TACAAGAGGC TGTTCACCAC GGCGGTGGGG CGGTTTGATT TGTACATGTT GTTTTTCGAA AAGGCGCTGT CGTTGCTGAA GCCGGGGGGT AGGCTCGTAT TCGTCACGCC GGAGAAGTAC CTCTACGTGC TGTCGGCTGT TGCGCTGAGG AGGTTGCTGG CCAGCTACAG GGTGGAGGAG GTGGAGCTTA TCCGGGAGGA CGCCTTCGGG GGTGTGTTGG CCTACCCGGC GATCACCGTG GTGGTGAAGG AGGCGCCTTC CTTGACGACT ATAAGGCTTA GGGATGGGCG GGCGGCGAGG GTGGCGTTGC CGAGGGACGG CTCTCCGTGG CTCTCCGCCA TAGCCACGGC CAAGTTAAGG ACGCCCTACA GCCTCGGCGA CTTGGTTTTG AGGATAAGCC CGGGGGTCGC CACTGGCCGG GATGACGTTT TCGTGATCCC AAAACGCGCC TTGTCAAAGG AGCTTGAGCC GTTTGCCTAC CCAACGGTGG GTGGGAGGGA GCTCTCCGCC TTTGCCCCCG GCTCCGTTGT GGACTATGAC AAGTTGGCCC ACGTCATCCT CATCCCATAC GACAGAGGCG GCCGGCTCCT GGACGAGGGG GAGGCAAAGC CGCTTTTGGA CTACTTGTCT AGGTGGCGGC GGGTGCTGGA GTCGAGATAT GCGGTTAGGG CGGAGGGTAA GAGGTGGTAC GCCTTTCACG AAGACCCGCC TATGGGCGAT CTGCTCCGGC CTAAGATACT CTGGAGGGAC ATAGCTAAGG AGCCCGCCTT CTACATAGAC GCGAAGGGCC TCCTCATCCC AAAGCACACC GTTTACTACC TAGTCCCCAA GGACCCCGGC ATGTTGCCCA GGCTGGCCGA GTACCTCAAC AGCGCCGAGG CCAAGAGGTG GCTGATGGAG CATTGCCAGA GGGCGGCCAA CGGCTACTTG AGGCTCCAGA CCCACGTGCT TAGGCAACTC CCAGTGCCTC CGGAGGTGGT GGGGGAGGGG CATGGCCTTG GGAGAGTGTA G
|
Protein sequence | MGRRGLVADR GRVATPPDLA FYMVEKLFRG APPGGGSRVL DAGCGLGVFI DAVLRWCRGR CAELPEVVGV EVDPALAEAA RRRFAGERVR IVRGDFLLMS AGELGGLFDY VIGNPPYVSY EYIDPPKREL YKRLFTTAVG RFDLYMLFFE KALSLLKPGG RLVFVTPEKY LYVLSAVALR RLLASYRVEE VELIREDAFG GVLAYPAITV VVKEAPSLTT IRLRDGRAAR VALPRDGSPW LSAIATAKLR TPYSLGDLVL RISPGVATGR DDVFVIPKRA LSKELEPFAY PTVGGRELSA FAPGSVVDYD KLAHVILIPY DRGGRLLDEG EAKPLLDYLS RWRRVLESRY AVRAEGKRWY AFHEDPPMGD LLRPKILWRD IAKEPAFYID AKGLLIPKHT VYYLVPKDPG MLPRLAEYLN SAEAKRWLME HCQRAANGYL RLQTHVLRQL PVPPEVVGEG HGLGRV
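The Gene sequence, Protein sequence, Translation table and GC content fields are related, and the relationship can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, and it assumes the input starts at the initiation codon, which is then read as M (here the gene starts with GTG). The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares); "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a CDS that begins at its start codon (an assumption)."""
    seq = clean(seq)
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))
    if protein.endswith("*"):
        protein = protein[:-1]  # drop the terminal stop codon
    if protein:
        protein = "M" + protein[1:]  # the initiator codon is read as Met
    return protein


# First 30 bases of the gene sequence, copied from the record above.
gene_prefix = "GTGGGGAGGA GGGGTTTGGT TGCCGATAGG"
print(translate(gene_prefix))  # prints: MGRRGLVADR
```

The ten residues obtained from the first 30 bases match the start of the Protein sequence field. Passing the full Gene sequence string to `gc_content` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.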