Gene Information |
Locus tag | Rcas_4090 |
Symbol | |
ID | 5541601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 5301095 |
End bp | 5302285 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640896202 |
Product | PUA domain-containing protein |
Protein accession | YP_001434140 |
Protein GI | 156744011 |
COG category | [R] General function prediction only |
COG ID | [COG1092] Predicted SAM-dependent methyltransferases |
TIGRFAM ID | |
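The length fields in this table can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, although it is consistent with the values listed) and that the CDS ends with a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 5301095
end_bp = 5302285

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # matches the Gene Length field
print(protein_length, "aa")  # matches the Protein Length field
```

Under these assumptions the result agrees with the listed Gene Length (1191 bp) and Protein Length (396 aa).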
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.337957 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.372554 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAATCG TCATATTGCA TCCCGGTAAA GAACGACCTG TGGTTCAACG GCACCCATGG GTGTTTTCTG GAGCGATTGC GCGCATTCAG GGTCGGTTCC CCGATCGCGG CGAGGTGGTC GATGTGCAGG CGGCCAGCGG CGAGTGGCTG GCGCGCGGTT GCTGGAGTGA CGGATCGCAG ATCCGCGTTC GCCTGTTCAC GTGGAATCCG GATGAGCCGA TTGATGACGC ACTGATCCGG CGCCGTATCG AGCGCGCCAT TGACGGTCGT CGCAGACTGG GCATGCTCAC CGACGATGGG GCGTGTCGCC TGGTCTATGC CGAATCCGAC GGTATTCCCG GCCTGATCGT CGATTACTAC GCCGGGTTTT TGGTGGTGCA ACTGCTGACT CAGGCGATGG CGCTGCGCCG TGCGGCGATC ACGCGCGTGC TGGCGGAGAC GCTTGTGCCG CGCGGCATCT ACGAGCGGAG CGAATCTGAC GTTCGTGAGA AGGAAGGGTT GCCGCCAGCG TCGGGCGTAC TGTGGGGCGA AACGCCGCCC GATTGTGTGC ATGTGCGGTT GCCCGGCGAT CTCTGGCACG CGGTCGATCT CCGCACCGGT CAAAAAACCG GCGCTTACCT CGACCAAGCG TTCAATCGGT GGCGGGTCGC CATGCATTGC ACCGGCGCAG AGATGCTGGA CTGCTTCTGC TACGCTGGCG GCTTTACCAT TGCGGCAGCG CGTGCTGGCG CTCGTCACGC AATTGCTCTC GATACCAGCG AGTCCGCGCT TGAGATGCTC CGCGCTGGGC TTGCCCTCAA CGCCATTGCT ACCCCGGTCG AAACGGTTGC GGCGGATGTG TTTCAGATGT TACGGCGTTA CCGCGATGAA CAACGCCGCT TTGACGTCGT TGTGCTCGAC CCGCCCAAAT TTGCCCATAC GCAGGCGCAG GTCGAACGGG CAACCCGTGG GTATAAGGAC ATCAATGTGC TGGCAATGCA GTTGCTGCGC CCCTGCGGGA TTCTGGCGAC GTTCTCCTGC TCCGGTCTGG TGTCGAGCGA TCTGTTTCAG AAGATTGTCT TTGGTGCTGC GCTCGATGCG CGCCGTGAAG CGCAGATCAT CGAGCGGTTA ACGCAAAGCC CCGATCATCC GGTGTTGCTG ACATTTCCCG AAGGAGCATA TCTGAAAGGT CTGATCTGTC GTGTCTGGTA G
|
Protein sequence | MAIVILHPGK ERPVVQRHPW VFSGAIARIQ GRFPDRGEVV DVQAASGEWL ARGCWSDGSQ IRVRLFTWNP DEPIDDALIR RRIERAIDGR RRLGMLTDDG ACRLVYAESD GIPGLIVDYY AGFLVVQLLT QAMALRRAAI TRVLAETLVP RGIYERSESD VREKEGLPPA SGVLWGETPP DCVHVRLPGD LWHAVDLRTG QKTGAYLDQA FNRWRVAMHC TGAEMLDCFC YAGGFTIAAA RAGARHAIAL DTSESALEML RAGLALNAIA TPVETVAADV FQMLRRYRDE QRRFDVVVLD PPKFAHTQAQ VERATRGYKD INVLAMQLLR PCGILATFSC SGLVSSDLFQ KIVFGAALDA RREAQIIERL TQSPDHPVLL TFPEGAYLKG LICRVW
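The relationship between the two sequences can be illustrated with a minimal translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the - strand) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. The helper `translate` and the names `CODON_TABLE`, `gene_head`, and `gene_tail` are hypothetical, and only the first and last blocks of the sequence are used.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 21 bases of the gene sequence; both fragments are in frame.
gene_head = "ATGGCAATCG TCATATTGCA TCCCGGTAAA"
gene_tail = "CTGATCTGTC GTGTCTGGTA G"

print(translate(gene_head))  # MAIVILHPGK
print(translate(gene_tail))  # LICRVW
```

The two fragments reproduce the first and last blocks of the protein sequence, and the trailing TAG codon accounts for the one codon that is not represented in the 396 aa product.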