Gene Information |
Locus tag | Arth_1672 |
Symbol | |
ID | 4445807 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 1866475 |
End bp | 1868094 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639689493 |
Product | Fmu (Sun) domain-containing protein |
Protein accession | YP_831166 |
Protein GI | 116670233 |
COG category | [J] Translation, ribosomal structure and biogenesis; [K] Transcription |
COG ID | [COG0144] tRNA and rRNA cytosine-C5-methylases; [COG0781] Transcription termination factor |
TIGRFAM ID | [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB |
|
|
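The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that does not contribute a residue. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1866475
end_bp = 1868094

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (the record says so) and ends in a stop codon,
# so the protein has one residue fewer than the number of codons.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the reported Gene Length (1620 bp) and Protein Length (539 aa).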
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.280936 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAGT CCGGAACGGG CGGCAGCGGC CGCGGCAGGG GTCCCCGTCA GGGGGGAGCA AGCGGCGGCG GGCAGTCCGG CGGGTCTTCC TCCGGTCGCC GCCAAGGCAG CCAGCGCGAC GCCCAGGGCC GGGAGCGCAA CCGCGGGCCG AAGCGGAATT TCAGCGAGAA CGCCCCTTCG CAGCGTACGC GGCGTGCCGA CCCCGCCCGC CTGGTGGCTT TTGAAGTCCT GCGTGCCGTC GCATCGGAAG ACGCCTACGC AAACCTCGTC CTGCCGGCCC GGATCCGCCA CCACGGCCTG GACAAGCGGG ACGCCGGGTT CGCTACCGAA CTGAGCTACG GGGCGCTGCG CGGGCAGGGA ACCTATGACG CCGTCCTGGC CCGCTGCGTC GACCGGCCGC TGGACCAGCT GGATCCTGCC ATCCTGGATG CCCTGCGGAT CGGCGCCCAC CAGCTGCTGG CCATGCGAGT GCCCGCCCAC GCCGCCCTGG ACCAGACCGT TGGACTGGCC CGTGCGGTCA TCGGCGCCGG CCCATCAGCC TTGATCAACG CCGTCCTGCG CAAGGTCTCG GCCCACACGC TGGACGAGTG GCTGGAGCTG CTGCTCAGCG ACGAGCAGGA CGAAACCCGG GTGGCCTCCA TCCGCTACGC CCACCCGGAG TGGATTGTCC GCGCCCTGCG GCAGTCGCTG GTGGCGCACG GACGCCCGGT CACCGAAATC AACGAACTCC TTGAAGCTGA CAACGCTGCG CCGGTAGTGA ACCTTGTTGC GCTGCCCGGA CTGGGGAGCC TGGACGAAGC CCTGGAGGGC GGGGCCACGG CCGGTGAACT CGTGGAAGGC TCGGCGCTCT CCAGCGGCGG GGACCTCGGC CGGCTGGCGT CGGTGCGTGA AGGCAGCACC AGGGTGCAGG ACGTGGGCTC GCAGCTGGTT GCCCGCGCCA TGGCTGCCGT GGACCTGAAT TCCGGGGATC TGCATGCCCC GGGTCCGGAG AGTGCGGACG GACAGTCCGG CGCCAAGGGC GGCGAAAAAT GGCTGGACCT CTGCGCCGGA CCGGGCGGCA AGGCCGCCCT GCTGGGTGCC CTCGCCCGGC AGCAGGGCGC AACCCTGCTG GCCAACGAAC CCGCCCCGCA CCGTGCCAAG CTGGTCCGGC AGGCCCTGGC CGCGGTCCCT CACGAGGTCT GGCATGTCCG GACCGGGGAC GGCCGCGACG TCGGAACTGA AATGGCGGGG ACTTTCGACC GCGTTCTCGT CGACGTACCC TGCAGCGGAC TTGGCGCCTT GCGGCGCAGG CCGGAGTCGC GGTGGCGGCG CACACCCAAG GACCTCGCGG ACCTGGGCCC GCTCCAGCGC GAACTGCTTA AGTCAGCCTT GGATGCCGTC AGGCCCGGCG GCGTGGTGGC CTACGTGACG TGCTCGCCGC ACCCCGCCGA AACCACCGCC GTCGTGACCG ACGCGCTGCG CAAACGCGAC GACCTGGAAC TGCTCGATGC CGGCGCCGCC TTGGACAAAG TCAGCCTGCC CGGGCATCTT GAGGCCGGCC ACGAAATGAC GGCCCAGCTG TGGCCGCATG TGCACCGGAC TGACGCCATG TTCCTGGCCC TCATCCACAA GAAATCCTGA
|
Protein sequence | MSESGTGGSG RGRGPRQGGA SGGGQSGGSS SGRRQGSQRD AQGRERNRGP KRNFSENAPS QRTRRADPAR LVAFEVLRAV ASEDAYANLV LPARIRHHGL DKRDAGFATE LSYGALRGQG TYDAVLARCV DRPLDQLDPA ILDALRIGAH QLLAMRVPAH AALDQTVGLA RAVIGAGPSA LINAVLRKVS AHTLDEWLEL LLSDEQDETR VASIRYAHPE WIVRALRQSL VAHGRPVTEI NELLEADNAA PVVNLVALPG LGSLDEALEG GATAGELVEG SALSSGGDLG RLASVREGST RVQDVGSQLV ARAMAAVDLN SGDLHAPGPE SADGQSGAKG GEKWLDLCAG PGGKAALLGA LARQQGATLL ANEPAPHRAK LVRQALAAVP HEVWHVRTGD GRDVGTEMAG TFDRVLVDVP CSGLGALRRR PESRWRRTPK DLADLGPLQR ELLKSALDAV RPGGVVAYVT CSPHPAETTA VVTDALRKRD DLELLDAGAA LDKVSLPGHL EAGHEMTAQL WPHVHRTDAM FLALIHKKS
|
| |
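The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method, of how one might strip the spacing, compute the GC fraction to compare against the reported GC content, and translate the gene sequence to compare against the protein sequence. The function names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one; translation table 11 uses the same codon assignments and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG.

```python
def clean(seq: str) -> str:
    """Remove the spacing between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# Standard codon table in the conventional T, C, A, G ordering.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: _AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(_BASES)
    for j, b in enumerate(_BASES)
    for k, c in enumerate(_BASES)
}


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T are rendered as 'X'.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE.get(dna[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGAGCGAGT CCGGAACGGG CGGCAGCGGC"
print(translate(first_blocks))  # MSESGTGGSG, the first block of the protein sequence
```

Passing the full Gene sequence field to `gc_fraction` and `translate` gives values that can be compared with the reported GC content and the Protein sequence field.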