Gene Arth_1672 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1672 
Symbol 
ID4445807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1866475 
End bp1868094 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content71% 
IMG OID639689493 
ProductFmu (Sun) domain-containing protein 
Protein accessionYP_831166 
Protein GI116670233 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases
[COG0781] Transcription termination factor 
TIGRFAM ID[TIGR00563] ribosomal RNA small subunit methyltransferase RsmB 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.280936 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAGT CCGGAACGGG CGGCAGCGGC CGCGGCAGGG GTCCCCGTCA GGGGGGAGCA 
AGCGGCGGCG GGCAGTCCGG CGGGTCTTCC TCCGGTCGCC GCCAAGGCAG CCAGCGCGAC
GCCCAGGGCC GGGAGCGCAA CCGCGGGCCG AAGCGGAATT TCAGCGAGAA CGCCCCTTCG
CAGCGTACGC GGCGTGCCGA CCCCGCCCGC CTGGTGGCTT TTGAAGTCCT GCGTGCCGTC
GCATCGGAAG ACGCCTACGC AAACCTCGTC CTGCCGGCCC GGATCCGCCA CCACGGCCTG
GACAAGCGGG ACGCCGGGTT CGCTACCGAA CTGAGCTACG GGGCGCTGCG CGGGCAGGGA
ACCTATGACG CCGTCCTGGC CCGCTGCGTC GACCGGCCGC TGGACCAGCT GGATCCTGCC
ATCCTGGATG CCCTGCGGAT CGGCGCCCAC CAGCTGCTGG CCATGCGAGT GCCCGCCCAC
GCCGCCCTGG ACCAGACCGT TGGACTGGCC CGTGCGGTCA TCGGCGCCGG CCCATCAGCC
TTGATCAACG CCGTCCTGCG CAAGGTCTCG GCCCACACGC TGGACGAGTG GCTGGAGCTG
CTGCTCAGCG ACGAGCAGGA CGAAACCCGG GTGGCCTCCA TCCGCTACGC CCACCCGGAG
TGGATTGTCC GCGCCCTGCG GCAGTCGCTG GTGGCGCACG GACGCCCGGT CACCGAAATC
AACGAACTCC TTGAAGCTGA CAACGCTGCG CCGGTAGTGA ACCTTGTTGC GCTGCCCGGA
CTGGGGAGCC TGGACGAAGC CCTGGAGGGC GGGGCCACGG CCGGTGAACT CGTGGAAGGC
TCGGCGCTCT CCAGCGGCGG GGACCTCGGC CGGCTGGCGT CGGTGCGTGA AGGCAGCACC
AGGGTGCAGG ACGTGGGCTC GCAGCTGGTT GCCCGCGCCA TGGCTGCCGT GGACCTGAAT
TCCGGGGATC TGCATGCCCC GGGTCCGGAG AGTGCGGACG GACAGTCCGG CGCCAAGGGC
GGCGAAAAAT GGCTGGACCT CTGCGCCGGA CCGGGCGGCA AGGCCGCCCT GCTGGGTGCC
CTCGCCCGGC AGCAGGGCGC AACCCTGCTG GCCAACGAAC CCGCCCCGCA CCGTGCCAAG
CTGGTCCGGC AGGCCCTGGC CGCGGTCCCT CACGAGGTCT GGCATGTCCG GACCGGGGAC
GGCCGCGACG TCGGAACTGA AATGGCGGGG ACTTTCGACC GCGTTCTCGT CGACGTACCC
TGCAGCGGAC TTGGCGCCTT GCGGCGCAGG CCGGAGTCGC GGTGGCGGCG CACACCCAAG
GACCTCGCGG ACCTGGGCCC GCTCCAGCGC GAACTGCTTA AGTCAGCCTT GGATGCCGTC
AGGCCCGGCG GCGTGGTGGC CTACGTGACG TGCTCGCCGC ACCCCGCCGA AACCACCGCC
GTCGTGACCG ACGCGCTGCG CAAACGCGAC GACCTGGAAC TGCTCGATGC CGGCGCCGCC
TTGGACAAAG TCAGCCTGCC CGGGCATCTT GAGGCCGGCC ACGAAATGAC GGCCCAGCTG
TGGCCGCATG TGCACCGGAC TGACGCCATG TTCCTGGCCC TCATCCACAA GAAATCCTGA
 
Protein sequence
MSESGTGGSG RGRGPRQGGA SGGGQSGGSS SGRRQGSQRD AQGRERNRGP KRNFSENAPS 
QRTRRADPAR LVAFEVLRAV ASEDAYANLV LPARIRHHGL DKRDAGFATE LSYGALRGQG
TYDAVLARCV DRPLDQLDPA ILDALRIGAH QLLAMRVPAH AALDQTVGLA RAVIGAGPSA
LINAVLRKVS AHTLDEWLEL LLSDEQDETR VASIRYAHPE WIVRALRQSL VAHGRPVTEI
NELLEADNAA PVVNLVALPG LGSLDEALEG GATAGELVEG SALSSGGDLG RLASVREGST
RVQDVGSQLV ARAMAAVDLN SGDLHAPGPE SADGQSGAKG GEKWLDLCAG PGGKAALLGA
LARQQGATLL ANEPAPHRAK LVRQALAAVP HEVWHVRTGD GRDVGTEMAG TFDRVLVDVP
CSGLGALRRR PESRWRRTPK DLADLGPLQR ELLKSALDAV RPGGVVAYVT CSPHPAETTA
VVTDALRKRD DLELLDAGAA LDKVSLPGHL EAGHEMTAQL WPHVHRTDAM FLALIHKKS