Gene Sare_4233 details

Gene Information

Locus tag: Sare_4233
Symbol:
ID: 5704404
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4806271
End bp: 4807506
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 75%
IMG OID: 641273652
Product: hypothetical protein
Protein accession: YP_001539005
Protein GI: 159039752
COG category: [R] General function prediction only
COG ID: [COG4076] Predicted RNA methylase
TIGRFAM ID:
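
The coordinate and length fields above are related by simple arithmetic, which can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon is not translated


# Values taken from the record above.
length_bp = gene_length(4806271, 4807506)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the sketch reproduces the listed gene length of 1236 bp and protein length of 411 aa.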


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.118857
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACCTGG GACAGCTGGC GGAGCTGCGT ACGCCCGCGG GTTCGGCCGC GCTCGCGGCG 
GCGACCGAGG TGGCCGGTGC TGACCCGCTG GCCGCGGCGA TGGCGCTACG GTCCGCCGGG
CTCCCGGCCG GGCTGGCGGC GGCGGCGCTG ACCCAGGCCG AGCTGCGCCG CCGCGCGATG
GGCAAGTTCG GTCCGGCGGC GGCTGACATG TTCTTCACCC GCGCCGGCCT GGAACAGGCC
ACCCGTCGGG TCGTGGCGCG GCGTCGCGCC GACCGGCTGC GGGCCGCCGG AGTCCGAACC
CTGGCCGACC TGGGCTGCGG CCTCGGGGCC GATGCCCTCG CGGCAGCCCA CGCCGGCCTG
CGGGTGTATG GCGTGGAGGC CGATCCGCTG ACCGCCGCGA TAGCCGCCGC GAACGCCGAG
GCGGCCGGAC TCACCGAACG GTTCACCGTC GACCATGGGG ACGCGACCGC CTTCGACATC
GACCGCGTGG ACGGCGTCTT CTGCGACCCC GCCCGGCGGC GCACCGGCAC CGGGCGGCGG
ATCTTCGATC CGAGCGCGTA CGCGCCACCC TGGGACTTCG TGGTCGGGCT CGCTGGGCGG
GTGCCGCGCA CGGTGGTGAA GGTCGCGCCC GGCCTTGATC ACCAGTTGAT CCCGGCCGGC
GCGGAGGCGG AGTGGGTGAG CGTCCACGGG GACCTGGTCG AGGCCACCCT GTGGTGCGGC
GAACTCGCGA CAGTGGCGCG CCGCGCGACC GTGCTGCGGG AAGCTTCCCC CGGCGACGCC
TCCAGCAGCG CCGGTTCTGC CGCCCGCCGC GCGACAGCGC ACGAACTGAC TGGTTCCACC
GTCGCCGAGG CGCCGGTCGG TCCGGTCCGC CGCTACGTCT ACGACCCGGA CCCGGCGGTG
GTCCGCGCGC ACCTCGTCGC CGAACTGGCC GGAATGCTGG ACGCCAACCT TGCCGACCCG
ACGATCGCCT ACCTGTACGC CGACACTCCG ACGCCGACAC CCTTCGCCCG CTGCTTGGAG
ATCACCGACG TGCTGCCGTT CTCGCTGAAG CGACTTCGTG CCCTGCTGCG CGAGCGACGC
GTCGGCCGGG TGGAGATCCG CAAGCGTGGC TCGGCCCTCG AGCCGGAGCG ACTCCGCCAC
GATCTGCGCT TGACCGGCGA CCAGCCGGCC AGCCTCGTGC TGACCCGCGT GGGCGGTGCC
CCCACGGTGC TGATCTGCCG TCCGCCCACC AGCTAG
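
The GC content field can be recomputed from a sequence laid out in blocks of ten bases, as above. The following is a minimal sketch, assuming the sequence contains only A, C, G and T plus whitespace; `gc_content` is a hypothetical helper name.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()  # drop the block and line separators
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in seq)
    return 100.0 * gc / len(seq)


# Usage on the first line of the gene sequence above.
first_line = "GTGGACCTGG GACAGCTGGC GGAGCTGCGT ACGCCCGCGG GTTCGGCCGC GCTCGCGGCG"
print(round(gc_content(first_line), 1))
```

Passing the full gene sequence instead of one line gives the value to compare with the GC content field.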
 
Protein sequence
MDLGQLAELR TPAGSAALAA ATEVAGADPL AAAMALRSAG LPAGLAAAAL TQAELRRRAM 
GKFGPAAADM FFTRAGLEQA TRRVVARRRA DRLRAAGVRT LADLGCGLGA DALAAAHAGL
RVYGVEADPL TAAIAAANAE AAGLTERFTV DHGDATAFDI DRVDGVFCDP ARRRTGTGRR
IFDPSAYAPP WDFVVGLAGR VPRTVVKVAP GLDHQLIPAG AEAEWVSVHG DLVEATLWCG
ELATVARRAT VLREASPGDA SSSAGSAARR ATAHELTGST VAEAPVGPVR RYVYDPDPAV
VRAHLVAELA GMLDANLADP TIAYLYADTP TPTPFARCLE ITDVLPFSLK RLRALLRERR
VGRVEIRKRG SALEPERLRH DLRLTGDQPA SLVLTRVGGA PTVLICRPPT S
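
The protein sequence can be cross-checked against the gene sequence by translating it with translation table 11. The following is a minimal sketch, not the tool that produced this record. It assumes the gene sequence is given in coding orientation, reads the first codon as methionine when it is ATG, GTG or TTG (the most common bacterial initiators; table 11 lists a few more), and stops at the first stop codon. The names `CODON_TABLE`, `START_CODONS` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (codons enumerated over T, C, A, G);
# table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Assumption: only the three most common initiator codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# Usage: the first 30 bases of the gene sequence above.
print(translate("GTGGACCTGG GACAGCTGGC GGAGCTGCGT"))  # MDLGQLAELR
```

The gene starts with GTG, which encodes valine internally but is read as methionine at the start position, which is why the protein sequence begins with M.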