Gene RoseRS_3021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3021 
Symbol 
ID5209989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3799388 
End bp3800785 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content63% 
IMG OID640596613 
Producthypothetical protein 
Protein accessionYP_001277335 
Protein GI148657130 
COG category 
COG ID 
TIGRFAM ID[TIGR02710] CRISPR-associated protein, TIGR02710 family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.127386 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGT CTGATAAGCG GATCCCTTTT CCCGAAATCT TCGCGTCCTT TCGCGCTACC 
GCAGACGGGT GTGCATTTCG CGGTCTGGTG CTGGTCGGTA CACTTCAGGC GGATACACCG
GCGCTCCTGA TCGCCGGTCT CAACCCGGAG CGGGTAGCGT TTTTGCTCAC CGATCAATCC
CGCCCAAAGC TGGACGAGGT GCGTCAACGA CTCGCGCAGG TCGCCGATCA GGCGCCGCTG
CGCTGTTCCC CCGATGACTG GTTCTGCCCG GACGGCGATT ATTCCAGTGT GCTGCGCGTC
TACACCGGGT TGCGGACGGT TCTTGATCGC TGGCGCGACC TGGAGCGTCA CGAGATTGCC
GTCGATCTGA CCGGCGGTAA ATCGACGATG ACCGTTGGGC TGGCAAAAGC AGCGCATGTG
CTGCGCCTGG CGGCAGTCTA TGTCGATAGC GATTATGCCG ATGGTCGTCC GGTGCCCGGA
ACGCAGCGCC TCGCAACACC GGAAGACCCG TATACCGTCT TCGGCGACCT GGAGGCGGCT
GAGGCGCGTC GTTTACATAA CAACCACGAC TATGCCAGCG CTGAGAAGAT CTTCCGCGAT
CTGGCGCAGC GGGTTCCGGA CAATCCCGAT TATGCGATCT ATGCCGATCT GTCGACGGCG
TACCTTGCGT GGGACAGTTT TGCGCCCCAT CAGGCGGGTG ATGCGCTCGA CCGCGTGCTG
GCGCGCGCCG ATCTGCCAGC CGATCTGCAG CCAGCGCGGA GTGTGTTGCA GGCGCAACGC
GAGACGCTGG CGCAGTTAAC CGCCATCAAC CGACGTCTGA CCCAGCGCAA GACCCCGCCA
GCCGATGCGC TGACGGCACT ACGCGATCTG AATCAGGGGC TGGCGTTGCT CGGTTCGCTT
CACAGCGCCG CACTGCGCCG CGCAGCCCAG GAACGCTACG ATGTCGCGGC GCTGATGCGC
TACCGCTGCC TGGAATTGCT GTCGCAGCAT CGCCTGGCGA CCTACGGCAT CTGGACGGCG
GAACCATCGT TTGATGCCGC CCTGCGGCGC GTCCCTGACC TCGATGATCG CTACCGGCAG
GCGCAGCGCG ACCAGGGGTT TCGCAAACAG TACCCGCTGC CTTCTTCAGA ACGCTCCATC
GCGCTGTTCG ACGGCTATAT GCTGCTCCAG GCGCTCGACG ATCCGCTGGT GCGCGGATGG
AATATCGGCG ATATTCGCCA GCGCTCGTAT GTGCGCAACA CCAGTATCCT GGCGCATGGG
TTCCGCCCGA TTTCCTCCCT TGAGTACGAA CAGTTCGCCG ATATTGTCGA GGAATTGCTG
GATCGCTTCT TTGCACTGAT CGGCAGGTCG CGCCAGGAGT GGGAGCGGGT CCATCGGTTC
GTTTCACTTG CAGCCTGA
 
Protein sequence
MTASDKRIPF PEIFASFRAT ADGCAFRGLV LVGTLQADTP ALLIAGLNPE RVAFLLTDQS 
RPKLDEVRQR LAQVADQAPL RCSPDDWFCP DGDYSSVLRV YTGLRTVLDR WRDLERHEIA
VDLTGGKSTM TVGLAKAAHV LRLAAVYVDS DYADGRPVPG TQRLATPEDP YTVFGDLEAA
EARRLHNNHD YASAEKIFRD LAQRVPDNPD YAIYADLSTA YLAWDSFAPH QAGDALDRVL
ARADLPADLQ PARSVLQAQR ETLAQLTAIN RRLTQRKTPP ADALTALRDL NQGLALLGSL
HSAALRRAAQ ERYDVAALMR YRCLELLSQH RLATYGIWTA EPSFDAALRR VPDLDDRYRQ
AQRDQGFRKQ YPLPSSERSI ALFDGYMLLQ ALDDPLVRGW NIGDIRQRSY VRNTSILAHG
FRPISSLEYE QFADIVEELL DRFFALIGRS RQEWERVHRF VSLAA