Gene RoseRS_4236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4236 
Symbol 
ID5211221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5307338 
End bp5309050 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content53% 
IMG OID640597825 
ProductBNR repeat-containing glycosyl hydrolase 
Protein accessionYP_001278529 
Protein GI148658324 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000155192 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000114399 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAATAGAC ATCACACTAT CCGCTACTAT CGTAAGTTTT TGATCGCTGC AACCCTGCTT 
ATTGGGTTGA TAGGCGCTTT TATCGGCTTG CACACAACTG TACATGGGCA GACGACACGT
TGGTCACAAC CGGTAGATCT TTCGGTTGGG GCGAGGTCCT CATGGTTTCC AAGCCTGACT
GTTGCGGCGG ATGGCAGTGT GCATGTCGTG TGGGCTAGCG GTCGACCGCT CACCGAGGAT
AGTGGATTTA GTGCTGGTTC AAATCTTGCC AATGTTATGG ATCTGTTGAT GTATGCCGTG
TATCGAAATG GCAAGTGGTC GCCAGCAAAT GATATTCTTT TTTCCGGTTT GGGCGGAGCG
GCTGTCCGAA ACAGTATTGT AACTGGTCAT GATGGGAATC TCCACGTTGC TTTCCGGAGC
AGCGAACGCA TCTTTTTCAG CAGCGCGGAT CCTGTGCAGG CATTTCGCCC GTTTGCCTGG
CGCGATCCGA GAAGGATAAA TGGCTCGAGC GGACCGTACT ATGTTGAACT GGCGGTTGAT
AGTAAAGGAA CATTGCACGT GGTATGGACG GAGGTCGTAG TTTCGGAAGA ACGATCACGG
TATACCTTGT GCCCGTACTG CGCTAACCTC TTCTATCGCA ATTCGCAAGA TGGCGGGCTG
ACCTGGTCGG CGCCTGTCAA TCTGGCGGAT TCGTTTGATG GAACGACCAA ACCGCACATT
GCTATTGATC TGCAGGATGG GATTCATGTC GTATGGGATA TTGGCTTTGA TAATATAACT
GGCAAAGGCG CCCCTCTTGC TGGCGGGTAT CGCTATTCAA GCGATGGCGG TATCACGTGG
AATACTGTAG TGCGTTTTAC ACTCGCCGAA GGGCAATCTT CCCTTCAGCC GACCCTGCTT
TCAACGTCAG TTGCGTTGCC AACCCAAACA GTCACTCCAC CCTCTACAGA AACTCCCGCG
AATCCACTCA TTGAATCGCT CAGCGATGCG CCGCAGCAAA CGACGCTTGG ACTTTTTCAG
CACCGTGATC CGATTGTTGT GTACCGCAGC ACACGAACTG ATCGGATTTA CTATCAGGTT
TCACGTGACA ATGGTATAAC CTGGAGCAAT CCGCGTATAC TCCATGGCGT GCGCGCACGT
GACCTTAGAG AAACACCATG GGATGCATAC ACGATGGCTA CCGATGGCTC GGGAAATGTC
CATCTTATTC TCTCCGGTTT GCTGGATACA GGGAATGCTC CGACGAACCG CCAGAAGCCC
TCACTGCTGC ATCTGGTCTG GAATGGCGCT TACTGGTCTC GTCCAGAAGT TGTTGTTGCG
AATGACTTGT ATCCCGAATG GCCACGCCTC GTCGTGCACG GACAGCAGTT GCATCTCGTC
TGGTTCACGC GTAGCGATGA AGACATTTTC AAGAGTGATA ATGCGAACTA TCGGGTGTGG
TATAGCAGCG CGACTATCGA TGCACTGCCA TTGCCGGCAG CGCCAACGTT TACTCCTGCG
CCAACTGACG GACCAACCCC TACAATCATT CCATCGCCTG CGCCTTCGCC GACCCCGTTA
CCATCAGCGA TACAGCAAGT CCCGCCACCG AATGGATATC CAGCCTGGGA GTCGGTGGCA
CTGACCGTCA TGAGCATTGC GTTACTGCCG GTGCTTGCCT TCGTAGCAAT CGTAGCAATC
GCTCACTCAC GCGGTATGCG CTGGCGCATA TAA
 
Protein sequence
MNRHHTIRYY RKFLIAATLL IGLIGAFIGL HTTVHGQTTR WSQPVDLSVG ARSSWFPSLT 
VAADGSVHVV WASGRPLTED SGFSAGSNLA NVMDLLMYAV YRNGKWSPAN DILFSGLGGA
AVRNSIVTGH DGNLHVAFRS SERIFFSSAD PVQAFRPFAW RDPRRINGSS GPYYVELAVD
SKGTLHVVWT EVVVSEERSR YTLCPYCANL FYRNSQDGGL TWSAPVNLAD SFDGTTKPHI
AIDLQDGIHV VWDIGFDNIT GKGAPLAGGY RYSSDGGITW NTVVRFTLAE GQSSLQPTLL
STSVALPTQT VTPPSTETPA NPLIESLSDA PQQTTLGLFQ HRDPIVVYRS TRTDRIYYQV
SRDNGITWSN PRILHGVRAR DLRETPWDAY TMATDGSGNV HLILSGLLDT GNAPTNRQKP
SLLHLVWNGA YWSRPEVVVA NDLYPEWPRL VVHGQQLHLV WFTRSDEDIF KSDNANYRVW
YSSATIDALP LPAAPTFTPA PTDGPTPTII PSPAPSPTPL PSAIQQVPPP NGYPAWESVA
LTVMSIALLP VLAFVAIVAI AHSRGMRWRI