Gene Rcas_3351 details


Gene Information

Locus tag: Rcas_3351
Symbol:
ID: 5540850
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4375206
End bp: 4377218
Gene Length: 2013 bp
Protein Length: 670 aa
Translation table: 11
GC content: 64%
IMG OID: 640895469
Product: hypothetical protein
Protein accession: YP_001433419
Protein GI: 156743290
COG category: [A] RNA processing and modification
COG ID: [COG5178] U5 snRNP spliceosome subunit
TIGRFAM ID:
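
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4375206
end_bp = 4377218
gene_length_bp = 2013
protein_length_aa = 670


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Codon count minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 2013
print(expected_protein_length(gene_length_bp))  # 670
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields.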


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAGCCA GATACATATC TGTCGGTGAT GCGCCGCGCA TTCGCTTCGA CCGTTGCACT 
GGCGCGCTGC GTGTCGAGCC GGGTGAGTCC GGTGTGCTCG AGGTCTGGGT TGAAGAAGAG
CACGAAATTG TACTGGAACA TGATGAGCAG GGAGTCGTGA TCCCCGGCGA CATCGAGGGC
GATCTCCGAT TGCGCGCGCC GACGGCCGCA ACCGTTTCTG GGGGAGTCGT TGAGGATGCT
GTTTTTGTTT CCGGCATCGC GGCATTTACC CTCGACTCGG CTCAAGACGA TGTGCGGATC
GAAGGTGTCG GCGGCGTGGT GACGCTGGGT GATGTCGAAG GTGATCTTGC CATCAACCGT
GCTGCGGCGG TCAGGGTTGG CGATGTGGCG GGAAATGCGC AGATTGCGGC AGTAGATGAA
GCGATTGTGG TGCGCCGGGT CGAAGGTGAT CTGACCGTCA GCAGCGCCGG TTCAATCGAA
GCGCAGGGTG TCGATGGTGA TCTGCGCGTG GTGGATGTGC GCGGCGCGGT GTTACTTGAG
CGCATTGCCG GTGATGTGCT CGTCGAGCGC GTCGGCGAAC TGCGGGTCGT TGAAGCCATC
GATGGCGATG TACAGGTTAA TCGGTCGGGG ACGGTCGCGT TAAGCGATGT CGCCGGCGAT
GTCGGCGTCG ATGAAGTACA GAGCCTGATG GTTGGCGCGA TTGCCGGAGA CCTCCACGCC
AGCAGCGTGT CGGAGGTCGT GCGCTTCCGC GATCTGGATG GTGACCTGCG CCTGCGCCGA
AGCGATCGCG TCGCGGTCGA AGGAGGAAGC ATCAGCGGTG ATGTGCGGGT TGACCAGGTG
CGTAGCGTGA AGCTCGGTTC GATTGGCGGT GATGCAGATA TGCGCAATGT CGGCAGCGAC
CTCAGCATCG GCAGCATTGG CGGTGACGCG ACGTTCATTG ACATTGGCGA TGATCTGAAA
GCCGGTGGTA TTGGCGGCGA CCTGATGCTG CGCAATGTGA AGAGCGCAAC CCATGTCGGG
CATGTCGGCG GCGACCTGAC GCTGGCAATG GAGTTCGCTC CGAATAGCGT CACCCGGCTG
AATGTCGGCG GCGATGCGCG CATCGAATTA CCGTCCGACG TGAGCCTGAC AGTGCGAGCA
ACGGTTGGCG GAACGGTTCG GGGTCCCGGT ATCACTGCCA GTGGCGGGAT GTTCAGTGCG
GTGTACGGTG AAGGGGCAGC GTTGCTTGAG TTGTTTATCG GCGGTGATCT GTCGCTGCGC
GGAGCGACGC CGCGCAGCAG CAGCTCGATG GGGGAGGGCG CTCCACATCC CGCGAGCGGC
TCGCGCGCTC CCAACGACGA CATCGGGCGC TGGGCGGAAC GGTTCGCCGA AGAAATGGGG
CGCTGGGGCG AGCAGTTTGG TCGGGACATG AACCGTTGGG CGGAGGAGTT CAGCCGGGAG
ATGGGTGGGC GCAGCGACGA GTGGGCACGT CGTTCGCAGC GCAAAGCCGA ACGCCTTCGC
CAGCGGATCG AGCGCGAGAT GCGCGAAGCG GCAGAACGCG CTCGTGAAAC TGCGCAACGC
GGGGGGCATC CCCGCGAGGT GCGCGTGCGT ATTAATGACC GTGAATGGCG GTTCGACGAG
GAACGACTGG AACGGATGAA GCGCGAAGCG GCAGCCGCCG CGCAAGCCGG GATCGTTGGC
GCGCTTGAAG CGGTCGAGCG TGCACTGGAA AGCATGGGCA TCCCGCGAAC GCAACCACCG
GCGCCACCTC CGCCGCCGCA TGCGCCGCCT CCGCCGCCTC CGCCGCCGCA TGCGCCGCCT
CCGCCAGCAG CGGCAACCGG TACAACGATT CGGATTAACG GCGAGCCACC CTCCGGCGAA
TCCACGCCGG TGACTGTTGA TGAAGCGCCG GTGCAGGAAA CCGCGCCGTC TGGGACGACC
ATCGAAGAGC AACGCGCCGC CATCCTGCGC ATGGTTGCCG AGGGACGGAT CACGCCGGAG
GAGGCTGACC TGCTGCTCGA GGCGCTGGGT TAG
 
Protein sequence
MPARYISVGD APRIRFDRCT GALRVEPGES GVLEVWVEEE HEIVLEHDEQ GVVIPGDIEG 
DLRLRAPTAA TVSGGVVEDA VFVSGIAAFT LDSAQDDVRI EGVGGVVTLG DVEGDLAINR
AAAVRVGDVA GNAQIAAVDE AIVVRRVEGD LTVSSAGSIE AQGVDGDLRV VDVRGAVLLE
RIAGDVLVER VGELRVVEAI DGDVQVNRSG TVALSDVAGD VGVDEVQSLM VGAIAGDLHA
SSVSEVVRFR DLDGDLRLRR SDRVAVEGGS ISGDVRVDQV RSVKLGSIGG DADMRNVGSD
LSIGSIGGDA TFIDIGDDLK AGGIGGDLML RNVKSATHVG HVGGDLTLAM EFAPNSVTRL
NVGGDARIEL PSDVSLTVRA TVGGTVRGPG ITASGGMFSA VYGEGAALLE LFIGGDLSLR
GATPRSSSSM GEGAPHPASG SRAPNDDIGR WAERFAEEMG RWGEQFGRDM NRWAEEFSRE
MGGRSDEWAR RSQRKAERLR QRIEREMREA AERARETAQR GGHPREVRVR INDREWRFDE
ERLERMKREA AAAAQAGIVG ALEAVERALE SMGIPRTQPP APPPPPHAPP PPPPPPHAPP
PPAAATGTTI RINGEPPSGE STPVTVDEAP VQETAPSGTT IEEQRAAILR MVAEGRITPE
EADLLLEALG
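
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is a minimal illustration, not the pipeline used to produce the record. It relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code (the two differ in which start codons are allowed), and it makes no attempt to handle alternative start codons. The names `clean`, `translate` and `gc_fraction` are hypothetical helpers.

```python
from itertools import product

# Codon assignments in TCAG order. Translation table 11 uses the same
# assignments as the standard code; only the permitted start codons differ.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in a DNA string."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGCCAGCCA GATACATATC TGTCGGTGAT"))  # MPARYISVGD
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.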