Gene RoseRS_3600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3600 
Symbol 
ID5210578 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4497522 
End bp4499273 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content62% 
IMG OID640597193 
Productfibronectin-binding A domain-containing protein 
Protein accessionYP_001277905 
Protein GI148657700 
COG category[K] Transcription 
COG ID[COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.110241 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATTTCG ATGCGTTGAC GCTTGCCGCC GTTGTCGATG AGTTACGCGC CACTCTTGTC 
GGTGGTCGTG TCCAGCATGT CCTGTTGCCG GGAGAGTTGA GCGTCGCTCT CGAAATCTAC
GCCGGTCGGC GCTATTATCT CGTACTTTCC GCCCATCCGC AATTCGCGCG CGTTCACCTC
AGCCCGGTGC GCATCTCGCG TGGCACGGAT GCGACGCCGC CACTGCTGCT GTTGCTGCGC
AAGTATGTCA ATCGTGGTCG GATCACCGCT ATCGAGCAAC CGGATCTGGA GCGTGTGTTG
CTGCTAAGTA TAGCCAAACG GCCGCTCCTG CGCAACTCTG ATGACGAACC TGAACTCGAT
AGCGATGATG AGGATCGACC AGACACCACA TCACCGGAGA ATGAAACACT CCGGTGCGAA
TTGATTGTTG AAATCATGGA ACGCCGCAGT AATATCGTGC TGGTGGGCGA CGATAACGTG
ATTCTGGCGG CAGCACGGCA TGTGACGCCA CGTATGAGCC GGCGTCCGGT GTTGCCGCGT
GAACCATACG AACTGCCGCC GCCCCAGTCC AGGCACGATC CGCGCCAGAC GACAGCAGTC
GAGATGCGCG CTGCCGTGCC GGATGGTCAA CCCGATCTGG CGCGCGCGCT GGTGAGCGCC
TACCGTGGGC TGTCGCCGCT TGCTGCGCGT GAGGTGGTCT ATCGTGTGAT GGGGCGCACC
ATTGTGCCAA CCGGCGCCGA TCTGCCATGG GAAGCGCTTG CCGGTGCGTT GCGGCAGTTG
TGTCAGCCTC CCTGGACGCC GCATATTGTG ATTGATGATG GCGAACCGGT CGCATTCGCG
CCATACGAAC TGACCCATCT GCCGGGGGCG CGTCCCTGTC CATCGATGAG CGCTGCGCTG
GACGCATACT ATGCAACGCG CGAGCGCCTT ACCGCCCATC ATCAACGGCG CGACGCTCTG
CGTGAGCGGT TGAACGCCAT GCGTGAGCGT CTGGAGCGGC AACGTTCCGC CCTCCGCGCC
GAACTGGAGC GCGCTTCCGA TCTTGAGCGG TTGCGCTGGG AAGGAGAAAT GATCTTTGCG
TTCCTTCACG AACTGGCGCC AGGACAGGAT CATCTGGAGG TGGACGGGCG ACGTATCGCG
CTCGATCCGC GCAAGTCGCC GGTCGAGTGT GCGCAGGATC GTTTCCGCGC CTATGAGAAA
GCCAAAGGTG CGCTTGCTGG CGTTCCCGAA CGATTGCGCG CCGTCGAGTT GCGTCTGGCA
GGTCTGGATG AAACGCTGGC GCTACTGGAA CTGGCGGAAG GATACGACGC TATCGAGGCA
ATTGCACGTG AGGCAGAGGC GGAAGGCTAT CTCGGACCTG AAACAGGTCG CACCCGCAAG
CGTCCTGATC GCCCTGCGCC GCCGTTACGC CTCGAATCAA GCGACGGCTT GACCATCTAT
GTTGGACGCA CTGCACAGCA GAACGAACAT GTCACCTTTC GCCTCGGCGC ACCTGATGAT
CTCTGGCTGC ACGTGCGCGG CGCACCTGGT GCGCACGTGA TTATCAAAGC CGGTCAGCGT
GACGTCCCGG AGCGCACCAT CGAAGAAGCA GCAGCGCTGG CAGCGTACTA CAGCAGTCAG
CGCGCCTCTG CCAGCGTCGA GGTTGAAATT GCACGGCGAC GCCACGTGCG GAAAGTGCGT
GGCGGACCGC AGGGCCTGGT GACCTATCAG GCCGAGCGGG CAGTGCGTGT GACGCCACGC
CCGCCGTGGT AG
 
Protein sequence
MYFDALTLAA VVDELRATLV GGRVQHVLLP GELSVALEIY AGRRYYLVLS AHPQFARVHL 
SPVRISRGTD ATPPLLLLLR KYVNRGRITA IEQPDLERVL LLSIAKRPLL RNSDDEPELD
SDDEDRPDTT SPENETLRCE LIVEIMERRS NIVLVGDDNV ILAAARHVTP RMSRRPVLPR
EPYELPPPQS RHDPRQTTAV EMRAAVPDGQ PDLARALVSA YRGLSPLAAR EVVYRVMGRT
IVPTGADLPW EALAGALRQL CQPPWTPHIV IDDGEPVAFA PYELTHLPGA RPCPSMSAAL
DAYYATRERL TAHHQRRDAL RERLNAMRER LERQRSALRA ELERASDLER LRWEGEMIFA
FLHELAPGQD HLEVDGRRIA LDPRKSPVEC AQDRFRAYEK AKGALAGVPE RLRAVELRLA
GLDETLALLE LAEGYDAIEA IAREAEAEGY LGPETGRTRK RPDRPAPPLR LESSDGLTIY
VGRTAQQNEH VTFRLGAPDD LWLHVRGAPG AHVIIKAGQR DVPERTIEEA AALAAYYSSQ
RASASVEVEI ARRRHVRKVR GGPQGLVTYQ AERAVRVTPR PPW