Gene RoseRS_4116 details


Gene Information

Locus tag: RoseRS_4116
Symbol:
ID: 5211099
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 5156786
End bp: 5158153
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 55%
IMG OID: 640597704
Product: Allergen V5/Tpx-1 family protein
Protein accession: YP_001278410
Protein GI: 148658205
COG category: [S] Function unknown
COG ID: [COG2340] Uncharacterized protein with SCP/PR1 domains
TIGRFAM ID:
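
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record. It assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Coordinates taken from the record above.
print(cds_lengths(5156786, 5158153))  # (1368, 455)
```

Under those assumptions the result matches the listed Gene Length (1368 bp) and Protein Length (455 aa).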


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGATATT TTCTGCTCCT TCTCATGACG GGAATGGCGT TTTTCTCGAC GGTCCCTCAC 
GCAACGGGTC GGAGTGCTGA ACGCTGCTTC TCTGAAACCG GCTTTTGCAT CTCAGGACGC
ATACGATCAT TCTGGGAACA GAACGGCGGA TTACCGGTGT TTGGTTTTCC CACAGGACCT
GAGCAAGGCA TGGCTATTGA AGGGCGTATA GTGCGCGCGC AGCAATTTGA GCGCAATCGC
CTCGAACTGC ATCCAGGCAA TCGTCCGCCC TACGATGTCT TACTGGGAAG ACTGGGCGCC
GAGCGTCTCT CTCAAATGGG ACGCGACTGG CAATCCTTTC CGCAGAGTCA GCCACAACCA
GGATGCCGTT TCTTTCCAGA GACAGGTCAC AATGTGTGTG GGGATATTCT TGCTGCATGG
CGGGCGAATG GACTTGAACT CGATGGTCGG CGTGGTACGA CCGAGGCTGA GAGTCTGGCG
CTCTTCGGTT TGCCGTTGAG CGATCTTGTT TCAGAAACAC TGAGCGATGG CAAAACGTAT
CAGGTGCAGT GGTTCGAGCG TGCGCGATTC GAGTTGCATC CGGAACTTGC TCCTCCTTAC
CATGTCTTGT TGGGGTTGCT GGGCAATGAA ACTCGGCACA GTGGGCAGGC ATCGCAACAA
CCATCCCTCC CATCGAATGA CTGGCTCGCA CAGGTCAATG CATATCGCGC CCGTGCTGGT
GTTCCGCCGG TCACTGCCGA TCCGACTTTG AATGACAATT GTGTTCAACA TGCGCGCTAT
ATGGCGGAAA ACGGAGTGCT GACCCATGAC CAGAATCCGT CGCTTCCCTG GGCTTCTGGA
GCAGGGCAGA CATGTGCCCA GAAAGGAAAT GTCTGGCTGG GTTCGGGCAA CGTCTGGAAA
CCGCTGGACG CGATAGATGG CTGGATGATG TCGGTTGGAC ATCGGGCCTG GTTGCTGTAT
CCAACGACTC CGACCTTTGG GTTCGGATTT TACCAAACGA GAGGGGTCAG CGCTGCCGGG
TTGGATGTTC TGACACATGC TCGATTGGAT CAGGATACGA CCTTCCCCGG TTGGCCAGTA
CGGTATCCTG GAGGCGATCA GCAGGATGTC CCTCCGATAC AGTTGCCAAT TACCCTGTTC
TGGCCATACT TTGGTCCGAC GCCTGTTATC AGCAGTGTAA GCCTGCGCAC AGGGTCAGGA
ATGTCACTGC CCCATTCTGC AACAACGAAC CTTCCTGCCG GTCACAAAGG CGTGGCTATC
ATACCCGCTC AGGCGCTGCC GCCATTTACG ACCATTGAAG CAACGGTCAC CGGAACCTAT
GATGGTCGAC CTTTCAACGT CACATGGCAA TTCACAACCC GCAGATGA
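
The GC content field in the record could be recomputed from a sequence printed in this 10-base block layout. The following is a minimal sketch and not the database's own method. It assumes GC content means the percentage of G and C bases over the full sequence length, and the helper name `gc_percent` is hypothetical.

```python
def gc_percent(blocked_sequence: str) -> float:
    """Percentage of G and C bases in a whitespace-blocked DNA sequence."""
    # Drop the spaces and line breaks used for display.
    seq = "".join(blocked_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage on the first displayed line only. Paste the whole gene sequence
# to compare the result against the GC content field.
first_line = "ATGCGATATT TTCTGCTCCT TCTCATGACG GGAATGGCGT TTTTCTCGAC GGTCCCTCAC"
print(round(gc_percent(first_line), 1))
```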
 
Protein sequence
MRYFLLLLMT GMAFFSTVPH ATGRSAERCF SETGFCISGR IRSFWEQNGG LPVFGFPTGP 
EQGMAIEGRI VRAQQFERNR LELHPGNRPP YDVLLGRLGA ERLSQMGRDW QSFPQSQPQP
GCRFFPETGH NVCGDILAAW RANGLELDGR RGTTEAESLA LFGLPLSDLV SETLSDGKTY
QVQWFERARF ELHPELAPPY HVLLGLLGNE TRHSGQASQQ PSLPSNDWLA QVNAYRARAG
VPPVTADPTL NDNCVQHARY MAENGVLTHD QNPSLPWASG AGQTCAQKGN VWLGSGNVWK
PLDAIDGWMM SVGHRAWLLY PTTPTFGFGF YQTRGVSAAG LDVLTHARLD QDTTFPGWPV
RYPGGDQQDV PPIQLPITLF WPYFGPTPVI SSVSLRTGSG MSLPHSATTN LPAGHKGVAI
IPAQALPPFT TIEATVTGTY DGRPFNVTWQ FTTRR