Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_4116 |
Symbol | |
ID | 5211099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 5156786 |
End bp | 5158153 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640597704 |
Product | Allergen V5/Tpx-1 family protein |
Protein accession | YP_001278410 |
Protein GI | 148658205 |
COG category | [S] Function unknown |
COG ID | [COG2340] Uncharacterized protein with SCP/PR1 domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGATATT TTCTGCTCCT TCTCATGACG GGAATGGCGT TTTTCTCGAC GGTCCCTCAC GCAACGGGTC GGAGTGCTGA ACGCTGCTTC TCTGAAACCG GCTTTTGCAT CTCAGGACGC ATACGATCAT TCTGGGAACA GAACGGCGGA TTACCGGTGT TTGGTTTTCC CACAGGACCT GAGCAAGGCA TGGCTATTGA AGGGCGTATA GTGCGCGCGC AGCAATTTGA GCGCAATCGC CTCGAACTGC ATCCAGGCAA TCGTCCGCCC TACGATGTCT TACTGGGAAG ACTGGGCGCC GAGCGTCTCT CTCAAATGGG ACGCGACTGG CAATCCTTTC CGCAGAGTCA GCCACAACCA GGATGCCGTT TCTTTCCAGA GACAGGTCAC AATGTGTGTG GGGATATTCT TGCTGCATGG CGGGCGAATG GACTTGAACT CGATGGTCGG CGTGGTACGA CCGAGGCTGA GAGTCTGGCG CTCTTCGGTT TGCCGTTGAG CGATCTTGTT TCAGAAACAC TGAGCGATGG CAAAACGTAT CAGGTGCAGT GGTTCGAGCG TGCGCGATTC GAGTTGCATC CGGAACTTGC TCCTCCTTAC CATGTCTTGT TGGGGTTGCT GGGCAATGAA ACTCGGCACA GTGGGCAGGC ATCGCAACAA CCATCCCTCC CATCGAATGA CTGGCTCGCA CAGGTCAATG CATATCGCGC CCGTGCTGGT GTTCCGCCGG TCACTGCCGA TCCGACTTTG AATGACAATT GTGTTCAACA TGCGCGCTAT ATGGCGGAAA ACGGAGTGCT GACCCATGAC CAGAATCCGT CGCTTCCCTG GGCTTCTGGA GCAGGGCAGA CATGTGCCCA GAAAGGAAAT GTCTGGCTGG GTTCGGGCAA CGTCTGGAAA CCGCTGGACG CGATAGATGG CTGGATGATG TCGGTTGGAC ATCGGGCCTG GTTGCTGTAT CCAACGACTC CGACCTTTGG GTTCGGATTT TACCAAACGA GAGGGGTCAG CGCTGCCGGG TTGGATGTTC TGACACATGC TCGATTGGAT CAGGATACGA CCTTCCCCGG TTGGCCAGTA CGGTATCCTG GAGGCGATCA GCAGGATGTC CCTCCGATAC AGTTGCCAAT TACCCTGTTC TGGCCATACT TTGGTCCGAC GCCTGTTATC AGCAGTGTAA GCCTGCGCAC AGGGTCAGGA ATGTCACTGC CCCATTCTGC AACAACGAAC CTTCCTGCCG GTCACAAAGG CGTGGCTATC ATACCCGCTC AGGCGCTGCC GCCATTTACG ACCATTGAAG CAACGGTCAC CGGAACCTAT GATGGTCGAC CTTTCAACGT CACATGGCAA TTCACAACCC GCAGATGA
|
Protein sequence | MRYFLLLLMT GMAFFSTVPH ATGRSAERCF SETGFCISGR IRSFWEQNGG LPVFGFPTGP EQGMAIEGRI VRAQQFERNR LELHPGNRPP YDVLLGRLGA ERLSQMGRDW QSFPQSQPQP GCRFFPETGH NVCGDILAAW RANGLELDGR RGTTEAESLA LFGLPLSDLV SETLSDGKTY QVQWFERARF ELHPELAPPY HVLLGLLGNE TRHSGQASQQ PSLPSNDWLA QVNAYRARAG VPPVTADPTL NDNCVQHARY MAENGVLTHD QNPSLPWASG AGQTCAQKGN VWLGSGNVWK PLDAIDGWMM SVGHRAWLLY PTTPTFGFGF YQTRGVSAAG LDVLTHARLD QDTTFPGWPV RYPGGDQQDV PPIQLPITLF WPYFGPTPVI SSVSLRTGSG MSLPHSATTN LPAGHKGVAI IPAQALPPFT TIEATVTGTY DGRPFNVTWQ FTTRR
|
| |