Gene Information |
Locus tag | RoseRS_1891 |
Symbol | |
ID | 5208852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 2346784 |
End bp | 2349582 |
Gene Length | 2799 bp |
Protein Length | 932 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640595500 |
Product | von Willebrand factor, type A |
Protein accession | YP_001276230 |
Protein GI | 148656025 |
COG category | [S] Function unknown |
COG ID | [COG5426] Uncharacterized membrane protein |
TIGRFAM ID | |
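The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Protein Length excludes the stop codon. Neither convention is stated in the record, and the function names are hypothetical, chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for RoseRS_1891.
length_bp = gene_length(2346784, 2349582)
print(length_bp)                  # 2799
print(protein_length(length_bp))  # 932
```

Under these assumptions both results agree with the Gene Length and Protein Length fields in the record.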
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.619443 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGTTT CGTTCATTCA TCCCGATGCC CTCTGGCTCT TGATCGTGTT GCCGCTGCTG TGGGGCGTTG CGTTGATCGC GCCGTCGACC GGCACCGCCT GGCAGCGGCG GATCGCGCTG ATCCTCCGCA CCCTGATGGT TCTGGCGCTC ATCGGTGCGC TTGCAGGCGC GCAGGTTGTG CAACCTCCCG CTTTCACCAC CACTATTTTT CTCCTCGATG GCTCGGATTC GGTTGCAGTG TCGCAGCGTG TCCGCGCCGA GGCGTTCATC GCACAGGCGC TGGCGTCGAT GCCGCCGGAT GATCAGGCTG GCGTGGTGGT CTTCGGGCGG GAAGCGCTGG TTGAGCGTAT GCCGTCGCCG GAACGCACCT TTGGCGCACC GGCGGTGCGT CCGTCTGGCA GTGCAACCAG TATTGCCGAT GCGTTGCAAC TCGGTATGGC GCTGCTTCCC GCCGAAGGGC ATCGACGGCT GGTGCTCCTT TCTGATGGCG GCGAAAATCG GGGGTCTGCC CGCGAGATTG CGCAACGCGC GGCGGTTGCG GGGATCCCTA TCGATGTTGT GCCGCTCAGT GGCGTCGCTG ATGGTCTCGA TGCGCAGATT GTCAGTGTGA CGCTGCCATC GACCGCGCGT GAAGGTCAGC GCCTGCCGTT GCGTGTCGAT CTCGAAAGCA ATGCGCCTGC CACAGGACGC CTGATCGTGA CAGGACCTGA TGGAGGAACG GTCGCAACCA TACCGGTCGA TATCGGCGCT GACCGGCAAA CGATCTCCAT CCTGCTTCCC GAAGCGCCGG CTGCATTCAA TCGCTACACC GTGCGTCTCG ATGTCCCCGG CGATACCCGC GCACAGAACA ACGCGGTCGA AACCTTCAGC GTCGTCAGAG GCAGACCGCG CGCGTTGCTG GTTGCGCAGT CGCCCGAAGA TGCAGCCGGT CTGGAACGTG CGCTGCGCGC CGCCCAGATA GATGTCGCCG TCGTCGCGCC AGCGGCAATG CCCGATACGC TGCTGGCGAT GAGCCAGTAC GATGCCATTG CTCTGGTGAA TGTCCCCCGT CGTGCGTTCT CCGAATCGAC GCTGCAACAC CTGGCTACAT ACGTCCATGA TCGCGGCGGC GGTCTGATCA TGGTCGGCGG ACCACGGTCG TTCGGTCCAG GCGGCTGGCG TGGTACGCCA GTCGAAGCGG CGCTGCCGGT GACCATGGAC ATTCCCATCT ACCGCACAAT GCCGCCGGTC AGTGTGGTGA TCGTCATCGA TATTTCAGGC AGCATGGCGA TGACCGAGGA TGGCATTCCC AAACTGTCGC TGGCGCTCGA TGGTGCGCGA CGGATTGCAT CACTGCTGCG CGACGAGGAT GAACTGACCA TTCTTCCATT CGATGACCGT CCGGGGGTCG TCGTCGGTCC GCTTCCGGGA TCGCAGCGCG ACAAAGCCAT CGAACAGATG AGTCAGGTGC GCCTCGGCGG AAGCGGTATC AACATCCATG ATGCGCTCGT GGCGGCGGCG AGGTACGTTC GCGCCAGTGA CCGCCCCATT CGCCATATCA TCACGATCAC CGATGGCAAT GATACCGTGC AGCAGGAAGG CGCGCTCGAC ATTGTGCGCG CGCTGCGCGA TGAGCGCGTC ACCCTGACCT CGATTGCCGT CGGGCAGGGC AGCCATGTGC CGTTCATCCG CGATATGGCG GCGGTCGGCG GCGGGCGCAC CTTCCTGACC GAACGCGCCG CCGATCTTCC CGACCTGTTG TTGGATGAGG CGGAAATGAT CATTCAACCT TCGATCATTG AAGGGGTTGT CACACCGTTG 
CGCGGCGCGC CGCATCCTGC GATCCGCAGC ATTGACGCAG CGCCCGTTCT GTATGGCTAT GTTTTGACGA CGCCGCGTGA CACCGCGCAG GTGGCGCTCG TCACCCCGGA GGGGGATACG TTGATGGCAG CGTGGCAGTA TGGTCTGGGA CGCTCAATCG CCTGGACGAG CGATTTCAGC GGGCGATGGG CGAAGGAGTG GGTGGCGTGG GATCGGTTCC CGCAGTTCGG CGCGCACCTC TTCAACTGGC TCCTGCCGCC GCAAACCGAT GATGTTCTGA GCATTGCAAC GCACCCATCG GGCGACACGC TGACGATCGA GACCATTGCG CGGAGGCCCG ATGGGTCCCC ATGGAGCGGC TTACTCGTCT CTGTGCGTCT CATCGCGGCT TCTGGCGAGG TTATCGAAAC CGTGCTGCGT GAAGTCAGCC CTGGTCAATA CCGCGCTGCT CCGGATGGCG TGCCGCCTGG AGCGTATCTG GTTCAGGCGA CCGCGCAGGA CAGCCAGGGC GCGCTGGTCG CGGCAGTGAC GGGCGGGGCG GTCATGCCGC TCAGTAGGGA GTATCGCAGC CAGGCGGGGA ACCGCCATCT TCTCGAAGAA CTGGCGCAGA TCACCGGCGG GCGGCTTGAT CCGCAACCAC GCCAGGTGTT CGAGCGGGGT GGCGAAACGC GCGGCGCTGT GCGCGAGGTG GGCCTGCTAT TGATCGTGCT GGCGCTGATC CTGCTGCCGC TTGACATTGC CGTGCGACGC CTGCCGCTCC AGCGCGGAAT GATAGTCGCC GCGCTGCGGA AGGTCGGTCT GAGTGCACAT ACAGGGCAGT TCGAGACGCA GGCGGCGCCG GTTGCTGTTC CGTTCTCGCC GTCTCGCTCT GAATCGGCAT CGCGTCCAGA TCCAGCAGGG GAAGCACAGG TGCCCCCTGC TGAACTGGAA CGTCTCCGCG CAGCGCAGGA AGCGGCGCGG CGGCGGCTAC GCGGCGAAGA TGCCGACGTG CGCCGCTGA
|
Protein sequence | MNVSFIHPDA LWLLIVLPLL WGVALIAPST GTAWQRRIAL ILRTLMVLAL IGALAGAQVV QPPAFTTTIF LLDGSDSVAV SQRVRAEAFI AQALASMPPD DQAGVVVFGR EALVERMPSP ERTFGAPAVR PSGSATSIAD ALQLGMALLP AEGHRRLVLL SDGGENRGSA REIAQRAAVA GIPIDVVPLS GVADGLDAQI VSVTLPSTAR EGQRLPLRVD LESNAPATGR LIVTGPDGGT VATIPVDIGA DRQTISILLP EAPAAFNRYT VRLDVPGDTR AQNNAVETFS VVRGRPRALL VAQSPEDAAG LERALRAAQI DVAVVAPAAM PDTLLAMSQY DAIALVNVPR RAFSESTLQH LATYVHDRGG GLIMVGGPRS FGPGGWRGTP VEAALPVTMD IPIYRTMPPV SVVIVIDISG SMAMTEDGIP KLSLALDGAR RIASLLRDED ELTILPFDDR PGVVVGPLPG SQRDKAIEQM SQVRLGGSGI NIHDALVAAA RYVRASDRPI RHIITITDGN DTVQQEGALD IVRALRDERV TLTSIAVGQG SHVPFIRDMA AVGGGRTFLT ERAADLPDLL LDEAEMIIQP SIIEGVVTPL RGAPHPAIRS IDAAPVLYGY VLTTPRDTAQ VALVTPEGDT LMAAWQYGLG RSIAWTSDFS GRWAKEWVAW DRFPQFGAHL FNWLLPPQTD DVLSIATHPS GDTLTIETIA RRPDGSPWSG LLVSVRLIAA SGEVIETVLR EVSPGQYRAA PDGVPPGAYL VQATAQDSQG ALVAAVTGGA VMPLSREYRS QAGNRHLLEE LAQITGGRLD PQPRQVFERG GETRGAVREV GLLLIVLALI LLPLDIAVRR LPLQRGMIVA ALRKVGLSAH TGQFETQAAP VAVPFSPSRS ESASRPDPAG EAQVPPAELE RLRAAQEAAR RRLRGEDADV RR
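Both sequences are displayed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch, not the database's own tooling: it strips the grouping, translates codons, and computes a GC fraction. The helper names are hypothetical. The codon-to-amino-acid assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits, which this sketch does not model).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; last base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(grouped.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons of a cleaned DNA string; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three display blocks of the gene sequence in this record.
prefix = clean_sequence("ATGAACGTTT CGTTCATTCA TCCCGATGCC")
print(translate(prefix))  # MNVSFIHPDA
```

The translated prefix matches the first block of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field.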