Gene Information |
Locus tag | RoseRS_3544 |
Symbol | |
ID | 5210522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 4438960 |
End bp | 4441860 |
Gene Length | 2901 bp |
Protein Length | 966 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640597140 |
Product | von Willebrand factor, type A |
Protein accession | YP_001277852 |
Protein GI | 148657647 |
COG category | [S] Function unknown |
COG ID | [COG5426] Uncharacterized membrane protein |
TIGRFAM ID | |
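
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4438960, 4441860)
print(length)                     # 2901
print(protein_length_aa(length))  # 966
```

Both results agree with the Gene Length and Protein Length rows of the record.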
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCGCC TCTCGTTCAT CACACCGCTT GCACTCATTC TCCTGACGCT GCTTCCGGCG TTGTGGGCGT TCACCCTGTT GACGCCGCGC CGCCTCGCTC CGTGGCGTTT CTGGTCGAGT CTGGCGCTGC GCAGCGTCAT TCTTGCTGCG CTCGTGCTGG CGATCGCCGG TGCGCAGATT GTGCTGCCGG TGCGTGAGGT GACCACCGTT TTTTTAATCG ATGTGTCGGA CTCGATGACC CCAGCGCAAC GTGAACGCGC TTTGCAATAT GTCAACGACG CACTGGCTGC CATGCCAGCC GGTGATCGTG CCGCCGTGGT GGTGTTCGGT GAAAATGCGC TGGTGGAGCG CGCCCCCGGT CCTATTGGCG CACTGGGTCG TCTGTCATCG ACGCCGATGA CCACCCGCAC CAATCTCCAG GAGGCGGTGC AACTGGGGCT GGCGCTGTTC CCTGCCGAGA CGCAGAAGCG GTTGGTGCTC ATCTCGGACG GGGGCGAAAA TGCCGGAAGA GTGGCGGATG CGGCGCAACT GGCTTCGATT CGCAAGGTGC CGATCGATGT GGTGTATCTG CCCGGTGAAC GTGGTCCTGA CGTCATCGTC GCCGGTCTGA GCGCGCCTGC GGTTGTGCGC GAAGGGCAGG ACATCATCGT GCAGGCGAAT ATATCGTCCA ACTATGCCAC CGGCGGGCGT CTGCAAACCT TCGTTGACGG GCAACTGATC GGCGAACAGG AACTCTCCAT CCCTGAAGGA TCGAGCACAG TTGATATCCG TGTGCCATCG GGTGAAACCG GATTCCGGCG TCTCGAAGTC CGGCTTGATG CCGACGGCGA TACCGAGCCG CAGAACAATC GAGGGGCAGC GTTCACCGAA GTGCAGGGAC CGCCACGCCT GTTGCTGATC GCCTCCGACG AATCACGCGC GGCAAACCTG CGCAATGCGT TGCTCGCCGC CGGGGTGCGC GTCGATCTGC TTCCCCCCAG TCAGGCGCCA GCCACGCTCG CTCAACTTGG CGCCTACGCT GGCGTCATGA TCGTCGATAC CCCGGCACGT GAGATGCCGC GCACGCTGCT TGAGGCATTG CCAGCATATG TGCGCGAACT TGGGCGCGGC CTGGCGATGG TCGGCGGCGT CGACTCGTTT GGCGCTGGCG GCTACCGGCG TACACCGCTG GAACCCATGC TGCCGGTGCT GCTCGACCCG CTGGACACGA AACAACAACC CGATCTGGCG CTGGTGATGG TGATCGACCG GAGTGGCAGT ATGGCTGAAC CGGTGGCAGG CGGCAGGCGG AATAAACTCG ATCTCGCCAA AGAAGCAGTG TACCAGGCAA GTCTCGGTTT GACCCCCATC GACCAGGTCG GGCTGGTTGT CTTCGACGAT ACGGCAAACT GGGTGCTTCA GTTACAACCG TTGCCGTCGA TGGTCGAAAT CGAGCGGGCG CTCGGTTCAT TTGGCATCGG CGGCGGCACG AATATTCGGC CCGGCATCGA ACAGGCGGCG CTGGCGCTGG CATCCACCGA CGCGAAGATC AAGCATGTCC TCCTGCTGAC CGATGGTATT GCGGAGAGTA ATTATAGCGA TCTGATTGCT CAAATGCGCG CGTCCGGCAT TACCATTTCC ACCGTTGCAG TCGGTCTGGA TGCCAACCCT AATCTGGTCG ATGTAGCGAA CGCTGGCGGC GGGCGTTCCT ATCGCGTGAC CAGCATCGAT GAAGTGCCGC GCATTTTCTT GCAGGAGACG ATTATCGCCG CCGGGCGTGA CATCATCGAA CAGCCAATCG AACCGCAGTT GGGTCTATCT 
TCGCCGATCA TCCGCAGCCT GGGGGGATTG CCGCCGCTCT ACGGCTATAA TGGCACAGAG GTGCGCGAGG CGGCGCGCAC CCTGCTCCTC ACGCCGGATG GTAAACCGTT GCTGGCGCAG TGGCAGTATG GTTTGGGGCG GGTGGTCGCC TGGACGAGCG ATACCCAGGG ACGCTGGGCG CGTGACTGGA TCGCCTGGGA TCGGTTTCCA CAGTTCGCTG GCGGTCTGGC AGACCTGCTG CTCCCGCCTC GTGAGAGCGG TTTGCTCGAA CTCCGGGCAA CCGCCGCCGG TCCACGCGCA TTTCTGGAAC TGATTGCGCA GGACGAACAG GGACGTCCGC TCAACAACCT GGCAATCGCC GGGCGCGCCG TCGATCCGCA GAATCAGGGA GCGACGGTGC AGTTCCAGCA GATTGGTCCG GGCAGATACC GCGCCGCAGT GGATACGCCG TCACCGGGGG TCTACCTGGC GCAGGTTGCA GCATCCGATG CAGAAGGGCG TCAGATTGGC GTTGCGGTAA CCGGCATCGT CGTCAGTTAC TCGCTCGAGT ACAGTGCACA GCGCGAGAAC CTGCCACTTC TGACAGAGGT TGCCAGCATC AGCAGAGGAC GGATCAACCC GTCGCCGGAG ACAGCGTTCG CCTCACCCAA CCAGGAGGTC GGTTCGGTGC GTGAGATCGG ATTTCCGCTC CTCTGGCTGG CGCTGATCCT GTGGCCCCTC GACATTGCCG CGCGGCGCGT GATGTTGCGT TTGGAGGATG TCGCCCCCTG GCTGGAACGG CTTCGCCGAA GGCGTCCCTC CGTCGTCGCT GCGCCAGAAG CATCGGCGAC GATGACGCGG CTTGGCACAG CGAAACGGCG CGCAACTGCG GCGCGTCCGT CGTCGATCAG TGTGGAACGT TCTGGAATCG ACGCACCGAC GGTGCCACAA ACCGTCGTTC CCACCGATCA GGCCTCCCAG GGGCGCGCCC CTGCGCCGCC GCCGCAGACG ACCGAACAGC GCGCCAGACC GACCGCAACC CGCCCGGAAG CGGCGGAAGA GCAGTTCGCC CGGTTGCTGG CAGCAAAACA GCGCGCGCGG CGCAAATCCG AGGATCGCTG A
|
Protein sequence | MIRLSFITPL ALILLTLLPA LWAFTLLTPR RLAPWRFWSS LALRSVILAA LVLAIAGAQI VLPVREVTTV FLIDVSDSMT PAQRERALQY VNDALAAMPA GDRAAVVVFG ENALVERAPG PIGALGRLSS TPMTTRTNLQ EAVQLGLALF PAETQKRLVL ISDGGENAGR VADAAQLASI RKVPIDVVYL PGERGPDVIV AGLSAPAVVR EGQDIIVQAN ISSNYATGGR LQTFVDGQLI GEQELSIPEG SSTVDIRVPS GETGFRRLEV RLDADGDTEP QNNRGAAFTE VQGPPRLLLI ASDESRAANL RNALLAAGVR VDLLPPSQAP ATLAQLGAYA GVMIVDTPAR EMPRTLLEAL PAYVRELGRG LAMVGGVDSF GAGGYRRTPL EPMLPVLLDP LDTKQQPDLA LVMVIDRSGS MAEPVAGGRR NKLDLAKEAV YQASLGLTPI DQVGLVVFDD TANWVLQLQP LPSMVEIERA LGSFGIGGGT NIRPGIEQAA LALASTDAKI KHVLLLTDGI AESNYSDLIA QMRASGITIS TVAVGLDANP NLVDVANAGG GRSYRVTSID EVPRIFLQET IIAAGRDIIE QPIEPQLGLS SPIIRSLGGL PPLYGYNGTE VREAARTLLL TPDGKPLLAQ WQYGLGRVVA WTSDTQGRWA RDWIAWDRFP QFAGGLADLL LPPRESGLLE LRATAAGPRA FLELIAQDEQ GRPLNNLAIA GRAVDPQNQG ATVQFQQIGP GRYRAAVDTP SPGVYLAQVA ASDAEGRQIG VAVTGIVVSY SLEYSAQREN LPLLTEVASI SRGRINPSPE TAFASPNQEV GSVREIGFPL LWLALILWPL DIAARRVMLR LEDVAPWLER LRRRRPSVVA APEASATMTR LGTAKRRATA ARPSSISVER SGIDAPTVPQ TVVPTDQASQ GRAPAPPPQT TEQRARPTAT RPEAAEEQFA RLLAAKQRAR RKSEDR
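
As a rough illustration of how the gene sequence relates to the protein sequence and to the GC content field, the following sketch translates a space-grouped nucleotide string and computes its GC percentage. It is a minimal sketch under stated assumptions, not the database's own pipeline. The codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code (the tables differ in their permitted start codons), so a standard codon table is used here. Alternative start codons are not handled specially, which is harmless for this gene because it begins with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of 10 and uppercase it."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGATTCGCC TCTCGTTCAT CACACCGCTT"))  # MIRLSFITPL
print(translate("CGCAAATCCG AGGATCGCTG A"))           # RKSEDR
```

The two printed fragments match the first ten and last six residues of the protein sequence in the record. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.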