Gene RoseRS_3544 details

Gene Information

Locus tag: RoseRS_3544
Symbol:
ID: 5210522
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4438960
End bp: 4441860
Gene Length: 2901 bp
Protein Length: 966 aa
Translation table: 11
GC content: 63%
IMG OID: 640597140
Product: von Willebrand factor, type A
Protein accession: YP_001277852
Protein GI: 148657647
COG category: [S] Function unknown
COG ID: [COG5426] Uncharacterized membrane protein
TIGRFAM ID:
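
The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive (an assumption that matches the reported 2901 bp) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4438960, 4441860)
print(length)                     # 2901
print(protein_length_aa(length))  # 966
```

Both results agree with the Gene Length and Protein Length fields of this record.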


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGATTCGCC TCTCGTTCAT CACACCGCTT GCACTCATTC TCCTGACGCT GCTTCCGGCG 
TTGTGGGCGT TCACCCTGTT GACGCCGCGC CGCCTCGCTC CGTGGCGTTT CTGGTCGAGT
CTGGCGCTGC GCAGCGTCAT TCTTGCTGCG CTCGTGCTGG CGATCGCCGG TGCGCAGATT
GTGCTGCCGG TGCGTGAGGT GACCACCGTT TTTTTAATCG ATGTGTCGGA CTCGATGACC
CCAGCGCAAC GTGAACGCGC TTTGCAATAT GTCAACGACG CACTGGCTGC CATGCCAGCC
GGTGATCGTG CCGCCGTGGT GGTGTTCGGT GAAAATGCGC TGGTGGAGCG CGCCCCCGGT
CCTATTGGCG CACTGGGTCG TCTGTCATCG ACGCCGATGA CCACCCGCAC CAATCTCCAG
GAGGCGGTGC AACTGGGGCT GGCGCTGTTC CCTGCCGAGA CGCAGAAGCG GTTGGTGCTC
ATCTCGGACG GGGGCGAAAA TGCCGGAAGA GTGGCGGATG CGGCGCAACT GGCTTCGATT
CGCAAGGTGC CGATCGATGT GGTGTATCTG CCCGGTGAAC GTGGTCCTGA CGTCATCGTC
GCCGGTCTGA GCGCGCCTGC GGTTGTGCGC GAAGGGCAGG ACATCATCGT GCAGGCGAAT
ATATCGTCCA ACTATGCCAC CGGCGGGCGT CTGCAAACCT TCGTTGACGG GCAACTGATC
GGCGAACAGG AACTCTCCAT CCCTGAAGGA TCGAGCACAG TTGATATCCG TGTGCCATCG
GGTGAAACCG GATTCCGGCG TCTCGAAGTC CGGCTTGATG CCGACGGCGA TACCGAGCCG
CAGAACAATC GAGGGGCAGC GTTCACCGAA GTGCAGGGAC CGCCACGCCT GTTGCTGATC
GCCTCCGACG AATCACGCGC GGCAAACCTG CGCAATGCGT TGCTCGCCGC CGGGGTGCGC
GTCGATCTGC TTCCCCCCAG TCAGGCGCCA GCCACGCTCG CTCAACTTGG CGCCTACGCT
GGCGTCATGA TCGTCGATAC CCCGGCACGT GAGATGCCGC GCACGCTGCT TGAGGCATTG
CCAGCATATG TGCGCGAACT TGGGCGCGGC CTGGCGATGG TCGGCGGCGT CGACTCGTTT
GGCGCTGGCG GCTACCGGCG TACACCGCTG GAACCCATGC TGCCGGTGCT GCTCGACCCG
CTGGACACGA AACAACAACC CGATCTGGCG CTGGTGATGG TGATCGACCG GAGTGGCAGT
ATGGCTGAAC CGGTGGCAGG CGGCAGGCGG AATAAACTCG ATCTCGCCAA AGAAGCAGTG
TACCAGGCAA GTCTCGGTTT GACCCCCATC GACCAGGTCG GGCTGGTTGT CTTCGACGAT
ACGGCAAACT GGGTGCTTCA GTTACAACCG TTGCCGTCGA TGGTCGAAAT CGAGCGGGCG
CTCGGTTCAT TTGGCATCGG CGGCGGCACG AATATTCGGC CCGGCATCGA ACAGGCGGCG
CTGGCGCTGG CATCCACCGA CGCGAAGATC AAGCATGTCC TCCTGCTGAC CGATGGTATT
GCGGAGAGTA ATTATAGCGA TCTGATTGCT CAAATGCGCG CGTCCGGCAT TACCATTTCC
ACCGTTGCAG TCGGTCTGGA TGCCAACCCT AATCTGGTCG ATGTAGCGAA CGCTGGCGGC
GGGCGTTCCT ATCGCGTGAC CAGCATCGAT GAAGTGCCGC GCATTTTCTT GCAGGAGACG
ATTATCGCCG CCGGGCGTGA CATCATCGAA CAGCCAATCG AACCGCAGTT GGGTCTATCT
TCGCCGATCA TCCGCAGCCT GGGGGGATTG CCGCCGCTCT ACGGCTATAA TGGCACAGAG
GTGCGCGAGG CGGCGCGCAC CCTGCTCCTC ACGCCGGATG GTAAACCGTT GCTGGCGCAG
TGGCAGTATG GTTTGGGGCG GGTGGTCGCC TGGACGAGCG ATACCCAGGG ACGCTGGGCG
CGTGACTGGA TCGCCTGGGA TCGGTTTCCA CAGTTCGCTG GCGGTCTGGC AGACCTGCTG
CTCCCGCCTC GTGAGAGCGG TTTGCTCGAA CTCCGGGCAA CCGCCGCCGG TCCACGCGCA
TTTCTGGAAC TGATTGCGCA GGACGAACAG GGACGTCCGC TCAACAACCT GGCAATCGCC
GGGCGCGCCG TCGATCCGCA GAATCAGGGA GCGACGGTGC AGTTCCAGCA GATTGGTCCG
GGCAGATACC GCGCCGCAGT GGATACGCCG TCACCGGGGG TCTACCTGGC GCAGGTTGCA
GCATCCGATG CAGAAGGGCG TCAGATTGGC GTTGCGGTAA CCGGCATCGT CGTCAGTTAC
TCGCTCGAGT ACAGTGCACA GCGCGAGAAC CTGCCACTTC TGACAGAGGT TGCCAGCATC
AGCAGAGGAC GGATCAACCC GTCGCCGGAG ACAGCGTTCG CCTCACCCAA CCAGGAGGTC
GGTTCGGTGC GTGAGATCGG ATTTCCGCTC CTCTGGCTGG CGCTGATCCT GTGGCCCCTC
GACATTGCCG CGCGGCGCGT GATGTTGCGT TTGGAGGATG TCGCCCCCTG GCTGGAACGG
CTTCGCCGAA GGCGTCCCTC CGTCGTCGCT GCGCCAGAAG CATCGGCGAC GATGACGCGG
CTTGGCACAG CGAAACGGCG CGCAACTGCG GCGCGTCCGT CGTCGATCAG TGTGGAACGT
TCTGGAATCG ACGCACCGAC GGTGCCACAA ACCGTCGTTC CCACCGATCA GGCCTCCCAG
GGGCGCGCCC CTGCGCCGCC GCCGCAGACG ACCGAACAGC GCGCCAGACC GACCGCAACC
CGCCCGGAAG CGGCGGAAGA GCAGTTCGCC CGGTTGCTGG CAGCAAAACA GCGCGCGCGG
CGCAAATCCG AGGATCGCTG A
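
The gene sequence is printed in blocks of ten bases, so it has to be cleaned before any computation. The sketch below is one possible way to strip the block spacing, compute a GC percentage, and translate the reading frame. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the two tables differ only in which codons may act as start codons), and this gene begins with ATG. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and only the first line of the sequence is used as input here.

```python
from itertools import product

# Codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq_text: str) -> str:
    """Remove block spacing and line breaks, and upper-case the bases."""
    return "".join(seq_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGATTCGCC TCTCGTTCAT CACACCGCTT GCACTCATTC TCCTGACGCT GCTTCCGGCG"
dna = clean(first_line)
print(translate(dna))  # MIRLSFITPLALILLTLLPA
```

The translated fragment matches the first twenty residues of the protein sequence in this record. Passing the full cleaned gene sequence to `gc_percent` gives a value that can be compared with the GC content field.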
 
Protein sequence
MIRLSFITPL ALILLTLLPA LWAFTLLTPR RLAPWRFWSS LALRSVILAA LVLAIAGAQI 
VLPVREVTTV FLIDVSDSMT PAQRERALQY VNDALAAMPA GDRAAVVVFG ENALVERAPG
PIGALGRLSS TPMTTRTNLQ EAVQLGLALF PAETQKRLVL ISDGGENAGR VADAAQLASI
RKVPIDVVYL PGERGPDVIV AGLSAPAVVR EGQDIIVQAN ISSNYATGGR LQTFVDGQLI
GEQELSIPEG SSTVDIRVPS GETGFRRLEV RLDADGDTEP QNNRGAAFTE VQGPPRLLLI
ASDESRAANL RNALLAAGVR VDLLPPSQAP ATLAQLGAYA GVMIVDTPAR EMPRTLLEAL
PAYVRELGRG LAMVGGVDSF GAGGYRRTPL EPMLPVLLDP LDTKQQPDLA LVMVIDRSGS
MAEPVAGGRR NKLDLAKEAV YQASLGLTPI DQVGLVVFDD TANWVLQLQP LPSMVEIERA
LGSFGIGGGT NIRPGIEQAA LALASTDAKI KHVLLLTDGI AESNYSDLIA QMRASGITIS
TVAVGLDANP NLVDVANAGG GRSYRVTSID EVPRIFLQET IIAAGRDIIE QPIEPQLGLS
SPIIRSLGGL PPLYGYNGTE VREAARTLLL TPDGKPLLAQ WQYGLGRVVA WTSDTQGRWA
RDWIAWDRFP QFAGGLADLL LPPRESGLLE LRATAAGPRA FLELIAQDEQ GRPLNNLAIA
GRAVDPQNQG ATVQFQQIGP GRYRAAVDTP SPGVYLAQVA ASDAEGRQIG VAVTGIVVSY
SLEYSAQREN LPLLTEVASI SRGRINPSPE TAFASPNQEV GSVREIGFPL LWLALILWPL
DIAARRVMLR LEDVAPWLER LRRRRPSVVA APEASATMTR LGTAKRRATA ARPSSISVER
SGIDAPTVPQ TVVPTDQASQ GRAPAPPPQT TEQRARPTAT RPEAAEEQFA RLLAAKQRAR
RKSEDR