Gene Information |
Locus tag | Rcas_1363 |
Symbol | |
ID | 5538835 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 1742779 |
End bp | 1745697 |
Gene Length | 2919 bp |
Protein Length | 972 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640893500 |
Product | von Willebrand factor type A |
Protein accession | YP_001431477 |
Protein GI | 156741348 |
COG category | [S] Function unknown |
COG ID | [COG5426] Uncharacterized membrane protein |
TIGRFAM ID | |
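
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length, assuming one terminal stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record for Rcas_1363.
length = gene_length_bp(1742779, 1745697)
print(length)                     # 2919
print(protein_length_aa(length))  # 972
```

Under these assumptions both results agree with the Gene Length (2919 bp) and Protein Length (972 aa) fields of the record.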
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0717995 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATTCGCC TCTCGTTCAT TACGCCTCTC GCCCTCACGC TCCTGGCGCT GATCCCGGCG CTGTGGGCTT TGACGCTGCT GACGCCGCGC CGCCTGGCCC CCTGGCGCTT CTGGTCGAGC CTGGTTCTGC GCAGCATCAT CCTTCTTGCG CTGACGCTCG CCCTCGCCGG GACGCAGATC GTCCTGCCGG TGCGCGAACT GACGACGGTG TTCCTGGTCG ATGTGTCGGA CTCGATGACG CCGGCGCAAC GCGAACGCGC CTTGCAGTAC GTTAACGACG CACTGGCTGC CATGCCTCCA GGCGACCAGG CGGCTGTCGT GGTGTTCGGC GACAATGCGC TGGTGGAGCG CGCTCCTGGT CCAATCGGTC CGCTGAGTCG CCTGACGTCG GTTCCGATCA CGACACGCAC CAATCTCCAG GAGGCAGTAC AGTTGGGGCT GGCGCTCTTC CCGGCGGAAA CGCAGAAGCG CCTGGTGCTC ATTTCGGACG GCGGCGAGAA TGCCGGGCGT GTGGCGGATG CAGCGCAACT CGCGGCTATT CGGAAGGTGC CAATCGATGT CGTCTACATG CCGGGCGAGC GAGGTCCTGA TGTCATCGTT GCCGGGCTGA GCGCGCCAGC CGTCGTGCGT GAAGGGCAGG ACCTCACGTT GCAGGCGAAT ATCACGTCCA ACTATGCGAC GAGCGGACGT TTGCAAACGT TTGTGGACGG GCAACTGATC GGTGAGCAGG AACTCTCCAT CCCTGAAGGA GCGAGCACCA TCGATATTCG CGTCCCTTCG GGCGAAACCG GGTTTCGCCG CATCGAAGTG CGCCTCGACG CCGATGGGGA CACAGAGCCG CAGAACAATC GTGGGGCGGC GTTCACCGAA GTGTTGGGAC CGCCGCGCCT GCTGTTGATC GCCTCCAACG AGGCGCGCGC CGTCAATCTG CGCGACGCGC TGCGCGCCGC CGAGGTGCGT GTCGATGTCC TCCCGCCGGA TCAGGCGCCC GCCACTCTGG ATCAGCTCGG CGCCTACGCT GGGGTGATAA TTGTCGATAC GCCAGCGCGC GATATGCCTC GCACATTGAT GGAGGCGCTG CCGGTTTATG TGCGTGAACT GGGGCGTGGG CTTGCCATGG TCGGCGGCAT CGATTCATTT GGCGCCGGGG GGTATCGGCG CACGCCGCTG GAGCCAGTGC TACCGGTGTT GCTCGATCCG CTCGACACAA AGCAGCAACC GGACCTGGCA CTGGTGATGG TGATCGACCG CAGCGGCAGC ATGTCGGAGT TGGTGGGCGG AAGCCGACGC AACCGGCTCG ACCTCGCCAA GGAAGCGGTT TATCAGGCAA GCCTTGGTCT GACCCCGATC GATCAGGTCG GGCTGGTTGT GTTCGACGAT GCGGCGAATT GGGTGCTGCC GCTGCAACGC TTGCCTTCGG TCGTCGAAAT CGAACGGGCG CTCGGTTCGT TTGGCATCGG CGGCGGCACG AATATTCGAC CGGGCATCGA ACAGGCAGCA CAGGCGCTGG CTTCCGCCGA TGCAAAGGTC AAGCACGTCA TTCTGTTGAC CGATGGCATC GCAGAAAGCA ACTACAGCGA TCTGATCGCG CAGATGCGCG CCGCCGGCGT CACCATTTCC ACGGTTGCAA TCGGTGAAGA CGCTAATCCC AATCTGGTCG ATGTGGCGAA TGCCGGCGGC GGTCGTTCCT ATCGTGTGAC CAGGATCGAG GACGTGCCGC GCATTTTTTT GCAGGAGACA ATCATCGCCG CCGGGCGCGA TATTGTCGAG GAGCGGATTG AACCGCAGGC GGGTCTTCCC 
TCGCCGATCA TCCGCAGTCT GGGAGGGTTG CCGCCGCTCT ATGGCTACAA TGGCACCGAG GTGCGCGAAG CGGCGCGCAC GTTTTTGTTC ACACCTGATG GGAAGCCTTT GCTGGCGCAA TGGCAGTATG GTTTGGGGCG CGTTGTCGCC TGGACGAGCG ACGCACAGGG GCGCTGGGCG CGCGACTGGA TTGCCTGGGA TCAGTTTCCG CGCTTTGCCG GCGGGATGAC CGATCTCTTG CTTCCACCAC GCGAAAGCGG AACGCTCGAA CTCCGCGCGA CTGCCGCTGG TCCGCGCGCA TTGATCGAGT TGACCGCTCA GGACGAGCAG GGACGTCCGC TTAACAATCT GGTTATTGCA GGGCGCGCCG TCGATCCGCA GAATCAGGGA ACTGCGGTCC AATTTCAGCA GATCGGTCCG GGCCAGTATC GCGCAGTCGT CGATACATCG TCGCCGGGCG TCTACCTGGC GCAGGTGGCG GTTTCCGATG CGGAAGGACG ACAAATTGGC GTCGCGGTGA CCGGCATTGT GGTCAGTTAT TCGCTGGAAT ACAGCGCGCA GCGCGAAAAT CTGCCACTGC TCAGCGACGT AGCAGGCATC AGCAGCGGGC GGATCAATCC TCCACCTGAT GTGGTCTTTG CGTCACCCAA TCAAAATGTC GGTTCAGTGC GAGAGATTGG ACTTCCGCTC CTCTGGCTGG CCCTTCTCCT CTGGCCCCTC GATATTGCCG CGCGGCGTGT GATGGTACGG ATGGACGACG TGGCGCCGTG GCTTGAACGG CTCCGTCGGC GGCGACCGTC GTCGGTGGCT GCGCCGGAGG CGGCGTCAAC CATGACACGA CTCGGCGCTG CGAAACGGCG CGCGATGATT GCGCGCACAT CGCCGAATCG CGCCGCTGCC AGCTCGGAGC AATCGGTTAC GCCGCCGGTG ATGACGCAGT CTCGCCAGAC GCCACCTCCT GCGCCAGAGT CCCGCCCGTC TGCGCCAGCG TCGGGCGCAT CCGAGACGCG CGCCCGATCA TCCGAAAAAC GTCCGGTCTC GCCGGAAGCG ACGGAAGAAC AGTTCGCCCG GTTGTTGGCG GCGAAACAGC GCGCGCGGCG CAAATCGGAG GAGCGGTAA
Protein sequence | MIRLSFITPL ALTLLALIPA LWALTLLTPR RLAPWRFWSS LVLRSIILLA LTLALAGTQI VLPVRELTTV FLVDVSDSMT PAQRERALQY VNDALAAMPP GDQAAVVVFG DNALVERAPG PIGPLSRLTS VPITTRTNLQ EAVQLGLALF PAETQKRLVL ISDGGENAGR VADAAQLAAI RKVPIDVVYM PGERGPDVIV AGLSAPAVVR EGQDLTLQAN ITSNYATSGR LQTFVDGQLI GEQELSIPEG ASTIDIRVPS GETGFRRIEV RLDADGDTEP QNNRGAAFTE VLGPPRLLLI ASNEARAVNL RDALRAAEVR VDVLPPDQAP ATLDQLGAYA GVIIVDTPAR DMPRTLMEAL PVYVRELGRG LAMVGGIDSF GAGGYRRTPL EPVLPVLLDP LDTKQQPDLA LVMVIDRSGS MSELVGGSRR NRLDLAKEAV YQASLGLTPI DQVGLVVFDD AANWVLPLQR LPSVVEIERA LGSFGIGGGT NIRPGIEQAA QALASADAKV KHVILLTDGI AESNYSDLIA QMRAAGVTIS TVAIGEDANP NLVDVANAGG GRSYRVTRIE DVPRIFLQET IIAAGRDIVE ERIEPQAGLP SPIIRSLGGL PPLYGYNGTE VREAARTFLF TPDGKPLLAQ WQYGLGRVVA WTSDAQGRWA RDWIAWDQFP RFAGGMTDLL LPPRESGTLE LRATAAGPRA LIELTAQDEQ GRPLNNLVIA GRAVDPQNQG TAVQFQQIGP GQYRAVVDTS SPGVYLAQVA VSDAEGRQIG VAVTGIVVSY SLEYSAQREN LPLLSDVAGI SSGRINPPPD VVFASPNQNV GSVREIGLPL LWLALLLWPL DIAARRVMVR MDDVAPWLER LRRRRPSSVA APEAASTMTR LGAAKRRAMI ARTSPNRAAA SSEQSVTPPV MTQSRQTPPP APESRPSAPA SGASETRARS SEKRPVSPEA TEEQFARLLA AKQRARRKSE ER
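
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below shows one way to strip the block spacing and compute a GC fraction; the helper names are hypothetical, and whether the record's GC content field (64%) was computed over exactly this gene sequence is an assumption.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, used only as a small demonstration.
first_blocks = "ATGATTCGCC TCTCGTTCAT"
print(len(clean_sequence(first_blocks)))  # 20
print(gc_fraction(first_blocks))
```

Passing the full Gene sequence field to `gc_fraction` gives a value that can be compared with the reported GC content, and `len(clean_sequence(...))` can be compared with the Gene Length field.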