Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3380 |
Symbol | |
ID | 5210357 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 4242716 |
End bp | 4244095 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640596977 |
Product | von Willebrand factor, type A |
Protein accession | YP_001277690 |
Protein GI | 148657485 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.178617 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00546751 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTCGTT TGATGCGTTG TTTGTACTGT GGTGCATTGC AGGATGAACC TGTCGGGGTC AAGTCCTGCG TCCGCTGCGG CGGCGAACTC AAGTACGAGG ACGCCGCCCG CACAGCCAGC GGCGGTTCCT ACCTCACCGC TCAACTGGAA CTGGACCAGG TCTATGCCCC GCCCGGTCAG AACGTGGATC GCTACCTGCT GCTCACCCTC TGCTCCCCGG CCAAAGTGCC GCCCGAACAT GCGCTGCCCC GCGAACAGCA CCGCCCGCCG CTGCACCTTG TCGCCGTCCT GGATGTGTCT GGCTCAATGA GTGGCACAAA ACTGGCTTCC GCCAAAGAAG CCCTGCGTCA GGCGTTGCAC TTTTTGCAGG ACGGAGACGT CTTCTCGCTG GTTACCTTTT CCGATCAAGT GCAAACCCAC CTCAAAGCCG AGTCCTACGC CCAACGAAAG CGCGACAAGA TGGAAAACCT GCTGGACGAA ATCAGGGCGA GCGGCATGAC CGCCCTGGAT GGTGGGCTGG CACAGGGGAT TGACCTGGGC CAAAAGAAAC GGCAGGCCAC CACCCTGGTC CTGCTGCTCA GCGATGGGCA GGCGAATGTC GGCGAGACCG ACCTGGAGAA AATCGGCTTG CGGGCGCAAA AGGCGCGGCA ATCCGGTCTC ATCGTCTCCA CGCTGGGGGT TGGCCTGGAT TACAACGAGG CGCTGATGGT GGAAATTGCC AACCAGGGCG GTGGGCGTTT CTACCACATT CAAGAGGGCA GTCAGATCCC GGCGGCTCTG ATGCAGGAAC TGGGCAGCGC CGCCATGCTC GCCGCGCGTC AGGTGGAAGT GGAGTTCGAT CTCCCGTCCG GCGCGGCGCT GGTCTCGCTC ACCGCGCTCT ACCCGCTGGA AATGGTCAAC AGCCGCCCAC TTCTGAAAGT GGGGGACTTG CTGCCGGATG TGCGGGTGGA GATTCCGCTG CGCCTGACCC TTTACCCCCA TGCCGCTGGT GAGCGTTTCA GCGTCAGTGG AGGAGTGCAT CATCAAACGC CGCGCGGCCA GACCCTGGGA TTGGCGCTGA ACGCCGTCAG CGTGCGCTTT GTGGAACAGC GTCAATTCGA GGAGAGACCA GGTTACGTTG CCCCGGTGAT GGAGCGCGTG CTGGAATTTC GCCGCGCTGC CCACCTGTTG GAGTTTGCCC GCCTGCAGGA ACGGGATAGC ACGCTGGCCA GGCAGCAAGC CGAGCGGGAA CGCCAGGCCC TGCGGGATTA CGCTCGCATG TTCAATCCAG ACAAGGCGAT GGAACTGGAG GTAGAAAAGA TAGATGCGCT CTTTGCCACG CCCAATGTGG CCAAACAAGC CATGCATCGC GCCGCCCGAA TGATCCGCGG GCTGGATTAG
|
Protein sequence | MARLMRCLYC GALQDEPVGV KSCVRCGGEL KYEDAARTAS GGSYLTAQLE LDQVYAPPGQ NVDRYLLLTL CSPAKVPPEH ALPREQHRPP LHLVAVLDVS GSMSGTKLAS AKEALRQALH FLQDGDVFSL VTFSDQVQTH LKAESYAQRK RDKMENLLDE IRASGMTALD GGLAQGIDLG QKKRQATTLV LLLSDGQANV GETDLEKIGL RAQKARQSGL IVSTLGVGLD YNEALMVEIA NQGGGRFYHI QEGSQIPAAL MQELGSAAML AARQVEVEFD LPSGAALVSL TALYPLEMVN SRPLLKVGDL LPDVRVEIPL RLTLYPHAAG ERFSVSGGVH HQTPRGQTLG LALNAVSVRF VEQRQFEERP GYVAPVMERV LEFRRAAHLL EFARLQERDS TLARQQAERE RQALRDYARM FNPDKAMELE VEKIDALFAT PNVAKQAMHR AARMIRGLD
|
| |