Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_2520 |
Symbol | |
ID | 5209489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 3114574 |
End bp | 3115932 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640596125 |
Product | von Willebrand factor, type A |
Protein accession | YP_001276847 |
Protein GI | 148656642 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.579263 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAGCA TGCCAACCCT GACACTTTCC ACCAGCCCAG GACCACTGAT CCTCGAAGCG CGCTGCGAAC CGCAGGTCTG CTACGTGTTA CTCACGGTGC GTGCTGCGGC AACCGGCGAC GCTGGCGTGC GCCCGGTCAA CTGGGCGCTG GTCGCAGACG CGAGCCGCTC AATGCGCATT CCCATCGTTG ATGAAACCCA GTTTCGCTCA TTGTTGCGCA ACGGCTCAGC GCAGGAGACG CTGGTTGATG GCGTTCCGGT ATGGCAGTTG AGCGGTTCGG TGCCGCAGGA GGTGCGCAAG GCAGCATCGA GCGCCCTCGA TCATGTCGTC CATGCGCTGC ACACCGTGGT CGAACGGCTC GACCGGAATG ATCGGCTGTC GCTGGTCGTC TTTGCCGATC ACGCCCTCCT CCTGATACCG GGGATGGTCG GTTCGGATCG GGTCACGCTG GTACGTGCCA TCGAACGGTT GCCCGGTCTT GATCTGGGGG ATGGCACAAA TCTGGCAGAC GGGATTGCGC TGGCGCTCAA TCAGATCCGC GCCAACCGCG ACGCACGCCG CGCCAACCGG GTTCTGCTGC TGACCGATGG CTTCACCCGC GATCCGGCGG CCTGTCTGAC GCTGGCCGAC CAGGCGGCTG ACGAGCATAT CGCCATCACC ACCATCGGTC TGGGTGGTGA GTTTCAGGAC GATCTGCTGA CAGGGATTGC CGACCGGAGC GGGGGAAATG CTCTCTTTTT GAAGCGCGCT TCTGCCATCC CGCGCGCTAT CAGCGCCGAA CTCGAGTCGG CGCGCGCAGC CGCTCTGCCA GGGGTGGACA TTGCTATCGC CCCGATGCGT GGCGTCATGC TGCGGCGGGT CACCCGTACG CGACCGGTGC TGGCAATTCT CGCCGAGCCG ACCGGCACCG GGGCATCTGA TGTCGTTTCC GTTCTCCTGG GGGATCTGCC CGCCGGATCG CCGGTGACGC TGCTGCTGGA ATTCCTCGTA CCGGCAGCAA ACCCCGGGCC GCTATGGATT GCCGGAGTTG CGGCGCGGTC GGCGGGGGTG CGTCTGGCGG AAGCGGACAT CAGGGCGACC ATCGCACACG GCGCCCCGCC ACTGAGCGAC GATGTGCGCG CGGCAGCGGC GCGGGCAATG GCAGCTCGCC TGATGCGTCG TGCAATCGCC GCATCCGACT CGATCGAAGC GGCGCGTCTG ATGCGCGCCG CTGCCGCGCG CTTCGATGAA CTCGGCGAAC CGGCGCTTGC CGCTGCCGCT CGTGAACAGG CGGCGACCAT CGAGCGCAAC GGGCGCGCCG CCAGCATCGT CACCCGTGAA TTAATCTATG CGACTCGCCG ACTTGGCGAT GCATCCTGA
|
Protein sequence | MDSMPTLTLS TSPGPLILEA RCEPQVCYVL LTVRAAATGD AGVRPVNWAL VADASRSMRI PIVDETQFRS LLRNGSAQET LVDGVPVWQL SGSVPQEVRK AASSALDHVV HALHTVVERL DRNDRLSLVV FADHALLLIP GMVGSDRVTL VRAIERLPGL DLGDGTNLAD GIALALNQIR ANRDARRANR VLLLTDGFTR DPAACLTLAD QAADEHIAIT TIGLGGEFQD DLLTGIADRS GGNALFLKRA SAIPRAISAE LESARAAALP GVDIAIAPMR GVMLRRVTRT RPVLAILAEP TGTGASDVVS VLLGDLPAGS PVTLLLEFLV PAANPGPLWI AGVAARSAGV RLAEADIRAT IAHGAPPLSD DVRAAAARAM AARLMRRAIA ASDSIEAARL MRAAAARFDE LGEPALAAAA REQAATIERN GRAASIVTRE LIYATRRLGD AS
|
| |