Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_1686 |
Symbol | |
ID | 5539162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 2170008 |
End bp | 2172488 |
Gene Length | 2481 bp |
Protein Length | 826 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640893823 |
Product | von Willebrand factor type A |
Protein accession | YP_001431796 |
Protein GI | 156741667 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000000433208 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAGAC GTTTGCTTCA TCACAGGACA GAAGGGCAGA ATATCGTCCT GCTCGCCGGC ATTCTGGCAT TGCTGGTCGG CATGGCGGCC CTGGCTGTCG ACCTCGGCGT CACCTACGCC GAGCAGCGGA ACATCGTGCG CGGCACGAAC GCGGCGTCGC TGGCGGGCAT GAATCGCCTG ATCAGCGGCG GTCGCGATGC TGATGTCGCC CTTGCGATCT ATGAATCGCT TCGCTCGAAC GGCATCCAGG TCACCGTGCC GGGCGAGTCG CCGCAACCCG GCGACCGCGC GTTCGAGGCG CTCTACCTGG GAAGCGATGG CGCGCCGATC CCCGGCGCGT GCAGTCGCGT CGGCGCCTGC GGCTCGCAAC GTCCGCAGGG GGTGAAGTAT ATTCAGATCG ACCTCAAGGG CAATGTCGAT ACCTACTTTG CGCGCCTCTT CGGTCAGAGC ACCCTGCCGG TCGGCGCTAC GGCATACGCC AGCGTCGGCG CCTGCGCCAC CGGATACTAC CCCATCGGTG TGCGCACGAC AGTCGGCGGG CAGCCGATGT TCGATGAGTA TGGGTTCGCC AATTACGACG GCTTCTATGA AGACGAGACC TATGCTCAAT TGCGCTATCA GCGCCTCTAT CTGCGCACCG AAAACAACCC GAATGGCGGG TTTAGCCTGC TGCGCTGGCG CAACGACATC CCTGCCGGCA ATGCCAATGC GCTTGCGGAG ATGCTGACCG GCGACGGCAC CTATAGCCGG GGGTACGCCG AAGCGCCCTG GCCCGAAATC GCCAGCGGAA CGGACGGTCC GACCCGCCCT GACTCCTATC CGTTCGAGCC GAATACGATC AACACGGCGG ACTGGGTGTA TGGTAATGTC TTCGGCGGCA ATCCCTTCAA TGCGAATGTG ATCAATCAAC TCGATACGCT GAAGCGCAAT CGCACCCTGA TGACTCTGCC GATGTATCGC TTCGATAACG GCAGTCTTTC CAACCCGACG TACTACCTCG AGCGCTTTGG CGCCTTCCTG CTGGTTGATT ACGGTGTGGA TACCGATGGG CGGGCGCCCG ACGGTCCTTG GATGGAACTG GCTTACGTGC GCGAAGGCGG GCAGTGCGCC AACCTCGTCA CCGGTGCGCC GCCAACCAAC AACTTCACCG TCGGCGGCAG CGTGACCTAT ACGCCACGCT ATCAGACCTA CCCGGACCTC AATCGCCCGG TGCAGTTCCT ACTCATCCTC GACGTGTCCG GTTCGATGTC GTGGACGTTC GATGGGCGCG GCGTGCAGAA TGGGCAAACC GTCTTCTGCA CCAACCCGTC TCAAGGGTGC GTCTCGGTGC AAACGGCATG GCCCAACGCG CAGGAACGCC GGATCTATAC TGCGAAGCAG GTGCTGCGCT CGTTCGTGGC ACAGATCGAT CAGGATCGGC AGAGCGGTCT CCGACCGTAT GATACCGTGC GCCTCGTGAC CTTCAGCGGA CGCCTGGGCA GCTTTGTCAA CAGCAGCGGC GCCGTCGGCG ACAATAATCG CGCGTTGAAC GACCTGACCG AGGTGCTGCC TGCCGGTTGG ACGAATGACC GCGCGACATT GGAAGCGGCG ATCAATAGTG CCGGTATGGT GGACGGCGAT CCGTATATGA CGGCAGGCGC AACGCCCAGC GCCGTCGCCT TTGCCCGCGC CAGCCAGGTG TTCGCCAACG CTCCCGAACG CGCGCCCAAC GGGATGAAGT ATCGCCGCGT GGTGATTTTC GTGACCGACG GTGTGGCGAA TGTGTTGCGC AATGGTATGC AGAATAACTA CGGTGAGGGG TGCCAGTTGG GCGCTGAGAA CGTCGGTTGC CAGATGGGCG ATCCGCTGCC GGACGGCAGC CTGCGACCGC TCAATGCGAT GGTCGCAGAG GCGCAGGCGC TCAAGGAGGC GTATATCCGC CCCAGCGACG GTTCGGTCTA TGTGGTGGCG CTGAGCGGCA CATTCGAGGC GACCGGTCTG AACCTCGTCG CCAGCCAGCC GGACTATGTC AAGCGCGCCG ACCGGTCCGA GGAGTTGCAG CAGATTTTCG ACGACATCCA GGTTTCGGCG ATCCAGGGCG ATTGTACGCC ATCCCAGGGT GAACTGCGCG ACTCGATGGC GCCCTCCGAG GTGCCGACCG ACCTGCGACC GGAATTGACC GATCGGATCG TTGGACAGGT GACGCTCACC GATGCCAATA ACAATGTGCG CGAGGCATGG ATTACGGCCG ATCCGATCAC GCGCAAACTG TCGTATGCGT TCACGAACGT TCCGCCGGGC CAGTATACGC TGCGCGCCTG GCTCGGCTAT CGCGCGCCGC AGGATGGTAT CTCGCGCGAT AAGTACGAAC TGCTGGTGCG CGGACCGGAT GTGACGACCG AAATCACGGT GCAGGTCAGC GCCGGTCGCA GTCTCGGCGG CGTCATCGGC GTGCCGATCT CGCTCGACCT GAACGACAGC GTCTGCCCGG CATTGTCGTA A
|
Protein sequence | MKRRLLHHRT EGQNIVLLAG ILALLVGMAA LAVDLGVTYA EQRNIVRGTN AASLAGMNRL ISGGRDADVA LAIYESLRSN GIQVTVPGES PQPGDRAFEA LYLGSDGAPI PGACSRVGAC GSQRPQGVKY IQIDLKGNVD TYFARLFGQS TLPVGATAYA SVGACATGYY PIGVRTTVGG QPMFDEYGFA NYDGFYEDET YAQLRYQRLY LRTENNPNGG FSLLRWRNDI PAGNANALAE MLTGDGTYSR GYAEAPWPEI ASGTDGPTRP DSYPFEPNTI NTADWVYGNV FGGNPFNANV INQLDTLKRN RTLMTLPMYR FDNGSLSNPT YYLERFGAFL LVDYGVDTDG RAPDGPWMEL AYVREGGQCA NLVTGAPPTN NFTVGGSVTY TPRYQTYPDL NRPVQFLLIL DVSGSMSWTF DGRGVQNGQT VFCTNPSQGC VSVQTAWPNA QERRIYTAKQ VLRSFVAQID QDRQSGLRPY DTVRLVTFSG RLGSFVNSSG AVGDNNRALN DLTEVLPAGW TNDRATLEAA INSAGMVDGD PYMTAGATPS AVAFARASQV FANAPERAPN GMKYRRVVIF VTDGVANVLR NGMQNNYGEG CQLGAENVGC QMGDPLPDGS LRPLNAMVAE AQALKEAYIR PSDGSVYVVA LSGTFEATGL NLVASQPDYV KRADRSEELQ QIFDDIQVSA IQGDCTPSQG ELRDSMAPSE VPTDLRPELT DRIVGQVTLT DANNNVREAW ITADPITRKL SYAFTNVPPG QYTLRAWLGY RAPQDGISRD KYELLVRGPD VTTEITVQVS AGRSLGGVIG VPISLDLNDS VCPALS
|
| |