Gene Information |
Locus tag | Haur_1116 |
Symbol | |
ID | 5733008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1278362 |
End bp | 1280050 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641278255 |
Product | von Willebrand factor type A |
Protein accession | YP_001543892 |
Protein GI | 159897645 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTTTC GATTTATACT ACGCTTATTA TTGCTCTTCA TATTTTTGAG CGCCTGTGGT CAGGCGGCTA ACCCAATGCA ACCAAGCCAA TCCGGCCAGG CGAGCAGTGA TGATCTGGTG TTGCGGTTGC TCTATGGCAG CGAAAAGCAA CTCTGGATTG ATGCGGTAGT GAGCGATTTT AACGCCAGCT CAGCTAAAAC CGCTAGCGGC CAACGCATTC AGGTCGAAGC AGTGCCCGTT GGCTCGCTCG AAACGATCAA CGGCTTACTT GATGGCTCGC AACAGGCCGA TTTGTGGAGC CCAGCGAGCA GTTTATCCTT GCCATTAGCC AATCAACGTT GGAAGACGGC TAAAGGCGAA TTGCTGTTTA GCGATCAAAC TCCCACCTCG TTGGTGCTTA GCCCAGTTGT GATTGGCATG TGGAAGCCGA TGGCTCAAGC CTTGGGCTAC CCCGATAAGC AAATTGGCTG GGCGGATTTG GCCGATTTGG CGACCAGTGG CAAAACTTGG GCCGATTTTG GTCATCCAGA GTGGGGTGCA TTTCAGTTTG GCCACACGCA CCCTGAGTTC TCCAATAGTG GGCTTGCCAC GATTGTGGCG ATGGCCTATG CTGCCAATCA AAAAACCAGC GATCTGACTG TCGCTGATTT GGATAAACCC GAAACTGCCA GTTTGATCAA TGCTGTCGAG CAATCAGTGA TTCACTATGG CTCAAGCACA GGCTTTTTTG CCAAAACCAT GTATGAGCAT GGCCCTTCGT ATTTGTCGGC GGCAATTTTG TATGAAAATC TCATCATCGA GTCGTATGAT CAAGCGTTGT ATCCCAACCT TGAGTTGCCG ATGGTGGCGA TTTACCCCAA GGAAGGCTCA TTTTGGAGCG ACCATCCCTT GGTGGTGCTG GAAACCGAGC GCATGAATGC CGACAAACGG GCTGCAGCGC AAGTATTTCA AGAGTTTTTG CTGGCTCAGC CTCAACAAGC CAAGGCCATG CAATATGGTT TTCGGCCAGC CAATGTTGAT ATTAGCCTCG CTGCGCCAAT TGATACGGCG CATGGCGTTG ACCCAAGCCA ATTGCAAGTC GCCTTGCCAA CGCCTTCGGC AGAGGTTTTG CAGGCCATAA CTCAATTGTG GCAGCAGCAC AAAAAGCAAG TTGATGTAGC GTTGATTATT GATACTTCTG GCTCAATGCG TCAAGAAAAC CGTTTGCGCG AAGCCAAAAC GGCGCTTGGC GATTTTATCG ATATCTTTGC CGATCAAGAT AATGTGCAAG TGACGATTTT TAGCACCAAT GCAACCGAGC TTTCCGATCT CTCGCCGATT GGCCCCAAAC GGGCCGATTT GCATACTCGC ATCGATGGAT TGGTGGCCGA TGGCGAAACT CGTTTGTACA GCACAATTGG CGAAGTCTAT ACCGATATTC AGCAACAAAC TGAAGTGCAG CGGATTCGCG CATTGGTGGT GTTGACTGAT GGCGAAGATA CGGCTAGCTC ATTGAGTTTA GAGCAATTGA ATGAACAAAT TCGCCAAGAT GAATCTGGCA CGTCGATTAA AATTTTCACG ATTGCCTATG GCTCTGATGC CAATCAAGAG GTTTTGCAAC GAATTGCCGA AATCACTGGA GCCAAATCAT ATACTGGCGA TCCGGCGACA ATTCGTCAGG TTTATCATGA AATTGCTACA TTTTTCTAG
|
Protein sequence | MRFRFILRLL LLFIFLSACG QAANPMQPSQ SGQASSDDLV LRLLYGSEKQ LWIDAVVSDF NASSAKTASG QRIQVEAVPV GSLETINGLL DGSQQADLWS PASSLSLPLA NQRWKTAKGE LLFSDQTPTS LVLSPVVIGM WKPMAQALGY PDKQIGWADL ADLATSGKTW ADFGHPEWGA FQFGHTHPEF SNSGLATIVA MAYAANQKTS DLTVADLDKP ETASLINAVE QSVIHYGSST GFFAKTMYEH GPSYLSAAIL YENLIIESYD QALYPNLELP MVAIYPKEGS FWSDHPLVVL ETERMNADKR AAAQVFQEFL LAQPQQAKAM QYGFRPANVD ISLAAPIDTA HGVDPSQLQV ALPTPSAEVL QAITQLWQQH KKQVDVALII DTSGSMRQEN RLREAKTALG DFIDIFADQD NVQVTIFSTN ATELSDLSPI GPKRADLHTR IDGLVADGET RLYSTIGEVY TDIQQQTEVQ RIRALVVLTD GEDTASSLSL EQLNEQIRQD ESGTSIKIFT IAYGSDANQE VLQRIAEITG AKSYTGDPAT IRQVYHEIAT FF
|
| |
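The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values copied from the record above (Haur_1116).
start_bp, end_bp = 1278362, 1280050
length = gene_length(start_bp, end_bp)
print(length)                  # 1689, matches "Gene Length"
print(protein_length(length))  # 562, matches "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (1689 bp) and Protein Length (562 aa) fields, and the listed gene sequence ends in TAG, one of the stop codons in translation table 11.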