Gene Information |
Locus tag | Haur_4210 |
Symbol | |
ID | 5736922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5363894 |
End bp | 5365159 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281365 |
Product | von Willebrand factor type A |
Protein accession | YP_001546970 |
Protein GI | 159900723 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
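
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the source record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single three-nucleotide stop codon (the listed gene sequence ends in TAG). The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 5363894
end_bp = 5365159

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon,
# so every codon except the last encodes one residue.
stop_codon_nt = 3
protein_length = (gene_length - stop_codon_nt) // 3

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (1266 bp) and Protein Length (421 aa) fields listed in the table.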
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.466753 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGGTG AAGTTCAATT AACTGGTACG TTGGCTCGAC CGGCGTTGCC AGCCTTGCAA ACCCAGCAGG TTGTTTATTT ACTGCTGGAT ATTACGGCAA CACCAGCGGT TGCTCACGTT CAAATGCCAG TTAATGTGAG CTTCGTGCTC GATCATAGTG GCTCGATGAA GGGCGACAAA ATGCGCTGTG TGCGCGAAGC TACCCAACGC GCTCTGGGCT TGATGGGGCC GCAAGATATT GTTTCGGTGG TGATTTTCGA CCATCGCCGC GAAACGATTA TCAGTGCTCA GCCTGTTCGC AACGTTGCTG CCTTACAAGC TGAAGTTGGT AAAATCAAAG ATGCAGGTGG TACAAAAATC GCACCTGCGC TCGAAGCTGC CTTGAATGAA ATTCGCCGTA GCCAAAATGC CAATACGATC AGCCGCATTA TTTTGCTGAC CGACGGTCAA ACCGAGGGCG AACGCGATTG TTTGCGCTTG GCCGAGGAAA TTGGCAAAGC TAGTGTGCCA TTGACGGCAC TGGGCGTTGG CGACGATTGG AACGAAGATC TGTTGATCGA AATGGCGAAT CGCTCAGGTG GCGTTGCCGA ATATTTCAGC AATCCCAACG ATATCGCCTC GTTCTTCCAA GGTGCGGTGC AGCAAGCCCA ATCGGCGGTG GTGCAAAACT CAGCCTTGAC CTTGCGCTTT GTGCAGGGAG TTGAGCCACG CGCCCTTTGG CAAGTAACCC CATTAATTCA ACAATTGCCC TATCGGCCAA TTAGCGATCG GGCGGTTGGC GTGAGCCTCG GCGATATTTC CAAAGACGAA CATCGGATGG TGCTAATCGA AATGCTGGTT GATCCCAAGC AGGCGGGCCA ATATCGGCTG GGCCAAATCG AGGTCAACTA CGATATTCCT CAAATGCAGG TAGTTGGCGA AAAAGCTCGC TACGATGTCA TGTTGAATTT TGTGGCTGAT CCGGCTCAGG CAACCGGAGT TGTGCCCCAA GTGATGAATA TTGTTGAAAA GGTCAGCGCC CACAAGCTGC AAACTCGGGC CTTAGAAGAT TTGGCCGAGG GCAATATTGG TGCAGCGACC CAAAAGCTTC AAGGTGCTGT GACCCGCTTG CTCAACCAAG GCGAAACCGA GCTAGCCCAA ACCATGCAAC AAGAGATCGA AAATCTACAA ACCAATGGGC AAATGACCTC AGCTGGTCAA AAAACCATCA AATTTGGTAC CCGCAAAACC GTGCGGCTCA GCGATTTGGA TCTACCAAAA AGTTAG
|
Protein sequence | MAGEVQLTGT LARPALPALQ TQQVVYLLLD ITATPAVAHV QMPVNVSFVL DHSGSMKGDK MRCVREATQR ALGLMGPQDI VSVVIFDHRR ETIISAQPVR NVAALQAEVG KIKDAGGTKI APALEAALNE IRRSQNANTI SRIILLTDGQ TEGERDCLRL AEEIGKASVP LTALGVGDDW NEDLLIEMAN RSGGVAEYFS NPNDIASFFQ GAVQQAQSAV VQNSALTLRF VQGVEPRALW QVTPLIQQLP YRPISDRAVG VSLGDISKDE HRMVLIEMLV DPKQAGQYRL GQIEVNYDIP QMQVVGEKAR YDVMLNFVAD PAQATGVVPQ VMNIVEKVSA HKLQTRALED LAEGNIGAAT QKLQGAVTRL LNQGETELAQ TMQQEIENLQ TNGQMTSAGQ KTIKFGTRKT VRLSDLDLPK S
|
| |