Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4211 |
Symbol | |
ID | 5736923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5365387 |
End bp | 5366634 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281366 |
Product | von Willebrand factor type A |
Protein accession | YP_001546971 |
Protein GI | 159900724 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.290242 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGAAC CTGTAGCACT TTCAGCAGTT TGGAGCCGCG AACCTTTGCC AAGCGGCACC AGCCAAGTTA ATTATGTCTT GATTCAGGCC AAACCACATC ATGTGCCGAC TGTCCAAGCG GCTCCGCCAC TCAACTTTTG TTTGGTGCTT GATCGCTCTG GTTCGATGGC TGGCGATAAA ATTCAACATT TGCGCGAAGC TGTGCGTGAA ATTGTGGCCA ACTTACGTCC AATCGATGCC GTGAGCATTG TGTTGTTCGA TGATACCTTG GAAGTTCTCG TGCCAGCCCG TTTGGCCGAC GATCTCCCAG CCTTGCAAAA TGCGATCGAA TCAATCGGCG AGCAAGGTGG CACGGCCATG TCGTTGGGCT TGCAAGCAGG CCTTGCCGAA TTGCAAAAAT TCCAGGCCGC CGATCGAGTT GGCCGCGTGC TGCTTTTGAC CGACGGCCAA ACCTGGGGCG ATGAAGATAC CTGCCGCGAT TTAGCCAAAC AAATTGGCGA TTTAGGCGTT TCGATCACAG CACTGGGCTT GGGCACTGAA TGGAACGAGG CCTTGCTCGA CGATTTGGCT ACCGCATCCA ACGGCGAATC GGATTATATT GCCGACCCCA GCCAAATTAG CAAATATTTC CAACAAACCT TGCAAAGCGC CCAAACTACC ACCGTGGTCA ATGCGCGGTT GCTGTTGCGT TTGCTGCCTG GAGTTACCCC ACGCGCAGTT TATCGCGTCC AGCCAACGAT CGCCAACCTT GGCTACAAGC CGATTGGTGA ACGCGAAGTC ACGGTCAGCA TTGGCGAAAT TGCTGGCGAT GGAGCCAGTG TTTTGGTCGA TGTGATGCTG CCAGAGCGTG AAGCGGGTAC GTTCCGCATC GCCCAAGCTG AATTGCAATA CGATGCCCCA GTGCTTGGTA TCAAAGAAGG CAAAATTAAA ATTGACATTC CTTTGAGCTT TAACGTCGAT CCCAAGGCCA GCGTGGTCAA TCCGCCAATT ATGAACACGG TCGAAAAAGT GACCGCCTTC AAATTGCAAA CACGGGCACT TTCCGAGGCC GAGGCCGGCA ATATTGGCAG CGCAACCCAA AAATTACGCG CCGCCGCCAC CCGTTTGCTC GATTTAGGCG AAACTGAGTT GGCCCAAACC ATGGAACAAA GTGCCCAACA ACTTGAGGCT GGTGGTCAAA TCGCAGCCGC TGATCAAAAA GCCCTGCGCT ACGCCACCCG CAAACTAACC CAAAAATTAG AAGAGTAA
|
Protein sequence | MTEPVALSAV WSREPLPSGT SQVNYVLIQA KPHHVPTVQA APPLNFCLVL DRSGSMAGDK IQHLREAVRE IVANLRPIDA VSIVLFDDTL EVLVPARLAD DLPALQNAIE SIGEQGGTAM SLGLQAGLAE LQKFQAADRV GRVLLLTDGQ TWGDEDTCRD LAKQIGDLGV SITALGLGTE WNEALLDDLA TASNGESDYI ADPSQISKYF QQTLQSAQTT TVVNARLLLR LLPGVTPRAV YRVQPTIANL GYKPIGEREV TVSIGEIAGD GASVLVDVML PEREAGTFRI AQAELQYDAP VLGIKEGKIK IDIPLSFNVD PKASVVNPPI MNTVEKVTAF KLQTRALSEA EAGNIGSATQ KLRAAATRLL DLGETELAQT MEQSAQQLEA GGQIAAADQK ALRYATRKLT QKLEE
|
| |