Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4213 |
Symbol | |
ID | 5736925 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5368374 |
End bp | 5369741 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281368 |
Product | von Willebrand factor type A |
Protein accession | YP_001546973 |
Protein GI | 159900726 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAGCT TAGATCTCAG GCTCACCCTT TCACGTAGCA GCGTGCCCGC AAACACTGGC GAACAAGTTT TGTATGTTTT AGCTGAGGTT GTTGGTCAAG GCAAAACCAA CCAGCGTGCG CCGGTGCATT GTGTGTTGTT GGTCGATTGT TCGCCCTCAA TGCGTGTGCC AATCGCTGAT GCGGCCTTGT TTCGCGAATT AATGCGCCGT GGTGCTGCCC ATGAGGTTAT GCTCGATGGC GTGCCAGTCT GGAAAATTCG CGATGATTTG ATGGATGAGG TGCGGCGACG CGCTCAAAGC CCAAGTCGTT GGCTGGGGCA GGCCGTGCGT GGCGCAGCTG AGCGCCTAAA TGCCGATGAT CAACTTTCGA TCATCGGCTT TGCTGAAAAA GCTCATGCAG TGCTCAAAGC CAAATCGCCA ACCGATGTGG TGCGCTTGGA AGGCTCAGTC GCCATTCTTG AACGCGGCGA TCTTGGCCGG AGCACCCGGC TGGCTCCAGG CTTGCGCAGC ACGATCGATC TGCTTGATAG CATGCCCAGC ACTGGTTTTT CACAACGGAT CGTTGTGCTA ACTGATGGCT TTGTTGAAGA TGAACAACAA GCCTTTGCCT ATGCTCGCCA TCTGGCTGCC CGCTCAATTC CGGTCTCGAC GATTGGCCTG GGGGTTGAGT TTCAAGAGCA ATTGCTGATG AGCTTTGCTG ATCAGAGCGG TGGTCAATCC AGTTTTATCA CCGACCCCAG CGACCTGCCT AATTTGCTTG ATGTGGAGTT TGGCAAGGCC CATGCGATTA TCGCTCAAAA GGCGCGGCTT GATTTGCGGC TCGCTCACGA TGTCCATATC AAACGAATTT TACGCATTAG GCCAGCCTTG GGCGAAGTAC CAATCCCTGA ATTTGAGGCT GGTTCGGGCA GTTTAAGTTT AGGCTCAATC GAACAACGGG CTAAAGCTGC TTGGTTACTC GAACTGCGCA TCCCTGATCA TGTGCCAAAT ATTTATCGTT TGTTGCGGCT TTCGTTGCAG GCCGACGATC CCCAAGGCCA AAGTTTGGAG CCAATTAACC GCGATGTGGT GGTCGAATAT CATCCGAATG CCGACCAGGC GCTCAACCCA GAGCTTGTAG CGATTTTGGA ACGGGTGACG GCATGGCGCT TGCAAAACAA AGCGCTGGAG CAAGCTGCTC AGGGTGATCA AGCTGGAGCA ACTCGCCAAT TGCAGGCCGC AGTCACCCGC TTGGTCGATT TGGGCGAGCA TGAATTAGCC AAACAAACTG CTGCGGCCAG CCAAACCTTG CAAGAAACTG GCCAAATTAA CCCTGAGCAA ACCAAAACCT TGCGCTATGC AACCAGAAGA TTGACAGAAG AACGATAA
|
Protein sequence | MTSLDLRLTL SRSSVPANTG EQVLYVLAEV VGQGKTNQRA PVHCVLLVDC SPSMRVPIAD AALFRELMRR GAAHEVMLDG VPVWKIRDDL MDEVRRRAQS PSRWLGQAVR GAAERLNADD QLSIIGFAEK AHAVLKAKSP TDVVRLEGSV AILERGDLGR STRLAPGLRS TIDLLDSMPS TGFSQRIVVL TDGFVEDEQQ AFAYARHLAA RSIPVSTIGL GVEFQEQLLM SFADQSGGQS SFITDPSDLP NLLDVEFGKA HAIIAQKARL DLRLAHDVHI KRILRIRPAL GEVPIPEFEA GSGSLSLGSI EQRAKAAWLL ELRIPDHVPN IYRLLRLSLQ ADDPQGQSLE PINRDVVVEY HPNADQALNP ELVAILERVT AWRLQNKALE QAAQGDQAGA TRQLQAAVTR LVDLGEHELA KQTAAASQTL QETGQINPEQ TKTLRYATRR LTEER
|
| |