Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2819 |
Symbol | |
ID | 5734700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3582124 |
End bp | 3586308 |
Gene Length | 4185 bp |
Protein Length | 1394 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279962 |
Product | AAA ATPase containing von Willebrand factor type A (vWA) protein-like omain |
Protein accession | YP_001545585 |
Protein GI | 159899338 |
COG category | [R] General function prediction only |
COG ID | [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0967207 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTCG ATCGGTCTCG GCGTAATTAT GGGCGACTTT TACTTGCCTT AATCGTGGTT CTTCACCTTG GTATGGTTCG CTCTGCCGCT GCTGCTGGAG AGACTCCCTA CGCTAATGCG GTTGATTCGA GTACTAGTGG TCTAATTGTT AATGCTGCTA ATTCTGTTGG AGCACCAAAT GGTACTGTAG CAACAGTGAT TGGTTTGCTC GGACAAAACC TGACCCTCGA TATGGGTGCT AATGAAGAAG GTACTGGCGA TCTAGTGGTG CACTATGGCG GGATTGGCGT GCAGTTGGCA GCGAACGTTC AGTTTCTGAA TGCTAGCCGC CAAGTGATTG CCTCGTCATC ATTAGATATT GTTGGTTTAT CACTTGGTAC TATCTATACG ACTACAGTTG ATTATCCCCA AAGTCCTACT CCCTATCGGT ACGTGCGCTT TGTTTCCTTA TTGACTGCAT ATCAAATTGA TGCAGTTGAA GCAACAACAT TTCGCCCTGA TAGCGATAAT GATGGGGTGA ATGATGTTGA TGAAATTACC AATGGCACTA ACCCCCTCGA TCCAGATAGT GATGATGATG GCTTGACCGA TGGTGAAGAA ATTACCAATG GCACCAACCC CAACGATTCA GATAGCGATA ATGATGGCTT GCCCGATGGT TGGGAAGTTG ATAACGGCTT GAATCCAAAC AATGGTACTG GTGCGACTGG TGCGACTGGT GATCCCGATA ATGATGGCTC CAGTAATTTG GATGAATATC AGAATGGCAC CGATCCCAAT GATCCTGATA GTGATGACGA TGGTTTGAAT GATGGTGCCG AAGATAGTGC GGGCACAAAT CCATTGGATA ACGATAGCGA TAACGATGGG CTACCTGATG GCTGGGAAGT CAATCATGGC CTAAATCCAT TGGATGCAAC AGGAAACAAT GGTGCGGCTG GTGACCCCGA TAACGATGGT TCAAACAATG CTGCTGAATT TGCCGCTGGT ACGCATCCAA ATGATGCTGA TAGTGATAAT GACGGTTTGA ATGATGGCGC TGAAGCTGCT TTGGGGACAA ATCCAAACAA TGCTGATAGC GATGGTGATG GCTTGCCCGA TGGCTGGGAA GTCTCGAATA GTCTCAATCC ATTGAATGCT ACTGGCAACC AGGGTGCTAC TGGTGACCCC GATAATGATG GCTCGACCAA CTTGCAAGAA TATCAAAATA GCACCAACCC TCATGATGCC GACAGCGATA ATGACGGCTT GAACGATGGT CAAGAAGCCG CTGCCGGAAC TGATCCAAAT GACAGTGACA GCGATAACGA TGGTTTGCCC GATGGTTGGG AAGTAGCCAA CAGCCTTGAC CCATTGAGTA GTGTTGGTGA TGATGGAGCC GCAGGCGATC CTGATAATGA CGGTTCAAAC AATGCCGCTG AATTTGCTAG CAATACTGAT CCAAACGATG CCGACAGCGA TAATGACGGC TTGAATGATG GTCAAGAAAC CGCTGCTGGA ACTGATCCAA ACGACAGTGA TACTGATAAT GATGGCTTGC CCGATGGCTG GGAAGTAGCC AACGGCCTTG ACCCATTGAG CAGCGTTGGT GATGATGGAG CCGCTGGCGA TCCTGATGGT GATGGCTCAA ACAACGCCGC TGAGTTTGCC GCAGGCACCA ATCCAAATGA TGCCGACAGC GATAACGATG GCTTGAACGA TGGTCCAGAA GCCGCTGCCG GAACTGATCC AAATGATGCC GACAGCGATA ACGATGGTTT GCCCGATGGT TGGGAAGTAG CCAACAGCCT TGACCCATTG AGCAGCGTTG GTGATGATGG AGCCGCAGGC GATCCTGATA ATGACGGTTC AAACAATGCC GCTGAATTTG CTAGCAATAC TGATCCAAAC GATGCCGACA GCGATAATGA CGGCTTGAAT GATGGTCAAG AAGCCGCTGC CGGAACTGAT CCAAATGACA GCGATACTGA TAATGATGGC TTGCCTGATG CATGGGAAGT CGCCAACAGC CTTGACCCAT TGAGCAGTGT TGGTGATGAT GGCGCAACAG GTGATCCTGA TAATGACGGT TCAAACAACG CCGCTGAATT CGCCAACAGC ACTAATCCGA ACGATACCGA CAGCGATAAT GACGGCTTGA ATGATGGTCA AGAAGCCGCT GCTGGTACGA ATCCAAACGA CAGCGATACT GATAATGATG GCTTGCCAGA TGGTTGGGAA GTGAGCAACG GGCTTGATCC GCTAAATCCA AACGATGCTG CTGGTGATCC AGATAATGAT GGTTTGGATA ATAGCGCTGA ATTTGCCAAC AATACCAATC CCCAGGATTC TGACAGCGAT AACGATGGCC TGAACGATGG TGCTGAAATT AGCGCGGGCA CAAATCCGAA TGATAGTGAT AGCGATAACG ATGGTTTGCC CGATGGTTGG GAAGTCGTCA ACAGCCTCGA TCCATTGAGT AGTGTTGGTG ATGATGGAGC CGCTGGCGAC CCTGATAACG ATGGCTCGAC CAACTTGCAA GAATATCAAA ATGGCACTGA TCCCAACGAT GCTGATAGCG ATAACGATGG CCTGACTGAT GGTCAAGAGG CTGGTTTGGG AACCAACCCC AATAATGCTG ATACCGATGG TGATGGCTTG CCTGATGGCT GGGAAATCAG CAATAATCTT AATCCAACCA GTACAACCGA AGGCAATGGA GCCAATGGTG ATCCTGACAA TGATGGCTCG ACCAACTTAC AAGAATATCA AAATGGCACG AATCCCCAAG ATGCCGACAG CGATAACGAC GGTTTGAACG ATGGCCAGGA AGCCGCTGCT GGAACAAATC CGAATGACAG TGATAGCGAT AATGATGGCT TGCCCGATGG TTGGGAAGTT GCCAACAGCC TCGACCCATT GAGCAGTGTT GGTAATAATG GAGCCGCTGG CGATCCTGAT AGTGATGGTT CGACCAACGC TACTGAATTT GCCAACAACA CCGATCCTCA AGATGCCGAC AGTGATAACG ACGGTTTGAA CGATGGCCAG GAAGCCGCAG CTGGAACTGA TCCAAATGAT AGTGATAGTG ATAATGATGG CTTGCCCGAT GGTTGGGAAG TTGTCAACAG CCTTGATCCA TTGAGCTCAG TTGGTGATGA TGGAGCCGCT GGCGACCCTG ATAACGATGG TTTGAGTAAT GCTGGTGAAT TTGCCAACAA CACCAATCCA AATGATAGCG ATAGTGATAA CGACAGCTTG CCGGATGGCT GGGAAGTAAA TTATGGCTTA GATCCATTGA GTTCAGTTGG TGATGATGGT GCTAGTGGTA ACCCTGATGG TGATAGCTAC GATAACGCAA CCGAGTTTGC TAATGGTACT AGCCCAATCG TGTTTGATGC TCCTGCAGCA ACCAATACAC CCGAACCAAC CCTGACGAAT ACGCCAGAAC CGACCGCAAC CAACACGCCA GAACCAACAG CGACCGATGT CCCGACTGCA ACAGCGACCG ATGTCCCGAC TGCAACAGCG ACCGATGTCC CGACTGCAAC GGCAACGGAT GTGCCAACGG CAACGGCGAC CAATACACCC GAACCAACCA TGACGAATAC GCCAGAGCCG ACGGCGACTG ACGTGCCAAC GGCGACGGCA ACCGATGTGC CAACGGCAAC GGCGACCAAT ACACCCGAAC CAACCATGAC GAATACGCCG GAGCCGACGG CGACTGACGT GCCAACGGCG ACGGCAACGG ATGTGCCAAC GGCGACGGCA ACGGATGTGC CAACGGCGAC GGCGACCGAT GTGCCAACGG CGACGGCGAC TGACGTGCCA ACGGCGACGG CGACTGATGT TCCGACCGCG ACGGCAACCG ATGTGCCAAC GCCAACAGCG ACCAACACGC CAGAACCAAC GGCAACCAAT GTGCCAACGG CAACGGCAAC CAATGTGCCA ACGGCAACGG CAACCAATGT GCCAACAGCG ACGGCGACAG CAATTGCAAC CGCGACAGCA ACTGCGATTG CAACCGTAAC GAATACACCA ATTCCGACGG TCACTGCGAC TGCGATTGCT ACGGCCACAG CGACCGCGAC GGCAACGGCA ACGGCGACCC TAACGCCAAC GCTAACACCA ACCGCGACGG CTACAAGTAC GGTTCAGCCA ACCCAAAACA AGATTTTCTT ACCAATGGCA ATGAAAGGCG AATAA
|
Protein sequence | MNFDRSRRNY GRLLLALIVV LHLGMVRSAA AAGETPYANA VDSSTSGLIV NAANSVGAPN GTVATVIGLL GQNLTLDMGA NEEGTGDLVV HYGGIGVQLA ANVQFLNASR QVIASSSLDI VGLSLGTIYT TTVDYPQSPT PYRYVRFVSL LTAYQIDAVE ATTFRPDSDN DGVNDVDEIT NGTNPLDPDS DDDGLTDGEE ITNGTNPNDS DSDNDGLPDG WEVDNGLNPN NGTGATGATG DPDNDGSSNL DEYQNGTDPN DPDSDDDGLN DGAEDSAGTN PLDNDSDNDG LPDGWEVNHG LNPLDATGNN GAAGDPDNDG SNNAAEFAAG THPNDADSDN DGLNDGAEAA LGTNPNNADS DGDGLPDGWE VSNSLNPLNA TGNQGATGDP DNDGSTNLQE YQNSTNPHDA DSDNDGLNDG QEAAAGTDPN DSDSDNDGLP DGWEVANSLD PLSSVGDDGA AGDPDNDGSN NAAEFASNTD PNDADSDNDG LNDGQETAAG TDPNDSDTDN DGLPDGWEVA NGLDPLSSVG DDGAAGDPDG DGSNNAAEFA AGTNPNDADS DNDGLNDGPE AAAGTDPNDA DSDNDGLPDG WEVANSLDPL SSVGDDGAAG DPDNDGSNNA AEFASNTDPN DADSDNDGLN DGQEAAAGTD PNDSDTDNDG LPDAWEVANS LDPLSSVGDD GATGDPDNDG SNNAAEFANS TNPNDTDSDN DGLNDGQEAA AGTNPNDSDT DNDGLPDGWE VSNGLDPLNP NDAAGDPDND GLDNSAEFAN NTNPQDSDSD NDGLNDGAEI SAGTNPNDSD SDNDGLPDGW EVVNSLDPLS SVGDDGAAGD PDNDGSTNLQ EYQNGTDPND ADSDNDGLTD GQEAGLGTNP NNADTDGDGL PDGWEISNNL NPTSTTEGNG ANGDPDNDGS TNLQEYQNGT NPQDADSDND GLNDGQEAAA GTNPNDSDSD NDGLPDGWEV ANSLDPLSSV GNNGAAGDPD SDGSTNATEF ANNTDPQDAD SDNDGLNDGQ EAAAGTDPND SDSDNDGLPD GWEVVNSLDP LSSVGDDGAA GDPDNDGLSN AGEFANNTNP NDSDSDNDSL PDGWEVNYGL DPLSSVGDDG ASGNPDGDSY DNATEFANGT SPIVFDAPAA TNTPEPTLTN TPEPTATNTP EPTATDVPTA TATDVPTATA TDVPTATATD VPTATATNTP EPTMTNTPEP TATDVPTATA TDVPTATATN TPEPTMTNTP EPTATDVPTA TATDVPTATA TDVPTATATD VPTATATDVP TATATDVPTA TATDVPTPTA TNTPEPTATN VPTATATNVP TATATNVPTA TATAIATATA TAIATVTNTP IPTVTATAIA TATATATATA TATLTPTLTP TATATSTVQP TQNKIFLPMA MKGE
|
| |