Gene Haur_2819 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2819 
Symbol 
ID5734700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3582124 
End bp3586308 
Gene Length4185 bp 
Protein Length1394 aa 
Translation table11 
GC content51% 
IMG OID641279962 
ProductAAA ATPase containing von Willebrand factor type A (vWA) protein-like omain 
Protein accessionYP_001545585 
Protein GI159899338 
COG category[R] General function prediction only 
COG ID[COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0967207 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTCG ATCGGTCTCG GCGTAATTAT GGGCGACTTT TACTTGCCTT AATCGTGGTT 
CTTCACCTTG GTATGGTTCG CTCTGCCGCT GCTGCTGGAG AGACTCCCTA CGCTAATGCG
GTTGATTCGA GTACTAGTGG TCTAATTGTT AATGCTGCTA ATTCTGTTGG AGCACCAAAT
GGTACTGTAG CAACAGTGAT TGGTTTGCTC GGACAAAACC TGACCCTCGA TATGGGTGCT
AATGAAGAAG GTACTGGCGA TCTAGTGGTG CACTATGGCG GGATTGGCGT GCAGTTGGCA
GCGAACGTTC AGTTTCTGAA TGCTAGCCGC CAAGTGATTG CCTCGTCATC ATTAGATATT
GTTGGTTTAT CACTTGGTAC TATCTATACG ACTACAGTTG ATTATCCCCA AAGTCCTACT
CCCTATCGGT ACGTGCGCTT TGTTTCCTTA TTGACTGCAT ATCAAATTGA TGCAGTTGAA
GCAACAACAT TTCGCCCTGA TAGCGATAAT GATGGGGTGA ATGATGTTGA TGAAATTACC
AATGGCACTA ACCCCCTCGA TCCAGATAGT GATGATGATG GCTTGACCGA TGGTGAAGAA
ATTACCAATG GCACCAACCC CAACGATTCA GATAGCGATA ATGATGGCTT GCCCGATGGT
TGGGAAGTTG ATAACGGCTT GAATCCAAAC AATGGTACTG GTGCGACTGG TGCGACTGGT
GATCCCGATA ATGATGGCTC CAGTAATTTG GATGAATATC AGAATGGCAC CGATCCCAAT
GATCCTGATA GTGATGACGA TGGTTTGAAT GATGGTGCCG AAGATAGTGC GGGCACAAAT
CCATTGGATA ACGATAGCGA TAACGATGGG CTACCTGATG GCTGGGAAGT CAATCATGGC
CTAAATCCAT TGGATGCAAC AGGAAACAAT GGTGCGGCTG GTGACCCCGA TAACGATGGT
TCAAACAATG CTGCTGAATT TGCCGCTGGT ACGCATCCAA ATGATGCTGA TAGTGATAAT
GACGGTTTGA ATGATGGCGC TGAAGCTGCT TTGGGGACAA ATCCAAACAA TGCTGATAGC
GATGGTGATG GCTTGCCCGA TGGCTGGGAA GTCTCGAATA GTCTCAATCC ATTGAATGCT
ACTGGCAACC AGGGTGCTAC TGGTGACCCC GATAATGATG GCTCGACCAA CTTGCAAGAA
TATCAAAATA GCACCAACCC TCATGATGCC GACAGCGATA ATGACGGCTT GAACGATGGT
CAAGAAGCCG CTGCCGGAAC TGATCCAAAT GACAGTGACA GCGATAACGA TGGTTTGCCC
GATGGTTGGG AAGTAGCCAA CAGCCTTGAC CCATTGAGTA GTGTTGGTGA TGATGGAGCC
GCAGGCGATC CTGATAATGA CGGTTCAAAC AATGCCGCTG AATTTGCTAG CAATACTGAT
CCAAACGATG CCGACAGCGA TAATGACGGC TTGAATGATG GTCAAGAAAC CGCTGCTGGA
ACTGATCCAA ACGACAGTGA TACTGATAAT GATGGCTTGC CCGATGGCTG GGAAGTAGCC
AACGGCCTTG ACCCATTGAG CAGCGTTGGT GATGATGGAG CCGCTGGCGA TCCTGATGGT
GATGGCTCAA ACAACGCCGC TGAGTTTGCC GCAGGCACCA ATCCAAATGA TGCCGACAGC
GATAACGATG GCTTGAACGA TGGTCCAGAA GCCGCTGCCG GAACTGATCC AAATGATGCC
GACAGCGATA ACGATGGTTT GCCCGATGGT TGGGAAGTAG CCAACAGCCT TGACCCATTG
AGCAGCGTTG GTGATGATGG AGCCGCAGGC GATCCTGATA ATGACGGTTC AAACAATGCC
GCTGAATTTG CTAGCAATAC TGATCCAAAC GATGCCGACA GCGATAATGA CGGCTTGAAT
GATGGTCAAG AAGCCGCTGC CGGAACTGAT CCAAATGACA GCGATACTGA TAATGATGGC
TTGCCTGATG CATGGGAAGT CGCCAACAGC CTTGACCCAT TGAGCAGTGT TGGTGATGAT
GGCGCAACAG GTGATCCTGA TAATGACGGT TCAAACAACG CCGCTGAATT CGCCAACAGC
ACTAATCCGA ACGATACCGA CAGCGATAAT GACGGCTTGA ATGATGGTCA AGAAGCCGCT
GCTGGTACGA ATCCAAACGA CAGCGATACT GATAATGATG GCTTGCCAGA TGGTTGGGAA
GTGAGCAACG GGCTTGATCC GCTAAATCCA AACGATGCTG CTGGTGATCC AGATAATGAT
GGTTTGGATA ATAGCGCTGA ATTTGCCAAC AATACCAATC CCCAGGATTC TGACAGCGAT
AACGATGGCC TGAACGATGG TGCTGAAATT AGCGCGGGCA CAAATCCGAA TGATAGTGAT
AGCGATAACG ATGGTTTGCC CGATGGTTGG GAAGTCGTCA ACAGCCTCGA TCCATTGAGT
AGTGTTGGTG ATGATGGAGC CGCTGGCGAC CCTGATAACG ATGGCTCGAC CAACTTGCAA
GAATATCAAA ATGGCACTGA TCCCAACGAT GCTGATAGCG ATAACGATGG CCTGACTGAT
GGTCAAGAGG CTGGTTTGGG AACCAACCCC AATAATGCTG ATACCGATGG TGATGGCTTG
CCTGATGGCT GGGAAATCAG CAATAATCTT AATCCAACCA GTACAACCGA AGGCAATGGA
GCCAATGGTG ATCCTGACAA TGATGGCTCG ACCAACTTAC AAGAATATCA AAATGGCACG
AATCCCCAAG ATGCCGACAG CGATAACGAC GGTTTGAACG ATGGCCAGGA AGCCGCTGCT
GGAACAAATC CGAATGACAG TGATAGCGAT AATGATGGCT TGCCCGATGG TTGGGAAGTT
GCCAACAGCC TCGACCCATT GAGCAGTGTT GGTAATAATG GAGCCGCTGG CGATCCTGAT
AGTGATGGTT CGACCAACGC TACTGAATTT GCCAACAACA CCGATCCTCA AGATGCCGAC
AGTGATAACG ACGGTTTGAA CGATGGCCAG GAAGCCGCAG CTGGAACTGA TCCAAATGAT
AGTGATAGTG ATAATGATGG CTTGCCCGAT GGTTGGGAAG TTGTCAACAG CCTTGATCCA
TTGAGCTCAG TTGGTGATGA TGGAGCCGCT GGCGACCCTG ATAACGATGG TTTGAGTAAT
GCTGGTGAAT TTGCCAACAA CACCAATCCA AATGATAGCG ATAGTGATAA CGACAGCTTG
CCGGATGGCT GGGAAGTAAA TTATGGCTTA GATCCATTGA GTTCAGTTGG TGATGATGGT
GCTAGTGGTA ACCCTGATGG TGATAGCTAC GATAACGCAA CCGAGTTTGC TAATGGTACT
AGCCCAATCG TGTTTGATGC TCCTGCAGCA ACCAATACAC CCGAACCAAC CCTGACGAAT
ACGCCAGAAC CGACCGCAAC CAACACGCCA GAACCAACAG CGACCGATGT CCCGACTGCA
ACAGCGACCG ATGTCCCGAC TGCAACAGCG ACCGATGTCC CGACTGCAAC GGCAACGGAT
GTGCCAACGG CAACGGCGAC CAATACACCC GAACCAACCA TGACGAATAC GCCAGAGCCG
ACGGCGACTG ACGTGCCAAC GGCGACGGCA ACCGATGTGC CAACGGCAAC GGCGACCAAT
ACACCCGAAC CAACCATGAC GAATACGCCG GAGCCGACGG CGACTGACGT GCCAACGGCG
ACGGCAACGG ATGTGCCAAC GGCGACGGCA ACGGATGTGC CAACGGCGAC GGCGACCGAT
GTGCCAACGG CGACGGCGAC TGACGTGCCA ACGGCGACGG CGACTGATGT TCCGACCGCG
ACGGCAACCG ATGTGCCAAC GCCAACAGCG ACCAACACGC CAGAACCAAC GGCAACCAAT
GTGCCAACGG CAACGGCAAC CAATGTGCCA ACGGCAACGG CAACCAATGT GCCAACAGCG
ACGGCGACAG CAATTGCAAC CGCGACAGCA ACTGCGATTG CAACCGTAAC GAATACACCA
ATTCCGACGG TCACTGCGAC TGCGATTGCT ACGGCCACAG CGACCGCGAC GGCAACGGCA
ACGGCGACCC TAACGCCAAC GCTAACACCA ACCGCGACGG CTACAAGTAC GGTTCAGCCA
ACCCAAAACA AGATTTTCTT ACCAATGGCA ATGAAAGGCG AATAA
 
Protein sequence
MNFDRSRRNY GRLLLALIVV LHLGMVRSAA AAGETPYANA VDSSTSGLIV NAANSVGAPN 
GTVATVIGLL GQNLTLDMGA NEEGTGDLVV HYGGIGVQLA ANVQFLNASR QVIASSSLDI
VGLSLGTIYT TTVDYPQSPT PYRYVRFVSL LTAYQIDAVE ATTFRPDSDN DGVNDVDEIT
NGTNPLDPDS DDDGLTDGEE ITNGTNPNDS DSDNDGLPDG WEVDNGLNPN NGTGATGATG
DPDNDGSSNL DEYQNGTDPN DPDSDDDGLN DGAEDSAGTN PLDNDSDNDG LPDGWEVNHG
LNPLDATGNN GAAGDPDNDG SNNAAEFAAG THPNDADSDN DGLNDGAEAA LGTNPNNADS
DGDGLPDGWE VSNSLNPLNA TGNQGATGDP DNDGSTNLQE YQNSTNPHDA DSDNDGLNDG
QEAAAGTDPN DSDSDNDGLP DGWEVANSLD PLSSVGDDGA AGDPDNDGSN NAAEFASNTD
PNDADSDNDG LNDGQETAAG TDPNDSDTDN DGLPDGWEVA NGLDPLSSVG DDGAAGDPDG
DGSNNAAEFA AGTNPNDADS DNDGLNDGPE AAAGTDPNDA DSDNDGLPDG WEVANSLDPL
SSVGDDGAAG DPDNDGSNNA AEFASNTDPN DADSDNDGLN DGQEAAAGTD PNDSDTDNDG
LPDAWEVANS LDPLSSVGDD GATGDPDNDG SNNAAEFANS TNPNDTDSDN DGLNDGQEAA
AGTNPNDSDT DNDGLPDGWE VSNGLDPLNP NDAAGDPDND GLDNSAEFAN NTNPQDSDSD
NDGLNDGAEI SAGTNPNDSD SDNDGLPDGW EVVNSLDPLS SVGDDGAAGD PDNDGSTNLQ
EYQNGTDPND ADSDNDGLTD GQEAGLGTNP NNADTDGDGL PDGWEISNNL NPTSTTEGNG
ANGDPDNDGS TNLQEYQNGT NPQDADSDND GLNDGQEAAA GTNPNDSDSD NDGLPDGWEV
ANSLDPLSSV GNNGAAGDPD SDGSTNATEF ANNTDPQDAD SDNDGLNDGQ EAAAGTDPND
SDSDNDGLPD GWEVVNSLDP LSSVGDDGAA GDPDNDGLSN AGEFANNTNP NDSDSDNDSL
PDGWEVNYGL DPLSSVGDDG ASGNPDGDSY DNATEFANGT SPIVFDAPAA TNTPEPTLTN
TPEPTATNTP EPTATDVPTA TATDVPTATA TDVPTATATD VPTATATNTP EPTMTNTPEP
TATDVPTATA TDVPTATATN TPEPTMTNTP EPTATDVPTA TATDVPTATA TDVPTATATD
VPTATATDVP TATATDVPTA TATDVPTPTA TNTPEPTATN VPTATATNVP TATATNVPTA
TATAIATATA TAIATVTNTP IPTVTATAIA TATATATATA TATLTPTLTP TATATSTVQP
TQNKIFLPMA MKGE