Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3942 |
Symbol | |
ID | 5735803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4940118 |
End bp | 4942613 |
Gene Length | 2496 bp |
Protein Length | 831 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641281093 |
Product | von Willebrand factor type A |
Protein accession | YP_001546704 |
Protein GI | 159900457 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00244396 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTTAA GTCGCATGAC GCGACGTGTA GCCCAACCAG TCGCAGGACA ATCATTAGTA TTCTCTGCTA TTTTGTTCTT CGTCATGATT GCGTTTGCTG CACTAGCCAT TGATACAGGC GAGGCGTTTA GCCGCCAACG CCAGCAACAA GCAGCTTCAA CTGCTGCTTC AATTGCCGGT TTGGAATCCA TGAATAGCGA AATCGACGGA ACTGATGGCG CTGTTCAACA GGCTATCCGC GATGCTTTGG CTGCCAACGG CATTACCAAT GCCGTTTATA TCAACGGCGA TTGGGGCAAC CTTGACCCAA GCCAAAACTA CTATCGGGCT TTCTATACCC AACGTGGCTC ACGGGTTGAA TATCCGGTTG GCAGCGGTGG TCAAGTTTCG ACCGAGTTCA ATGGCTTGCG GGTTGAAGTT CGTTCAGCAC GGAACACCAT CTTTGGCCAA GCCTTAGGTA TCGATACGCT CGAGGTTTCG GCTGAAAATA AAGCAACCTT GTGTCAATGT GCTACCAACA TCTTCCCGCT GGCGGTTGCG ACCCAAAAAA TGACGGGAAT GTTGCCAGGT GAAAGCAAAC CGATTGCTTG GACGAAAAAT AAAGTTGGCT CATCGGTCGA AAAAGACTAC TTCGTCTGGG CCGAATGGCA AACCTTGTCG GGCTTCAACG CCGATAATCG TTTGAAGGAA TCGTTGGGTG GTACTGGTGA TGTGATTAAA GGGGTCACCG AGGCTACCGC ACCTCCAGGC TATGCCAACG CTAGCAATAA CGGCGTGATC AACATCGACG ACTGGATCAA GGTTTCTGAT AGTACGTTGC GCCCAAATGC AGCAAGTTTG GCTACTGCCT TGACGGCCTT GAAGAACAAG CCAATCCTGA TTCCAACCTT TACCAAGTTG AGCTATTCGG GTACAACTGG CAACGAAAAA TATACCAGTT TCCGCTCAGG TGGCTTTGTC AAGGTCATGG TGACCAATTT TGACAGTACT GGGATTACCA TCCAATATCT CAACAGCAAC TATACCTGCC CATGTGTTGA TGTACCACAA CCACCACCAC CACCAGAAGT CAAGCTTGCC CTTGACCAAA AGTTGGTTTG GTATTTGCCA AGTATCAATA CCACGAGCTA CGATATTTCG TTAGTGGTTG ACATTTCAGG CTCGATGCAA TGGTGCTACG ATAGTCAACG GACTTGTAGC GTCGATGCCA ATGCCCGCTG GTATCGGGTT AAGGACTTCT TAGCCAAGTT CTCATATAAG ATGCTTGATG TTTGGAATGC TCCTGCTGGC CAAAATATGA ACAACGCTAG CTTGTTCCCT GGAGAAGCAT TGGTTGGTAA AGGTGGCGAT AACCGGATTG CTGCCGTGCG CTTCAGTGGT AATGCAGTTA CTAGCAGCCC ATCGTTTGGT TTTGTAACCA GCCCAGCTGG TAGCGACCAA GCCAGCGTGA GTGCTCGTAC CACCACGATG CGCTCTAACA TGAATTCCTT GATCAGCTGG ATTACTAAAG CCAATATGAG CGGTAGTACC TCAGGTGGTC GCGGTTTACG CGAAGGCATT CGCTACTTTG ATAATGTTAG CGCTCATACC CGGGTTGATC GTTTTGGTCG CCCAATCAAA TTGGTAATGG TGATGTTGAC CGACGGTTTG ACCAACGTGA TGTACGAAGG ACCTCAAGTT AACTCACAAA ATAGCCAAAA ATTGAAACAT ACCCGTTCTG GTGGCTATAA ATATTGTAAA GATCCTGCTG ATCCAACGGC TAATAACGTT TTGACCGTAG GTGGCGATAA TTATCCGATC ACCGACTTGC CTGAAGTTCA AGCCAACTGT CCATGGAATG GTCAAGGGGC AGGTAATGGT TATGCCAAGG CTCCAATTGT ATCGCTGGTT GAGGTTGCGA CCCAAGCCCG TAATCGGGTC TCGCCGCAGC GACCAGTCAA CATCTATGCA ATCTTGGTAG GTGATCAAGG TCGATACGAT TTACAAGACT TGCGGATTGA CCAAATTGCT TCACCTGGTG GCGCGTTCTA TGCGCAAAAC CCAAATGCCT TGGACTATGC ACTCGATGCG ATTCTGGATG ACCTAGCCCA GCCATGTTAC GAGCGCGATG CCACGATCAT TGCGGCAGGT GCTAAAGTTA CGGTCTACGA TAGCTCGAAT AACCCAGTTG CTGGGATGAG CAACATGTCG GCAGACACGA ATGGTACCTT ACTCTTTACC GTACCCGAGG CAGGCAACTA TAGTTTTGGG GCAACCCGTA GTGTAACAAG CTGGAGCGAA TTCCCATTAG GTAGTGGTGA AACGATTAAT CCAAGCTTGT ACCCTGCCAA CTACTTGCCA CAACTGTATA ATCGGTTACG CGGTGCTGAA GATACGATTG TTCAAAATCG CATCCAATTC AATATCCCCG CAGAGGCTGA TAGCTTGATT GATCTTGGTA AGTACACCAT GATCATTGCC GAAGCCCAAC AAAACAAAGC ACTCTGTCCA GAATAG
|
Protein sequence | MNLSRMTRRV AQPVAGQSLV FSAILFFVMI AFAALAIDTG EAFSRQRQQQ AASTAASIAG LESMNSEIDG TDGAVQQAIR DALAANGITN AVYINGDWGN LDPSQNYYRA FYTQRGSRVE YPVGSGGQVS TEFNGLRVEV RSARNTIFGQ ALGIDTLEVS AENKATLCQC ATNIFPLAVA TQKMTGMLPG ESKPIAWTKN KVGSSVEKDY FVWAEWQTLS GFNADNRLKE SLGGTGDVIK GVTEATAPPG YANASNNGVI NIDDWIKVSD STLRPNAASL ATALTALKNK PILIPTFTKL SYSGTTGNEK YTSFRSGGFV KVMVTNFDST GITIQYLNSN YTCPCVDVPQ PPPPPEVKLA LDQKLVWYLP SINTTSYDIS LVVDISGSMQ WCYDSQRTCS VDANARWYRV KDFLAKFSYK MLDVWNAPAG QNMNNASLFP GEALVGKGGD NRIAAVRFSG NAVTSSPSFG FVTSPAGSDQ ASVSARTTTM RSNMNSLISW ITKANMSGST SGGRGLREGI RYFDNVSAHT RVDRFGRPIK LVMVMLTDGL TNVMYEGPQV NSQNSQKLKH TRSGGYKYCK DPADPTANNV LTVGGDNYPI TDLPEVQANC PWNGQGAGNG YAKAPIVSLV EVATQARNRV SPQRPVNIYA ILVGDQGRYD LQDLRIDQIA SPGGAFYAQN PNALDYALDA ILDDLAQPCY ERDATIIAAG AKVTVYDSSN NPVAGMSNMS ADTNGTLLFT VPEAGNYSFG ATRSVTSWSE FPLGSGETIN PSLYPANYLP QLYNRLRGAE DTIVQNRIQF NIPAEADSLI DLGKYTMIIA EAQQNKALCP E
|
| |