Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0397 |
Symbol | |
ID | 5731965 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 465641 |
End bp | 467380 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641277520 |
Product | von Willebrand factor type A |
Protein accession | YP_001543176 |
Protein GI | 159896929 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1240] Mg-chelatase subunit ChlD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00126629 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGCGT TAACCTATCC CTCCCTGCCC TCGCCATTTG AAATGATTGG CCCGTGCGGC AAACAATCGA CCCATCGCCG TTTTTTTATG GCCAAGAAAG CAGGCATCCC ATACCTGATT TGGTGGGAAA GCCGCATTCC TGAGGCCTTG CGCAAACAAC AAATGGTTGT GCAAATTGCC AACCAAGCTG GTAGCGAAGT AGCTATCTTA GATGTGCAAG CCTTATCGTT GCAACAGCTA TTGGATACTG CTGGAAGTTT ACCAGCACCA GTTGCCGCCC AATATGCCTT GGGCTTGATC GAAACCGCTG AATTTAATCA TCGCCAACGA ACTGCCCTCA CAACTAGTAA TTTGTATGGG CCAAGCCTGC TTGATCTTGG TTCGCAAGGC ACGATTGTGC CAAAAATTGA TCCACGGGCA GGCCAACAAG CTGGGCCAGT TCCCGCCCAA CTCTTGCCAA TCAATGAACC AGCCAGCCCA CAAACTGATA GCTATTACAT TGGGGCATTG ATTGCCTGGA TGATTAGCGG GCAATTTGCG CCCCAGGGAA ATCCCTTGGC AATGCTGCCA GCAATCGATG GCAATTTACG GAGCATTTTG CAACGCGCCA CTGTTGCCAA CCCCAGCCAA CGCCCAAGCG CCCAAGCCCT TGGTCAGCAA ATTCAAGATT GGTTGTCGGG CAAAGTTACG CCAGTCGCCA AAAAGAAGCC CTCGCAATGG GCTTGGCTAG CAGGCGCGAT CACAACCGCT GCGATTATGT GGTTGGTGAT CTATGGTTTA TCCCGTCAAG ATAAAGTGAC CGATACCGAG TTTGGTCAGG CGCAAATTGA GCAAGGCACT GGTGATTTCC AGCCAGGTAC TTCTAGCAAC GGCTTAGCCA TGCTGGGAGA TATCGACGGA GTTGAAGTTA ATCGGATTGA TGATACTCGC CATCCCGAAA TCGATATGTA TTTGAGTATT ATGCGGCCCA CGGGGGTAGT CACCGACGTG CCGCGGCAAA ATGTCAAAGT CTTTGAAAAT AACAATCAGA TTGAAGGCTT TTCGTGGGTC AATCTCTCAC GCGTCCAAGA TCCATTAAAC ATTATGTTGG TGATCGATAC CAGTGGGAGC ATGGGGCCAA GCAAAGAGGG TTTAACCGAT GGTGGCCTCG ATGCCGCCAA AATTGCTGCG CTTGACTTTA TCGATCACCT GCCTTCCAAC GCCAATGTTG GCTTGATTCA CTTTGGGACG CTGGTAACGG TTGACCATTC GCTCACCAAC GATATTGGGG CAGTACGCCA AAGTATCAGC GAGCTTAAAC CCGAAGGTCA AACCGCAATT TACGATGCCT TGGCCATCAG CTATACCCAA CTGCGTCGTG CCAAAGGTCA AACCTTCATC GTCTTGATTT CCGATGGCGC GGATACCGCC AGCAAAGGCG ATAACTACGA CAGCATCGTA GCGAAGGCCA CAAAAGCCAA CATTCCCACC TACATCATTG GTCTCACCAG CCCAGAGTTT GATGGTCAAT TGTTGGAAGA TTTGCAACGC GATACCAAAG CCATGATTTA CCAAACCCCC TCCAAAGAAC AACTTGGTGG CTTTTATACT GAGGTCGCGC AAGAGGTTTC TGGCCAATAT CGTGCCAGCT TTAACATTCC TGATACCTAT AAAACTGGCG ATGAAATTAT GCTCAAGGTC GAGGTCAATG CTGGCGATGG CCTCTTAGTC ACCAAAGAAC GCAAGTATAT TCATCCCTAA
|
Protein sequence | MSALTYPSLP SPFEMIGPCG KQSTHRRFFM AKKAGIPYLI WWESRIPEAL RKQQMVVQIA NQAGSEVAIL DVQALSLQQL LDTAGSLPAP VAAQYALGLI ETAEFNHRQR TALTTSNLYG PSLLDLGSQG TIVPKIDPRA GQQAGPVPAQ LLPINEPASP QTDSYYIGAL IAWMISGQFA PQGNPLAMLP AIDGNLRSIL QRATVANPSQ RPSAQALGQQ IQDWLSGKVT PVAKKKPSQW AWLAGAITTA AIMWLVIYGL SRQDKVTDTE FGQAQIEQGT GDFQPGTSSN GLAMLGDIDG VEVNRIDDTR HPEIDMYLSI MRPTGVVTDV PRQNVKVFEN NNQIEGFSWV NLSRVQDPLN IMLVIDTSGS MGPSKEGLTD GGLDAAKIAA LDFIDHLPSN ANVGLIHFGT LVTVDHSLTN DIGAVRQSIS ELKPEGQTAI YDALAISYTQ LRRAKGQTFI VLISDGADTA SKGDNYDSIV AKATKANIPT YIIGLTSPEF DGQLLEDLQR DTKAMIYQTP SKEQLGGFYT EVAQEVSGQY RASFNIPDTY KTGDEIMLKV EVNAGDGLLV TKERKYIHP
|
| |