Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4186 |
Symbol | |
ID | 5736048 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5338761 |
End bp | 5341247 |
Gene Length | 2487 bp |
Protein Length | 828 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641281341 |
Product | von Willebrand factor type A |
Protein accession | YP_001546946 |
Protein GI | 159900699 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1240] Mg-chelatase subunit ChlD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATATTTT TGGGATTATT CCTGGTTATA GGATTGAGTG GCTGGTGGAT TGCCCGCAAA ACAACCCTCG ATTGGCGGGT AGTTGGGCTG CGCTTGACGA GCTTAGCCTG TATTTTGCTG GCACTGGCCT TACCGCGCAA CCAAGCTAAT CAACAAGCTA GCCCTTTGAT TTTGCTGGTT GATCAATCGG CCAACTTGCC TAGCGAGTTG CGTGATGCCG CTTGGAATGA GGCTGTGCGC TTCTATCAAC AACAAATCGA GCAACGTCCA GTGCGCTTAT TGGCCTTTGG GGCCGATGTG CGGGTGAGCC AAACCGACCA ACGTCCAGCG ATTGACCCCA ATGGCAGCGA TTTGGCGGGA GCATTGCAAT TTGCTAGTGG TTTGTTGCCG CAAGGTGGCG ATATTATTCT GCTCAGCGAT GGTGCTAGCA CTACCACCAA TGGGCAAAAT CAGGTTAGTA CATTTGCGCA GCGCTCAATT CGTTTGCATG GCGTGCCCAT CAGCTACCCC GAAACCGATA TTCGGGTGGA ATCGCTGCTT GTGCCGCCAG CTTTGCGCGA AGGCGAGCGA TTTAGTGCCG ATGTGGTGCT CTATTCGAGT GTTGATGGTC AAGTGCGCCT CGAATTGAGT AGCGACGGCG TGGGCTTGGC CGGTCAAACA ATTAATGTTG AACAAGGCCG CAATCTGGTT TCATTTCAAT CGACCGCTGG TGCTCGCGGC TTCCATCGTT TTCAGGCAAC ATTGCTAGCC ACCAACGATC AACAACCTGC CAACAATCAA CTTGATGCCT GGACGGTGGT TGGGCCACCG CCGCGAGTAT TGATCATCGA ACGCTCACCA GATAGCTCAG CCAACTTGCG CGATGCCTTA GAAGCTGCTA ATTTGGTGAC CGAAGCCTTA CGCCCTGCCG CCTTGCCGAC CAGCCTCAGC CAACTCAGGG TCTACGATTC AATCGTGCTC CAAGATATTT CTGCCAACGA TTTAAGCCTT GATCAGCAAT TGGCCTTGCG TGAATTTGTG CGCAGCCTTG GCCATGGTGT GGTTGTATTA GGTGGAACCA ATAGCTATAA CTTGGGCAGT TATGCTGGCA CGCCGCTCGA AGAATTGTTG CCAGTTTCAA TGGAGCCGCC GCCCCGCCGT GAGCGCCCAA CCGTCACTCT GCTGCTGATT CTGGATCGCT CGGCAAGTAT GTTGGGCGAG TCGGGCAAAG ATAAATTTAG CCTTGCCAAA GCTGCCGCGA TTGCCGCAAC CGATTCTTTG GGAGCCGATG ATACGATTGG CGTGCTGGCA TTCGATGATA CCAACGATTG GACAGTGACC TTTACCAAGG TTGGTCAAGG TGTGCAACTA AGCGAAATTC AAAATAATAT CGCTGGCTTG AGTGCTGGCG GTGGAACTGA TATTTATGCC GCTTTGGAAG TTGGGATGGG CGGTCTGGCT CAACAAACTG GCAAAGTGCG TCATGCCGTG CTGTTGACAG ATGGACGTTC TGGCGGCGAA AGCTCCTATG AATCGCTGAT CGCTCCGTTA CGTGCCCAAG GCATTACGCT TTCGACAATT GCGATCGGCG GCGATGCTGA TACCGTGCTG CTCGAATCGT TGGCCAAATT GGGTGCGGGA CGCTATCATT TTGCCTCTAG ACCCGATGAT TTGCCGCGAT TGACCTTGCA AGAAGCCGAA ATTGCCCGCG AAAATCCATT AACTGAGGGC CAATTTCAGG CTAATCTTGC TACGCCGCAC CCCGCGATTC GTGGCCTGAA CCTCGGCGAA ATCCCGCCGT TTGGTGGTTA TGTCGCGGTT ACGCCCAAAC CTGAAGCTGA GCAATTATTG ACCACTACCG AAGGCGATAT TTTGCTGGCA ACTTGGCAAT ATGGGCTTGG TCGCGCCACT GCTTTTACCT CGGATAGCGG CGAACGTTGG ACTGCCACAT GGCGACCTTG GCCAAATTGG GGCAATACCC TGGCGCAAAT TATCGCCGCA ACTTACCCCA ACCCCGCCCG GGGCGACCTC CGAGTCAGCA GCGAATTGCA ACAGAATCAA GCAATTATCA CTCTCGATGC GCAAGCTGAA ACGGGCGAAC TCTACGATTT GGCTGATGTA GGCTTGCGGG TGCTGGCTCC CAATGGCAGC GAACAAATCT TGCGTGCACC CCAAATTGCG CCAGGTCGTT ATCAAGCGCT GGCTGATGCC TCCCAAACTG GCGCGTACCA TATTTTGGCA GCGTTGGAGC AAGGCCCAAA TCGGCTCGAA ACCCAAGCTG GCGTGATTCA TCCCTACAAT CGTGAATGGG CGGTTTCGGC TAACCCCGCA CTGTTAGAGC AATTGGTCGG GCTTGGGCAA GGCCAAATCG GCAGCTTGGA GCAAATTGCC CCCAGCCTGC AAGTTGCCAA CCAAACCAGC AATACCCAAT GGTGGCCATG GCTGATTGCG CTTGCCTTAG GCTTATGGGT GGTTGAAATT GCCATCCGCC GTGGAGTTAT TCGCTGA
|
Protein sequence | MIFLGLFLVI GLSGWWIARK TTLDWRVVGL RLTSLACILL ALALPRNQAN QQASPLILLV DQSANLPSEL RDAAWNEAVR FYQQQIEQRP VRLLAFGADV RVSQTDQRPA IDPNGSDLAG ALQFASGLLP QGGDIILLSD GASTTTNGQN QVSTFAQRSI RLHGVPISYP ETDIRVESLL VPPALREGER FSADVVLYSS VDGQVRLELS SDGVGLAGQT INVEQGRNLV SFQSTAGARG FHRFQATLLA TNDQQPANNQ LDAWTVVGPP PRVLIIERSP DSSANLRDAL EAANLVTEAL RPAALPTSLS QLRVYDSIVL QDISANDLSL DQQLALREFV RSLGHGVVVL GGTNSYNLGS YAGTPLEELL PVSMEPPPRR ERPTVTLLLI LDRSASMLGE SGKDKFSLAK AAAIAATDSL GADDTIGVLA FDDTNDWTVT FTKVGQGVQL SEIQNNIAGL SAGGGTDIYA ALEVGMGGLA QQTGKVRHAV LLTDGRSGGE SSYESLIAPL RAQGITLSTI AIGGDADTVL LESLAKLGAG RYHFASRPDD LPRLTLQEAE IARENPLTEG QFQANLATPH PAIRGLNLGE IPPFGGYVAV TPKPEAEQLL TTTEGDILLA TWQYGLGRAT AFTSDSGERW TATWRPWPNW GNTLAQIIAA TYPNPARGDL RVSSELQQNQ AIITLDAQAE TGELYDLADV GLRVLAPNGS EQILRAPQIA PGRYQALADA SQTGAYHILA ALEQGPNRLE TQAGVIHPYN REWAVSANPA LLEQLVGLGQ GQIGSLEQIA PSLQVANQTS NTQWWPWLIA LALGLWVVEI AIRRGVIR
|
| |