Gene Information |
Locus tag | Haur_0047 |
Symbol | |
ID | 5731919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 62267 |
End bp | 63649 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641277168 |
Product | VWA containing CoxE family protein |
Protein accession | YP_001542827 |
Protein GI | 159896580 |
COG category | [R] General function prediction only |
COG ID | [COG3552] Protein containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
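The coordinate, length, and protein-length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon that is not counted in the protein length; neither assumption is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 62267
end_bp = 63649
gene_length_bp = 1383
protein_length_aa = 460

# Assumption: coordinates are 1-based and inclusive, so both ends count.
computed_length = end_bp - start_bp + 1

# A CDS should contain a whole number of codons.
codons, remainder = divmod(computed_length, 3)

# Assumption: the final codon is a stop codon and yields no amino acid.
computed_protein_length = codons - 1

print(computed_length)          # 1383
print(codons, remainder)        # 461 0
print(computed_protein_length)  # 460
```

Under these assumptions the three fields agree: 1383 bp is 461 codons, which is 460 amino acids plus one stop codon.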
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATGAAC GAATTGTGGC CTTTGTCAAG GCGCTGCGAG CAGCCGGAGT TCGTGTGTCG CTCGCCGAAA GCCTTGATAG TTTTAATGCC TTAGAACAGC TTGGCATCAG CGATCGCCAG CTTTTTCACG ATGCGCTGTT GGCAACCTTG GTCAAAGATG CTAGCCAGCA AGCGCAGTTC GAGGAGCTAT TTCCGCTTTT TTTTGGCCAT GGCGATGCCC CGATGATCAG CGGAGCCAAT GGCTTGAGCG GGCTAGATCC CGACGAATTA CAACAATTGC GCCAAGAAAT TGCCCAATTG CGCGAACGGA TTCGCGAATT AATGCAACGG CTGATGGATG GCCAAAATCT CACGCCTGAA GAATTGGCGG CACTAGCACG AAGTTCGGGC ATCAATCACA TTCAATCGCT CAACCAACGC CGCTGGGTCG AACGGCGCAT GGAACAACAA ATGGGCCTGA ATGAATTTAA ACAGGCGCTG GAAGCTTTGC TCAAACAATT GGAAGAAGGC GGGATGGATG CCGCCGCCTT GGCCGAAATT ATGCAGCAAA TGCAGGGCAA TGCCCAAGCT ATGCGCGACC AAATTAGCCA ATTCGTTGGG TCGGGCTTAG CCGAGCGCAT GAGCGACGAC TACAATCCGC AAACTGGTGA TGATCTCCAG CATCGGCCAT TTGGCTCGCT CTCCGATGCT GATGTGCAGC GTATGCGTCA AGAAGTACGC CGTTTGGCGG CTTTGTTGCG CTCACGAGCA GCGTTGCGCC AAAAACGCGA TAAAGCGGGT CAGGTTGATA TCAAACGCAC CATGCGCAAC AATATGCGCT ACGACGGCGT GCCGATGAAA TTGGAATATC GTAAAAAGCA ACAAAAACCT AAATTGGTAA TTATTTGCGA TATTTCGACC TCGATGCGAC CTGTGGCTGA ATTTATGCTG CGTATGATTT ACGAACTGCA AGATCAAGTT AGCAAAACCC ATTCATTTGC TTTTATCGCT AATTTGCACG ACATTACCGA GCAATTGAAT GATAGCCGCG CCGATATTAG CGTCAATGAT GTGCTCGAAA GTATCCCGCC TGGCTACTAC AACACCGACC TTGGTCATAG CCTCGATACG TTTTTGCATA GCCACCTTAG TACGGTCGAT TGGCGTACTA CGGTGATTAT TGTGGGTGAT GGCCGCAATA ATTTCAACAA TCCACGGCTG GAATCATTGC AAACAATTCG TCGCCATGCC AAGCGCTTAA TCTGGTTTAC TCCCGAAGAT CGCTGGCAAT GGGGCACTGG CGATAGCGAT ATGCAGCTCT ACGCACCGCT TTGCGACCGT GTGCATCTCG TGACCAACTT GGCTGAATTA ACGGCAGCGG TTGATCGGCT ATTGGCTAAC TAG
|
Protein sequence | MHERIVAFVK ALRAAGVRVS LAESLDSFNA LEQLGISDRQ LFHDALLATL VKDASQQAQF EELFPLFFGH GDAPMISGAN GLSGLDPDEL QQLRQEIAQL RERIRELMQR LMDGQNLTPE ELAALARSSG INHIQSLNQR RWVERRMEQQ MGLNEFKQAL EALLKQLEEG GMDAAALAEI MQQMQGNAQA MRDQISQFVG SGLAERMSDD YNPQTGDDLQ HRPFGSLSDA DVQRMRQEVR RLAALLRSRA ALRQKRDKAG QVDIKRTMRN NMRYDGVPMK LEYRKKQQKP KLVIICDIST SMRPVAEFML RMIYELQDQV SKTHSFAFIA NLHDITEQLN DSRADISVND VLESIPPGYY NTDLGHSLDT FLHSHLSTVD WRTTVIIVGD GRNNFNNPRL ESLQTIRRHA KRLIWFTPED RWQWGTGDSD MQLYAPLCDR VHLVTNLAEL TAAVDRLLAN
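The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It assumes the sequences are pasted in with the 10-character spacing shown above, and it uses the standard codon assignments, which translation table 11 shares with table 1 for internal codons (table 11 differs in the set of permitted start codons; this gene starts with ATG, so that difference does not arise here). The helper names `clean`, `gc_percent`, and `translate` are hypothetical and not part of any database tool.

```python
# Codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence from the record.
first_blocks = "ATGCATGAAC GAATTGTGGC"
print(translate(first_blocks))  # MHERIV
```

The printed residues match the start of the protein sequence in the record (MHERIV...). Passing the full gene sequence to `gc_percent` and `translate` would allow comparison with the reported GC content and protein sequence.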