Gene Information |
Locus tag | Haur_3408 |
Symbol | |
ID | 5735269 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4294032 |
End bp | 4295414 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280555 |
Product | VWA containing CoxE family protein |
Protein accession | YP_001546172 |
Protein GI | 159899925 |
COG category | [R] General function prediction only |
COG ID | [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
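The coordinates and lengths in the table above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record (Start bp, End bp).
length_bp = gene_length(4294032, 4295414)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1383 460
```

Both results match the Gene Length (1383 bp) and Protein Length (460 aa) fields.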
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCAAT CCGAGCTATT AAATCGTCGC CAAGTGCTCT ATTGGCGTAT GCTCAGCACT ATGTTTGGCT ATGACCAGCA GGGCGAAAAT TTCGACAGCA TGAGCCATGA GATTGCCCAA GATCTGGCCT TGCCTGAGTC GATTTTGCAC CCAACACTCT CGCTGGAGCA ATTGTTTCAA CGTTACCCTG AGCTTGAGCC GGAGTTCAAT CTGACTGAGC TTGACGATCG CCAAGATCCT ACGACTTTGC GCCGTTCGTT AATTATTTCG AAGTTGTTGC TGAATGTCTT TGGCCCTCAA ACCCAAAAAC GCTCGATCAG CGCTGCCGAG TATGCCCAAT GGCTCAAAGA TGTGGCCCAT CTTGAACGTT GTTTGGGTTT TCAGCCTGGA GCGTTGCGTC AAAGCCAACC TGGTCAAGGC CAAGCTAGCC AACCTGGTGG TTTGCAAGGT GGGCAGGGCG TTGGCTCAGG CTTCAATCTC TCCGAAGAAG AGTTGCGCCA AGTTATCCAA GGGCTGGAAA AAGATTTGAT CAAGCGCATG GCTTTGCGCG AAGTGCTGCA AGATAATCGG CTTGCCGCCC AACTTACGCC TTCGATGGCG GTGGTTGAGC AATTGCTGCG CGATAAAAGC CATCTTTCGG GCAATGCCTT GATTAACGCC AAACGCCTGA TCAAGCAATA TGTTGATGAA TTGGCCGATG TGTTGCGTTT GCAGGTGATG CAAGCCGTTT CAGCCAAAAT CGATCGTTCA GTGCCACCTA AGCGGGTGTT TCGCAACCTC GATTTGAAAC GCACAATTTG GCGCAATCTG ACCAATTGGA ATTCCAATGA AGGCCGTTTG TATGTTGATC GCTTGTATTA TCGTCAAACT GCCAAAAAAC GCACCCCAAT GCGCATGATC GTGGTCGTCG ATCAATCTGG CTCGATGGTT GATGCCATGG TGCAATGCAC AATTCTGGCT TCGATTTTTG CCGGTTTGCC CCATGTTGAT ATGCATTTGA TCGCCTTCGA CACGCGCATG CTTGATCTCA CGCCTTGGGT GCACGACCCG TTTGAGGTAT TGCTGCGCAC TCAGCTTGGC GGCGGCACAA GCATCAACGA AGCCTTGCTC TTTGCCAGCG AAAAAATTCA AGAGCCACGC AAAACCGCCG TGGTGCTGAT CACCGATTTT TACGAAGGCG GTTCGGATCA AGTGCTGCTC GATACAATCA AAGCCATGAT CGAATCGGGT GTGCATTTTA TTCCGGTCGG GGCGGTCACC AGTTCGGGCT ATTTCAGCGT CAACGATTGG TTCCGTACCA AGCTCAAAGA AATGGGTCGG CCAATTTTTG CTGGCAGCCC TCGCAAGCTG ATCGAACAAA TTAAGCAATT TATTACCTTG TAA
|
Protein sequence | MNQSELLNRR QVLYWRMLST MFGYDQQGEN FDSMSHEIAQ DLALPESILH PTLSLEQLFQ RYPELEPEFN LTELDDRQDP TTLRRSLIIS KLLLNVFGPQ TQKRSISAAE YAQWLKDVAH LERCLGFQPG ALRQSQPGQG QASQPGGLQG GQGVGSGFNL SEEELRQVIQ GLEKDLIKRM ALREVLQDNR LAAQLTPSMA VVEQLLRDKS HLSGNALINA KRLIKQYVDE LADVLRLQVM QAVSAKIDRS VPPKRVFRNL DLKRTIWRNL TNWNSNEGRL YVDRLYYRQT AKKRTPMRMI VVVDQSGSMV DAMVQCTILA SIFAGLPHVD MHLIAFDTRM LDLTPWVHDP FEVLLRTQLG GGTSINEALL FASEKIQEPR KTAVVLITDF YEGGSDQVLL DTIKAMIESG VHFIPVGAVT SSGYFSVNDW FRTKLKEMGR PIFAGSPRKL IEQIKQFITL
|
| |
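Although the gene lies on the minus strand, the gene sequence as listed begins with ATG and ends with TAA, so it appears to be given in coding orientation and can be translated directly. The following is a minimal sketch of how the protein sequence and GC content could be recomputed from the nucleotide sequence. It relies on the fact that translation table 11 uses the standard codon-to-amino-acid assignments (it differs from table 1 only in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation tables 1 and 11).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the 10-base grouping whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence from the record.
first_groups = "ATGAACCAAT CCGAGCTATT AAATCGTCGC"
print(translate(first_groups))  # MNQSELLNRR
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be checked in the same way.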