Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0249 |
Symbol | |
ID | 5732144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 289880 |
End bp | 292732 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277373 |
Product | von Willebrand factor type A |
Protein accession | YP_001543029 |
Protein GI | 159896782 |
COG category | [S] Function unknown |
COG ID | [COG5426] Uncharacterized membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTATCGT TCGTCCGACC AGAATATCTC TGGTTTTTGC TGGCGCTGCC ACTGGTCTGG CTGTTGGGCT GGCTCAATAA CCGTGGTCGC ACAGGCCAAC GCCGTTGGTG GGCCTTGGGC TTACGCACGC TATTGCTGCT GTGTTTGATC GGCAGCCTTG CCGGAACCCA AGTACGCCAA CCAGTTCAAA ACCTAACCAC GGTATTTTTG CTCGATAGCT CCGACTCAAT TGCCCCGGGC CAACGCTCTA ACAACGAGCA ATTTATTGCC CAAGCGCTCG AAACCATGCA AGAAGGCGAT AAAGCTGCCG TCGTGGTGTT TGGCGAAAAC GCTTTGGTTG AGCGGGTTCC CTCTGAAATT CAGCGCTTAG GCACAATTCA ATCGGTGCCA ATTGCTGCGC GTACCGATAT TAGCGAAGCA ATTCAGCTTG GTTTGGCGCT GTTTCCCGCC GATACCCAAA AACGTTTGGT GTTGCTCTCT GATGGTGGCG AGAACAGTGG CCGCGCCTTG GAGATGATCC CACTGGCGCA ACGCCGCAAT GTGCCAATTG ATATTGTGCC AACAGGCATT GGCCAAGGCA ACCCCGAAGT GGCAATTAGT GCCTTTCGGG CACCATCAGC CGCCCGTTCA GGCCAAGAAA TTCAGTTGAT CGCCACGATT GAAAGTAATA CCGCCCAATC AGCCCAATTG CGCTGGCGAG CCGATGAGCA AATTGTACTG GAAGAAGCGA TCAATCTACC AGTTGGCACG AGTAGCTTTA CCACAACGCT CGTTGTCAAC GATCAAGGCT TCCATCGCTA TAGTGCCCAA GTTGTGCCAA CCAGCGATAC GCGTGCCCAA AATAATGTGG CGGCTTCGTT GGTGCAAATT GGTGGCCCGC CCAAGGTGCT GCTGGTTGAA GGCGAAGTCG GCGATGCAAG TGCGCTCAAG CCAGCACTCG AAGCTGCCAA CCTTGTACCA GTCGTTGTTC CAGCCACTGG CTTGCCCACC GATTTAGCAG CATTGAGCGA TTATGAAGCA GTCTTGTTGC TGAATGTGCC GTCGCGCGAT ATCGATCAAG ATACCCAAAA ATTATTACGC TCGTATGTTG GCGACCTTGG GCGTGGCTTG GCAATGATCG GCGGTCGCCA AAGTTTTGGC GTGGGTGGCT ACACGGCCAC CCCCATCGAA GAAGCCTTGC CCGTCAATAT GGATGTGCGC AATCGCCAAC AACGCCCTGA TATTGCTTTG GTGTTTATCA TCGACAAATC GGGCAGTATG GATGCTTGTC ACTGTAACGG TGGCGATATG GCGGCGCGTG AAGGTGGTGG CACGCGCAAA ATCGATATTG CCAAAGAAGC GGTGGCTCAA GCCGCTGCGG TGCTGGGCAA AGACGATAAA TTGGGTGTCG TGACCTTTGA TGATTCGGCG CATTGGACGA TTGAACTCGA TAAAGTGCCC AGCCAAGATG ATGTTGTCGC GGCTTTGGCT CCTGTGCCAC CAAGCGGCCA AACTAACGTG GTTAGTGGCA TGAACGCTGC CTATGAGCAA TTGCGCCAGA GCGATGCTAA AATCAAACAT GCGATTTTGC TGACCGATGG TTGGGGCCAT GCTACCGATA TCGGATCAAT CGCCGAAAAT ATGAACAAAG ATGGCATTAC GCTCTCGGTG GTTGCAGCAG GTAATGGCTC GGATAACGCT TTGCAACGCT ATGCTGAGCT GGGTGGTGGA CGTTATTATC CAGCCCGCGT GATGGAAGAA GTGCCGCAAA TCTTCTTGCA AGAAACGATT CAGGCGGTTG GCACTTATAT CGTTGAAGAA CAATTTACCC CGGCTTATGC TGGCGATAGC CCGGTGCTGG CCGATTTGCA AGAAGGCTTG CCAAGCTTGT TGGGCTATAA CGGCACAGTC GAAAAAGATA ACGCTCAAGT TATTTTGACT GCCAGCGATG GCTCTCCCAT TTTGGCCCAA TGGCAATATG GGCTTGGCCG GAGCATCGCT TGGACGAGCG ATCTCAAGGG CAAATGGGCC TCAAACTGGG TCACATGGGA AGAATTTCCA CGCTTCACGG CGCAGTTGGT TGGTTGGCTT TTGCCACGTA TCAGCAACGA TAATGTCAGT GGTGAGGCCT CGTTAATTGG CAGCGACGTG CAAATTGATA TTGTTGCCAA CGACGAGAAG GGCAATCCAC AAACTGCGAT GAACGTCAAT GCTCGTTTGA TCGGGCCAAC TGGCGAGGCG ATTGATGCAA CCTTGGCTGA AGTTGGGCCT GGCCAATATC GCGCACGGGT GGCTAGCCCA ATTGCTGGCA CCTATTTGAT TCAGGTGATC GGCAATGATG CAAACGGCAA GCCAGCCTTT GCCCGCACCT TGGGCTTGAT TGTTCCCTAC TCGCCAGAAT ATCGCCAAGG CCAATCTAAC CCTGAATTGC TGAGCACTTT GGCCAAAGCC ACTGCGGGCC GCAGCTTGAG CCAACCAATG CAAGCGTTTG ATCATACGCT GGATGCAGTG CGCCGCGCTA CGCCTATTGA TTTGGGCTTG TTGTTTGCAG CTTTGGTGTT GCTGTTGCTT GATATCGCAA TTCGCCGCCT CAACTTGCGC CGCAAAGATT TTGCCGCCTT GCAAGCAGCT CGCAAAGAGC GCCAAACGAT TGCTGCCGCC CCAACTGCCA CAATGAACAG TTTGCAGGGA GCCAAGGGGC GTGCCCGCCA GCAAATGTTC AGCGATAAGA GCGAGCGCGA AGTTAAGCCC AAAGAAAACC CAGCAACTAC GCCATTACCA AGTACACCAA ACAATCCAAC CAAAGCCGTT GATGAAGCCG AAGATCCACT CGAACGGCTC CGCGCCGCCA AAAATCGTGC CCGCAGGCAA TAA
|
Protein sequence | MLSFVRPEYL WFLLALPLVW LLGWLNNRGR TGQRRWWALG LRTLLLLCLI GSLAGTQVRQ PVQNLTTVFL LDSSDSIAPG QRSNNEQFIA QALETMQEGD KAAVVVFGEN ALVERVPSEI QRLGTIQSVP IAARTDISEA IQLGLALFPA DTQKRLVLLS DGGENSGRAL EMIPLAQRRN VPIDIVPTGI GQGNPEVAIS AFRAPSAARS GQEIQLIATI ESNTAQSAQL RWRADEQIVL EEAINLPVGT SSFTTTLVVN DQGFHRYSAQ VVPTSDTRAQ NNVAASLVQI GGPPKVLLVE GEVGDASALK PALEAANLVP VVVPATGLPT DLAALSDYEA VLLLNVPSRD IDQDTQKLLR SYVGDLGRGL AMIGGRQSFG VGGYTATPIE EALPVNMDVR NRQQRPDIAL VFIIDKSGSM DACHCNGGDM AAREGGGTRK IDIAKEAVAQ AAAVLGKDDK LGVVTFDDSA HWTIELDKVP SQDDVVAALA PVPPSGQTNV VSGMNAAYEQ LRQSDAKIKH AILLTDGWGH ATDIGSIAEN MNKDGITLSV VAAGNGSDNA LQRYAELGGG RYYPARVMEE VPQIFLQETI QAVGTYIVEE QFTPAYAGDS PVLADLQEGL PSLLGYNGTV EKDNAQVILT ASDGSPILAQ WQYGLGRSIA WTSDLKGKWA SNWVTWEEFP RFTAQLVGWL LPRISNDNVS GEASLIGSDV QIDIVANDEK GNPQTAMNVN ARLIGPTGEA IDATLAEVGP GQYRARVASP IAGTYLIQVI GNDANGKPAF ARTLGLIVPY SPEYRQGQSN PELLSTLAKA TAGRSLSQPM QAFDHTLDAV RRATPIDLGL LFAALVLLLL DIAIRRLNLR RKDFAALQAA RKERQTIAAA PTATMNSLQG AKGRARQQMF SDKSEREVKP KENPATTPLP STPNNPTKAV DEAEDPLERL RAAKNRARRQ
|
| |