Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4043 |
Symbol | |
ID | 5735905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5162033 |
End bp | 5163304 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281194 |
Product | von Willebrand factor type A |
Protein accession | YP_001546803 |
Protein GI | 159900556 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTACCC AAGCAATGGT TCAACTGCGC ATCACCCCTG GGCGGCCAGC AGTTGCCCAA AGTAACGATC CGCAAATTGT TTATTTGTTG GTCGAAGCCT CGCCTGCTGG CATTCCCGAT GCCGATTTGG CGATTCCGGT CAACCTTGGT TTTATTGTTG ATCGTAGCTC TTCGATGCGC GGCGAACGGC TTTACCAAGT CAAAGAGGCT TGTAATAACG TCGTTAATCA GCTCAATCGC CAAGATTATT TCTCGGTGGT GTCGTTCAAC GATCGGGCTG AGGTGGTTGT ACCCTGCCAA CGCCCCAACG ATAAAGACCA AATTAAACGC GCGATTGGCA TGATCGAGGC CAAAGGTGGC ACTGAGATGG CCACAGGCAT GATGATGGGC TTACAGGAAA TTTCACGCCC TATGATGAGC CGCGGCATCA GCCGTATGGT CTTGTTGACC GATGGCCGTA CTTATGGCGA TGAAAGCCGC TGTGTCGAAA TTGCGCGGCG TGCTCAATCC AAAGGCATCG GCATTACAGC CTTGGGCATT GGCGATGAGT GGAATGAAGA TCTGCTCGAA ACAATCGCCT CAGCCGAAAA CAGCCGCACC GAATATATCA CCAATGCTCA GCAAATTGTC AACGTTTTCT CTGACGAGAT CAAACGTTTG CAAAATGTGA TGGCGCATAA AGTTGAAATG CGTTTCCATC TGCACCCGCA GGCTGAAATT CGTTCGCTGT TTCGGGTGCG CCCATTTATT GCCGCGCTCA CCCCACAATT GCATAACGAA ACGCTGTGGC GTATGCCACT GGGCGAGTGG GTTGGCCGCG AAGATCAAAT CTTTTTGTTA GAGCTGGTCG TGCCGCCGCT GCCCGCAGGC AATCAAACGA TCTGTCGGAT CGAGATGTTT TACGAAGTGC CCAGCATCAG TAGCCAAGCC TTACAAACCA AGGTCGATGT CCAACTGCCG GTACGGCCTG CCGAGCAAAT TCGGCCTGAT GTTGATGGCG TGGTTAAACA TTGGCTCGAA CGCACCGTGG CCTATCGTTT GCAAGCCTCG GCGTGGCAAC ATGTTGAGCA AGGCAACATC GAGGAAGCGA CCAAAAAGTT ACGCATGGCT GGCACACGCT TGCTCGAATC GGGCCAAACT GAGCTTGCCC AAACCGTTCA AGAAGAGGCC ACCCGCCTGT TGCGTAGCGG CACAACCAGC GATGAAGGTC GCAAACGGAT TAAATACGGC ACCCGTGGCT TGGTCGCTCG CGAACGGGGC GGAGAGCAAT AG
|
Protein sequence | MSTQAMVQLR ITPGRPAVAQ SNDPQIVYLL VEASPAGIPD ADLAIPVNLG FIVDRSSSMR GERLYQVKEA CNNVVNQLNR QDYFSVVSFN DRAEVVVPCQ RPNDKDQIKR AIGMIEAKGG TEMATGMMMG LQEISRPMMS RGISRMVLLT DGRTYGDESR CVEIARRAQS KGIGITALGI GDEWNEDLLE TIASAENSRT EYITNAQQIV NVFSDEIKRL QNVMAHKVEM RFHLHPQAEI RSLFRVRPFI AALTPQLHNE TLWRMPLGEW VGREDQIFLL ELVVPPLPAG NQTICRIEMF YEVPSISSQA LQTKVDVQLP VRPAEQIRPD VDGVVKHWLE RTVAYRLQAS AWQHVEQGNI EEATKKLRMA GTRLLESGQT ELAQTVQEEA TRLLRSGTTS DEGRKRIKYG TRGLVARERG GEQ
|
| |