Gene Information |
Locus tag | Hneap_1059 |
Symbol | |
ID | 8534206 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 1144401 |
End bp | 1146278 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 646383443 |
Product | von Willebrand factor type A |
Protein accession | YP_003262942 |
Protein GI | 261855659 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03503] conserved hypothetical protein TIGR03503 |
|
|
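The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based, inclusive coordinates and that the reported protein length excludes the stop codon. Neither convention is stated in the record. The function names are hypothetical.

```python
# Sanity check of the coordinate and length fields in the record above.
# Assumption: Start bp / End bp are 1-based, inclusive coordinates.
# Assumption: the protein length does not count the terminal stop codon.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS, minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length_bp(1144401, 1146278)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)  # prints: 1878 625
```

Under these assumptions the result agrees with the Gene Length (1878 bp) and Protein Length (625 aa) rows.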
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0405627 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGTAC CACCGACAAG CCCGCTCATC AGAATGCATG CATCCACCAT TTGCGGGCTA CTTATTCTGC TGATTACCGG GCTGGCTTTC GTACCCGCAC AGGCAGCAAC GCCTCCGGAA CTTCATGTGT TAATCGACGT TTCCGGCAGC ATGAAGCAAA CCGACCCAAA CAATCTAAGA CGCCCCGCCT TGCGCTTACT GGGTGATTTG CTGCCTCCCT CTGCGCAAAT TGGCATCTGG TTTTTCGGTG ACAAAGTCAG CCTTATGTTA AAAACGGCCG GTGCTGATCC AAAAGTAAAG GAACGCGTCC GTCAAACCGC CAAAAAAATA CGATCCAACG AACCCTTCAC CGATATTCCT GCTGCCCTAG CCGCGGCTGC CGCCACCTGG AACGATGGCA CGGACCGTAA CATTCTGTTG TTGTCCGATG GCATGGTTGA TATATCACCG GAAAAAGCCA TTAATGTTCG AGCCCAAGAA GAACTTTTGC AGAAACTCGT GCCGCAACTC AGGGCCGAAC ATATTCGCGT GCACACGATC GCCTTGTCCA AAGATGCCGA CAGTAAGCTG CTTTCCCAGA TTGCAGCCGA CACGGGCGGC ATTTTTGTCG AAGCCGACTC GGCCGACGCA TTGCAACGCG CTTTTCTCAA GATATTCGAA GCGGCTGCCC CACGTGACGG CCTTCCACTG AAAGACAACA AGTTCCTGGT TGATCGTGCC GTCAAAGAAC TGACCATCCT CGCGTTCCGA GACAAACCGG AAAACCAGAC AAAACTCAAG CTACCCGATG GTCAAATCGT TGACGCCCAA GGCAGCAGGT CCAAAATGGG TTGGCGTTGG GATGACAGTG GCGGCCGTGA TCTGGTTACC ATCGAAAATC CGCCGGCTGG AGCGTGGCAG ATAATCGGCG GACTCGATCC GGACAACCGA GCGCTGATCA TCACTGATCT CAAACTTCAC CTGACGCCCT TGCCGACGCG TATCTACCCG GGGGAACGGA TCGATGGCGC GCTGATGCTG ACCAATCATG ATCAACCGAT CACCAAGGCC GCATTAACGC AGACAATCAA CGCAACGATC GATGAACAAA AAGGAAGCGA ACTGATTCAG TCGATCAAAC TTAATGATCA GGGCGCAGAC CCGGACATTA TCGGCGGTGA TGGAAAATTC AACTATCAGC TCCATCTGAT TGATCCTGCG GGTATTTACA GCCTGGTCGC CACGGCAAGC AGCCCGACCT TCCAACGTGA TTGGCGCCAG AATTTTGCTA TGGCACCGTT GCCACCAGTG CAACTCAAAC TCGTTTCATC CGTAGTCGAA GTATCACCAG CCGAATCTTC GAATAGCCCG GATGGACACC CCGCCCAACC CGTCAAAAAG ACCCTGCGTC AGATTCAAGT CACTCAGGAT CCCACGGTAC TTGAACCTAA TACGGCCAAA TTAATCGGAC AATGGCAATG CGAACGGCCC CATTCAAATA AGCCCGATGC AAATAAACCC GATTCAAATA AATATGGCGC GCCCATACCC ATCGAATGGC ATTTAACCGA CACCAGCATG ATGTTCCCTG CGCCCGATGA GACGACGGCA GACTGTGTGC TGAACGCTGT GCTTAAAGCA AAACTGACGA CGCACAGAGA CATCGATCTG GTGCTTGAAC CATTCAAGCT ACCCGCGCTA TCTCACCCCA TGCCACAACC CAAGCAGGCA GAACCTGTCC AACAGGCGCA AGAGGGTGCT TCGGGTAGGA TCAACTGGAT GCTCATCATC GTTATCAATG CCGTGGCGCT GCTCCTGATC 
GGACTGGGGA CATGGCTTTG GAAACGAAAA GTGAAACGTA CACACCGCCA ACTGCTTGAA GAGGCACAAT CAGTATGA
|
Protein sequence | MPVPPTSPLI RMHASTICGL LILLITGLAF VPAQAATPPE LHVLIDVSGS MKQTDPNNLR RPALRLLGDL LPPSAQIGIW FFGDKVSLML KTAGADPKVK ERVRQTAKKI RSNEPFTDIP AALAAAAATW NDGTDRNILL LSDGMVDISP EKAINVRAQE ELLQKLVPQL RAEHIRVHTI ALSKDADSKL LSQIAADTGG IFVEADSADA LQRAFLKIFE AAAPRDGLPL KDNKFLVDRA VKELTILAFR DKPENQTKLK LPDGQIVDAQ GSRSKMGWRW DDSGGRDLVT IENPPAGAWQ IIGGLDPDNR ALIITDLKLH LTPLPTRIYP GERIDGALML TNHDQPITKA ALTQTINATI DEQKGSELIQ SIKLNDQGAD PDIIGGDGKF NYQLHLIDPA GIYSLVATAS SPTFQRDWRQ NFAMAPLPPV QLKLVSSVVE VSPAESSNSP DGHPAQPVKK TLRQIQVTQD PTVLEPNTAK LIGQWQCERP HSNKPDANKP DSNKYGAPIP IEWHLTDTSM MFPAPDETTA DCVLNAVLKA KLTTHRDIDL VLEPFKLPAL SHPMPQPKQA EPVQQAQEGA SGRINWMLII VINAVALLLI GLGTWLWKRK VKRTHRQLLE EAQSV
|
| |
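The relationship between the gene sequence and the protein sequence can be sketched as a plain codon-by-codon translation. This is a minimal illustration, not the database's own pipeline. It assumes the gene sequence is already shown in coding orientation (it begins with ATG even though the gene lies on the minus strand). It also relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it permits. The helper name `translate` is hypothetical.

```python
# Minimal codon-by-codon translation sketch.
# Assumption: the input is the coding-strand sequence, 5' to 3', in frame.
# The codon assignments below are the standard ones, which translation
# table 11 shares (table 11 differs only in its permitted start codons).

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA to protein, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group the sequence in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCCAGTAC CACCGACAAG CCCGCTCATC"))  # prints: MPVPPTSPLI
# Last 18 bases of the gene sequence above, ending in the TGA stop codon.
print(translate("GAGGCACAAT CAGTATGA"))  # prints: EAQSV
```

Both fragments reproduce the corresponding ends of the protein sequence listed in the record.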