Gene Information |
Locus tag | Hlac_0125 |
Symbol | |
ID | 7401646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 132752 |
End bp | 134467 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643707189 |
Product | von Willebrand factor type A |
Protein accession | YP_002564801 |
Protein GI | 222478564 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
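The coordinate and length fields above are mutually consistent, which the short sketch below checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(132752, 134467)
print(length)                  # 1716
print(protein_length(length))  # 571
```

Both results agree with the Gene Length (1716 bp) and Protein Length (571 aa) fields of this record.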
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.178394 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGACA ACGACAACAT CGACACAATC GGACTTTCGC GGCGCACGCT GCTGGCGGGC CTCGGCGCGA TCGGCGTGGC CTCAGCGGGG GCTGGGCTCG GGACGACGGC GTACTTCAAC GACACCGAGT CGTTCGAGAA CAACACGCTC ACCGCCGGCC AGCTCGACCT GCTCGTCGAC TGGCAGCAGA CGTACGACTT CGGCGATGGC CATCAGTTCG TCAGCGCGCA CCCGGACCAC GACGGCGACG GGGAGCAGTC GATCGCCGCT GACAACGACG CCGGAGAGAT CAAGTACAGC GACTTCCCGG ACGACGAGGA CGAAGACAGC AACGGGGCGA ACATCCCCGT CCTCGACTGC GAGACCATCC CGCCGCTTTC GGAGGCGGAC TTCGGCACGG ACCCCGTCAC CGGCGAGGAG ATGGAGACGC TCGTCCAGTT CACCGACGTG AAGCCCGGCG ACTCGGGCGA GATCACCTTC TCGTTACACC TCTGTGACAA CCCCGGCTAC ATCTGGATGC AGGCGGGCAA CGTCAGCGAT GAGGGCGGCG CGTTCACGGA GCCGGAAGAC GTCGCTGCCG GCGAGAACGC CGCCGACCTC GCGGACGCCA TCGACGCGAA GCTCTGGTAC GACGAGGACT GCGACAACGT TCACGACGAC GCGGGACCGA TCGACATCAT GCTCACGCTC GACTTCTCGG GGTCGATGCT GTACGACCAG TACGGCGGCG TGGTGAGCAC GGACCCGATC CAGGTCGACG GCCAGAGCTA CGGCGAGACG ACGAAGATCG ACCTCGTCGA GCTCGGCACG CGGCAGTTCA TCGACTACCT GCAGGCGCAG AATGCCGACG TGCAGGTCGG CGTCGTCTAC TTCGACGGCG AGGGGAGCGG CGAGAACACC CCTCGGACCG GCATCCTCGA ACCGCTAACG ACCAACCTCT CGGCGGTCGA CACCGCGCTG TCGAACCTCC GTCAGAAGCT CGCGAACGTT GTGAGCGACG CGGCCCCGTC GACGCCGTTC GACAACGACG GGAACCCGGA CCCGTACTCG AACGCCGACG GCATCGCCAC GGGCACCTAC ATCAGCGAGG GGCTCGACGA CGCGCAGACG GAACTCGCCA GCAACGGTCG AGCGAGCGCG GAAAAGCGGA ACATCGTCCT CTCGGACGGC GAGTCGTTCA ACGGCGACGG GAACACGAAC TACGCGCCGC CCGCGAGCGC CGCGGCCAAC GCCCGCGCCG CGTCCCCGGC GCCCGCGACC GACGTGTACA CGATCAACGT CAACGGCAGC GCCAGCACGC TCCAAGCGAT GGCCGGCCCG GCCGGCGGCT CGGGCGGCGA TCCCGTGTTC TTCAACGACA TCAACGACCC GCTCAACATC CCGACGGTGT TCGGCAACCT CGCCGCACAG ACCGTCGCGG AGAAGGTCAT CATGGAGGAC ACGCTCGCGA ACGTCCTCGA CGCGCTCGCG GACGGGAACG GCGTCCCGCT CGACGGGAAC CGTGCGACGC TGTACGACGA ACTCAGCGAC GACCCCAACG ACCCGGACCG CGAGGCGTTC CGCGGTGACG GCGTGATGCA CTGCGTCGCG CTCTCGTGGG AGCTCCCGTT CGACGTGGGC AACGAGATCC AAGGCGACAC GCTCAGATTC GACCTCGGGT TCTACACCGA ACAGGCGCGC CACAACGACG GTACCGGTCC CGATCTGCCG GCCTGA
Protein sequence | MADNDNIDTI GLSRRTLLAG LGAIGVASAG AGLGTTAYFN DTESFENNTL TAGQLDLLVD WQQTYDFGDG HQFVSAHPDH DGDGEQSIAA DNDAGEIKYS DFPDDEDEDS NGANIPVLDC ETIPPLSEAD FGTDPVTGEE METLVQFTDV KPGDSGEITF SLHLCDNPGY IWMQAGNVSD EGGAFTEPED VAAGENAADL ADAIDAKLWY DEDCDNVHDD AGPIDIMLTL DFSGSMLYDQ YGGVVSTDPI QVDGQSYGET TKIDLVELGT RQFIDYLQAQ NADVQVGVVY FDGEGSGENT PRTGILEPLT TNLSAVDTAL SNLRQKLANV VSDAAPSTPF DNDGNPDPYS NADGIATGTY ISEGLDDAQT ELASNGRASA EKRNIVLSDG ESFNGDGNTN YAPPASAAAN ARAASPAPAT DVYTINVNGS ASTLQAMAGP AGGSGGDPVF FNDINDPLNI PTVFGNLAAQ TVAEKVIMED TLANVLDALA DGNGVPLDGN RATLYDELSD DPNDPDREAF RGDGVMHCVA LSWELPFDVG NEIQGDTLRF DLGFYTEQAR HNDGTGPDLP A
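The sequences above are printed in blocks of ten characters separated by spaces. As a minimal sketch, the code below shows one way to strip that spacing, compute the GC fraction, and translate the nucleotide sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in the permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with the
# amino acid assignments shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence in this record.
prefix = "ATGGCAGACA ACGACAACAT CGACACAATC"
print(translate(prefix))  # MADNDNIDTI
```

The translated prefix matches the first block of the protein sequence. Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content field of the record.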