Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_1006 |
Symbol | |
ID | 8741591 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 1037323 |
End bp | 1039572 |
Gene Length | 2250 bp |
Protein Length | 749 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 646511584 |
Product | von Willebrand factor type A |
Protein accession | YP_003402573 |
Protein GI | 284164294 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1239] Mg-chelatase subunit ChlI |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTGCAG AGTCCGGAGA CAAAAAGCTG TCGTCACTCC CCTTTCCGGC CATCGTCGGC CAGAACGAGT TGAAGCGGGT GCTGCTCGCC GTCGCCACCA ACGACGGCCT CGACGGCGCC CTGATCGTCG GCGAGAAGGG GACCGCGAAG TCGACCGCGG TGCGAGGGCT CGTGGATCTC CTCCCCGAAC AGCGGGCCGT GGCCGACTGT CCGTACGGCT GTGCGCCCGA CGCGCCGAAT CTGCAGTGCG ACGACTGCCG CGAACGCGAT CCGGAAGCGA TGCCCGTCGA AACCCGGCCA GTTCCGCTCG TGACGCTTCC GCTGGGTGCG ACTCGAGACC GCGTCGTCGG GACCCTCTCG GTCGAGGACG CGCTGGCCGG CGCGGCCGAC TTCGATCCCG GACTGCTCGC CCGCGCTCAT CGGGGCATCC TCTACGTCGA CGAGGTCAAC CTGCTGGACG ACCACCTCGT GGACGTCATT CTCGACGCGG CCGCCAGCGG GGTCAATACG GTCGAGCGTG ACGGAGTCAG CGTCTCTCAC CCCGCCGAGT TCACCCTCGT CGGGACGATG AACCCCGAGG AGGGCGACCT CCGCCCCCAA CTGCGGGACC GCTTCGCCCT GCAAGCCACC GTCGAGGGCT GTCGTGAGAT CGACCAGCGC GTCGAGATCA TCGATCGGGC GCTCGAGACG GACGCCAGTG GCGCTGGCGG ATCTGACGGG CCGGAACCCT GGACCGAGTA CGCCGACGAG ACGGCGACCC TGCGCGAGGA ACTCGCGGCG GCCCGCGACC GCCTTGCCGA AGTCACGCTC CCGGACGATT TCAAGGCGGA GATCGCCGAC CTCTGTCTCG AGGCCGGCGT CGACGGCCAC CGCGGCGACG TGGCGACCGC TCGGACGGCG ATGACCATTG CCGCGCTCGA GGGCCGAACG ACGGTCATCG AGTCCGACGT TCACGAGGCC GCCAGCTACA CGCTTCCCCA CCGCCTCCGG AGCACGCCGT TCGAAGACGA ACCCGACGTC GAGGACCTGC TCGAGGACCG CTTCGACGAG GAGTCGACGG ACGAGAGCGA AAGCGAGGAC GATGCGGACG GCGACGAGTC CGCGGACGGA GGCGACGGAG ACGAGTCCGA GACCGAGAGC GACGACGGTG AGACGGGAAC CGAACCCGGC GGCGAGCGCA GCGACGGTGA CGGTGACGAC GGCGAATCCG ACGGGAACGA CGGCGACGCC GATTCCGAGA CCGACGACTC GAGTCCCGAG CGTCGACCCG ACGAGGGCGA CGACCCGACC GGCGGTGAGC CGTCGCCCGC CGAAGCCGGC AACGGGAGCG ACGAGACCGG CGACTCGAGC GAATCCGATT CGAACGACGG TGACGGGAGC GACGGAGACG GCGAAAGCGG CGACGAGGGA GACGAGAACG CCCAGCCGCT CGTCCCCGGC CAGCAGCGCG CCGACGTCGC CGCGGTGGGC GAGGCCACGT CGCCGGACCT CGAGACGCCC GACGCCGAGA GCGCGGACGC GGCGACGGCG AGCGGTTCGC GAGCGAGCAC GGCCCCAAGC ACGGACAACC GCGGTGCTCG AGTTCGGACC GAACCCGCGT CGGGGGACGG CCCTATCGAC GCGGCGGCGT CGGTTCGCTC CGCCGCGACC CGCGGCGACT CCCGGGTACA GAAGCGGGAC TTGCGTCAGT CGGTCCGCAC CGGCGACACG TCGGCGACGA TCGTCTTCGC GGTCGACGCC AGCGCCTCGA TGCGCCCCGC GATGCGCACC GCGAAGGGCG TCGTCCTTGA TCTGCTGCGG GACAGCTACG AGCACCGCGA TCGGGTCGCG TTCGTCGCCT TCGCCGGCGA AGACGCCGAC GTCCTCCTGC CGCCGACCGA CAGCGTCTCG CTGGCCGCGC GCCACCTCAA GGAACTCCCC TCGGGCGATC GGACGCCCCT TCCCGCGGGC CTCGAGACCT CGCGGCGAGT CCTCGAGCGC GCGGAGACGG ACGCCTCGGT CGTCGTCTGC GTGACCGACG GCCGGGCGAA CGTCGCCGAC GGCAGTCCGA CCGAGGCGAC GCGACGGGCG GCCCGAGGGC TCGCGGCCGA GGACGCGACC GTGATCGTCG TCGACGCCGG CGACGACTCT CGGGCCGGCC TCTCGGAGCT GGTCGCCGCC GAAACGGGGG GCGAAGTGGT CGCCCTCGAG GCGCTGTCGT CCGAGACGGT CCGAGCGGCG GCGGACCACG CCGCATCGGG GGGACAGTAA
|
Protein sequence | MVAESGDKKL SSLPFPAIVG QNELKRVLLA VATNDGLDGA LIVGEKGTAK STAVRGLVDL LPEQRAVADC PYGCAPDAPN LQCDDCRERD PEAMPVETRP VPLVTLPLGA TRDRVVGTLS VEDALAGAAD FDPGLLARAH RGILYVDEVN LLDDHLVDVI LDAAASGVNT VERDGVSVSH PAEFTLVGTM NPEEGDLRPQ LRDRFALQAT VEGCREIDQR VEIIDRALET DASGAGGSDG PEPWTEYADE TATLREELAA ARDRLAEVTL PDDFKAEIAD LCLEAGVDGH RGDVATARTA MTIAALEGRT TVIESDVHEA ASYTLPHRLR STPFEDEPDV EDLLEDRFDE ESTDESESED DADGDESADG GDGDESETES DDGETGTEPG GERSDGDGDD GESDGNDGDA DSETDDSSPE RRPDEGDDPT GGEPSPAEAG NGSDETGDSS ESDSNDGDGS DGDGESGDEG DENAQPLVPG QQRADVAAVG EATSPDLETP DAESADAATA SGSRASTAPS TDNRGARVRT EPASGDGPID AAASVRSAAT RGDSRVQKRD LRQSVRTGDT SATIVFAVDA SASMRPAMRT AKGVVLDLLR DSYEHRDRVA FVAFAGEDAD VLLPPTDSVS LAARHLKELP SGDRTPLPAG LETSRRVLER AETDASVVVC VTDGRANVAD GSPTEATRRA ARGLAAEDAT VIVVDAGDDS RAGLSELVAA ETGGEVVALE ALSSETVRAA ADHAASGGQ
|
| |