Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3550 |
Symbol | |
ID | 8744170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 3652816 |
End bp | 3654408 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646514131 |
Product | conserved repeat domain protein |
Protein accession | YP_003405085 |
Protein GI | 284166806 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAGAC GGCTGCGCTG GCGCGGCGCG ATCGCGGCGA CGGTCGTGCT CGTCCTCGCC GGCCTGCTCG ACGCCAGTCC CGTCCTGTTG CTCAGTGCGA TCGTCCCGCT CGTCTACGTG GCCTACGGCT CGCTGTCGAC CGTCTCCGTG CCCGAGGGAC TCGCGGCGAC CCGCGAAATC TCGCCGACGC CGGCGGCGCC CGGCCGACCG GTCACCGTGA CGCTGACGGT GCGAAACGAG TCCGATCGGA CGGTCACCGA CTGTCGACTC GTCGACGGCG TCCCCGAGGA ACTCGCCGTC CTCGAGGGAT CGCCGCGGGC CGGCGTGACC CTCGAGCCCG GCGAGGAACG CCGCATCGAG TACTTGGTCG TCGCCAGACG CGGCGAGTAC AAGTTCGACG CCCCGGAGTG TCGCGTTCGC GGACTCGGCG CGAGCGCGGT CGCGACGACG CGGCTGTCGA CCACGGGCGC GGAACGGCTG GTGTGTCGGC TCGACGCCGA CGCGCCGCCG ATCGAGGAGA TCGGACGCGG CCGAATCGGA CAGTTGACGA CCGACCGACC CGGCGAGGGG CTCTCCTTTC ACTCCGTCCG CGAGCATCGG CCGGACGATC CGGCCGATCG GATCGACTGG CGCCACTACG CGAAACGCGG GACGCTCGCG ACGATCGAGT ACGAGCGCCA GGTCGCGGCG ACGGTCGTGC TGGTCGTCGA CGCCCGCCCG TCCAACGCGG TCGTCGCGGG GCCCGGCCGC CCGACCGCCG TCGAGTTCGC GGCCTACGCG GCGACCCGGA CGCTTTCGGA CCTGCTCGGA CACGGCCACG ACGTCGGCGT CGCCGTCGTC GGCCGCGACG GCAACGGGCC CGCCGGCCTC CACTGGATCG AGCCGGCGAA CGGCCGCGAG CAGCGCACGC GCGCACTCGA GGTCATCCGC TCGGCGACTT CCTCGAAGGC CTCGAGCGGG CGGTTCTCGG AACCTTCGTC GAGGAATCGG GACCGCGGAC GACTGCCCCG ACAGGTCCGT AAGGTGCGCG AACTCGCGCC AGCGGGGGCG CAGGTGGCGC TGTTCTCGCC GGTCCTCGAC GATCAGTCGG TCACGGCGGT CGAGCGCTGG CGCGGGGGCG GTCTCCCCGT CGTCGTGCTC TCGCCGGACG TCGTCCCGGG CAACACCGTC AGCGGCCAGT ACGCACAGCT TCGGCGGCGG ACCCGGCTGG CCCGCTGTCA GGCGCTGGGC GCCCGGACCT TCGACTGGCG CCGCGGGACG CCGCTGCCAG TCCTCATCGA ACACGCCTTC ACCGCCGACG CGCGGCTCTC GAGCGCCCGG CTCTCGGGCG GATCTGGCGG CGGTCGCGGC CGGAGCGGCG GCGAAACCGA CAGCGAGACC GGATTCGGTG GCGAAGCCGG ATCCGGGACC GGTGGCGTCG ATTCGACGGC GTCGGCGACG CTCGAGTCGG AATCGGAGGC GACCGAGTCG GAGTCGACGA CGGCCGAGTA CACGTGGCGG TCGCCGGCCG ATGGCTCCGA GGGGACCGAA CGCGAGAGAT CGGTCTCGAC CGACGCCGAC GGCGCCTCGG CGAAACGAGG AGGTGATCGG TAG
|
Protein sequence | MSRRLRWRGA IAATVVLVLA GLLDASPVLL LSAIVPLVYV AYGSLSTVSV PEGLAATREI SPTPAAPGRP VTVTLTVRNE SDRTVTDCRL VDGVPEELAV LEGSPRAGVT LEPGEERRIE YLVVARRGEY KFDAPECRVR GLGASAVATT RLSTTGAERL VCRLDADAPP IEEIGRGRIG QLTTDRPGEG LSFHSVREHR PDDPADRIDW RHYAKRGTLA TIEYERQVAA TVVLVVDARP SNAVVAGPGR PTAVEFAAYA ATRTLSDLLG HGHDVGVAVV GRDGNGPAGL HWIEPANGRE QRTRALEVIR SATSSKASSG RFSEPSSRNR DRGRLPRQVR KVRELAPAGA QVALFSPVLD DQSVTAVERW RGGGLPVVVL SPDVVPGNTV SGQYAQLRRR TRLARCQALG ARTFDWRRGT PLPVLIEHAF TADARLSSAR LSGGSGGGRG RSGGETDSET GFGGEAGSGT GGVDSTASAT LESESEATES ESTTAEYTWR SPADGSEGTE RERSVSTDAD GASAKRGGDR
|
| |