Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_5036 |
Symbol | |
ID | 8745842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013748 |
Strand | + |
Start bp | 22809 |
End bp | 24488 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 646515650 |
Product | hypothetical protein |
Protein accession | YP_003406597 |
Protein GI | 284176321 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 0.605501 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGATGC CGGATACGCC CGACAACGAT CCCGAGGAAA CCGATTCTGA CGGGGCCAGC GGTGGGGTAT GGGCGGCGAT TTGTGCCCTC CCCGGCCGTC TTTGGGCAGT ACTGGCGTCG ATTCTGGCGA CGCTCCTCGT CGGTCTCCGT GGCGTCGGTC GCCGAGTCCG CGGCGTGTGG ACGACAGTCC GCTACCGGGC CACGCAGGCG CTCGTGAGCT TCATCATGGG CTCGCATCCG AACACCTCGC GATTGCTCGC GACGGCAGTC GTCGGCAGCG GCGTCGCCGC TGGTGTCGGG ATCATGGCCG TGGCCGTCAA CCAAGCCGAA ACGAGTGCGT CCTCCGGCCC GATCGTCCGC TTTGCGCTCG GACTGGCGAC GAGTCCGTGG GTGTGGATTC TCGCAATCGT CCTGTTGTTC CGGCAGGTGC TGTTTTTCGG CGACCGGCTG CTCGCCCGCG TGACCGCAAG CGAGTCGGGA TACAGCTACA AGACGATTCG CCGACTCGCC GAAGAGGCCA AACGCCCCGA CTTAGACCGA TGTGAGCGCG TCCTCGTCCA GTCGGGTGAC AGCGCCGAAC GCATCACCGA GTGGTGTCTC GACGCCCTCT CGGGCAACGG CCACGACGCA CCGACGTTCA ACCCGCCCGG TGCCGACGAC GACTCGAGCG ACGAGACCAA CCTCCCACGC CAGCGGACGG CCGCCGACGA ACCGATCGAC GTCGATCCGG CCGACGATGA CGACGAGCCG GACTTTTGGA CGCAACTCCG CTTGTTCCGC CTCGAGTTGG GATCGGCGAT CGACCTGAAC GGGATTCTCT GGCGGTTTCT CGTCCCGGCC GGACTCACGT TCGTCGGCAT CATGCTGTGG CTTCGGATCT GGATACAGCC GTGGGTCCTG CCGGTCGTCG TCGCGGCCGC TGCGTTCGTC GGCGGTGCCT ACTACTGGGC CGTCGACCTC CGACATCGCC GCCGCCTCAA GGCGCTGCGT GCCGAGGAGA CCACGACGCA GTGGACCGAC CTCGCAATCT TAACGAAGAC CGTCGAGGTG CCCGAGACGA CGATGTACTA CGGCTATCTC GATGGCAAGG TCTACGCCAG CGAGGACAAA CGCGACCTCG CCGAGACACT CGCCGACCGG GCGCTCGACC GACTCGAGGG CCGCCAGCCC GCGCCAGCGA TCGAAGAAAA GAACGCCTAC CTGTTGAAGC GCTATCTCCC CATGCTCGAG GCGTGGGAAC AGGAATACGA ACGCAAGGCG ATCATGGACC AACTCATCGA TACGGTCGCC GACGCGCCGG AGGGACTGCT CCCACGGGAT ATTCTGATTG ACGAAGTAGT CGAGTACGAC CGCCGCTACG TCGCGTTCGG GCTGTTGTTC ATCGGCCGCG GCCGTGACCC CGATCTCGTC CGCGAGGTCT ACCAGGACTT AGTCGAGATT CACGCCCTCG CCGAGACGCC GGTGACGGTA CAGGACACCG AGACCGGCGG CGAACGCGAG CTGATCGCTG TCTCGAAGGG TGACGACTCG TTCCCGCCGA ACGTCGTGCA ACTTCGCGGC GAGTTCTCGA GTCTGTTCGG GAAACAAGCC TTCCAGACGC GGTACGACGC CCCGGAGATC GACGCGAACA CGACACCCGC ACCGTTCATC CGACCGGAAA CCCGCGAGAC TGCCGACTAA
|
Protein sequence | MEMPDTPDND PEETDSDGAS GGVWAAICAL PGRLWAVLAS ILATLLVGLR GVGRRVRGVW TTVRYRATQA LVSFIMGSHP NTSRLLATAV VGSGVAAGVG IMAVAVNQAE TSASSGPIVR FALGLATSPW VWILAIVLLF RQVLFFGDRL LARVTASESG YSYKTIRRLA EEAKRPDLDR CERVLVQSGD SAERITEWCL DALSGNGHDA PTFNPPGADD DSSDETNLPR QRTAADEPID VDPADDDDEP DFWTQLRLFR LELGSAIDLN GILWRFLVPA GLTFVGIMLW LRIWIQPWVL PVVVAAAAFV GGAYYWAVDL RHRRRLKALR AEETTTQWTD LAILTKTVEV PETTMYYGYL DGKVYASEDK RDLAETLADR ALDRLEGRQP APAIEEKNAY LLKRYLPMLE AWEQEYERKA IMDQLIDTVA DAPEGLLPRD ILIDEVVEYD RRYVAFGLLF IGRGRDPDLV REVYQDLVEI HALAETPVTV QDTETGGERE LIAVSKGDDS FPPNVVQLRG EFSSLFGKQA FQTRYDAPEI DANTTPAPFI RPETRETAD
|
| |