Gene Information |
Locus tag | Htur_3989 |
Symbol | |
ID | 8744617 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | + |
Start bp | 244512 |
End bp | 245732 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646514564 |
Product | hypothetical protein |
Protein accession | YP_003405511 |
Protein GI | 284167233 |
COG category | [S] Function unknown |
COG ID | [COG5441] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
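The coordinate and length fields above are mutually consistent, and the arithmetic behind them can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAA). The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced only for illustration; they do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    last codon is a stop codon that does not contribute a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Htur_3989.
length = gene_length_bp(244512, 245732)
print(length)                      # 1221
print(protein_length_aa(length))   # 406
```

Both results match the Gene Length (1221 bp) and Protein Length (406 aa) fields reported in this record.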
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.138068 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGTCG TCATCATCGG AACGTTGGAT ACGAAAGCGG AAGAGATCGG CTTCGCCAGG GACGTCCTCG AAGCCCAGGG TGTGGACGTC CACGTCGTCG ACGCGGGCGT GATGGGCGAA CCGGGATTCG AGCCGGAGAC GACTGCGAGC GAGGTCGCCG ATGCAGCGGG AACGACCCTC GAGCACCTCC GCGAGGAGGC TGACCGCGGC GAGGCGATAG AGGCGATGGG CGATGGCGCG GCCGAGGTCG CCCAGCGACT CCACGATGAG GGTGTCCTCG ACGGCGTCCT CGGATTGGGC GGGTCGGGAA ACACCTCGAT CGCGACGGCG GCCATGCGGG CGCTGCCCGT CGGCGTCCCG AAGGTCATGG TTTCGACGAT GGCGTCGGGC GATACGGAGC CCTACGTCGG GTCCCGGGAC GTCACGATGA TGTACTCGGT CGCGGATATC GAGGGGTTGA ACCAACTTTC GCGGCGGATC ATCTCCAACG CCGCGCTGGC GATGGTCGGC ATGGTCTCGA ACGACCCCGA CGTCGACGTA GAAGAGCGAC CCACGATCGC CATGACGATG TTCGGCGTCA CGACGCCCTG CGTGCAGGCG GCCCGCGAGC GACTCGAGGA CATGGGCTAC GAGGCGATCG TCTTCCACGC CACCGGGACC GGCGGGCGCG CAATGGAATC GCTCGTCGAG GAGGGTGTCG TCGACGGTGT GCTCGACGTC ACGACGACCG AGTGGGCCGA CGAACTGGTC GGCGGCGTCC TGAGCGCCGG TCCGGACCGA CTCGAGGCGG CCGGCGACGA GGGCATCCCG CAGGTCGTGT CGACGGGCGC GCTCGACATG GTCAACTTCG GCCCTCGCGA TTCGGTCCCC GAGGAGTTCG AGGGCCGTCA GTTCCACGTT CACAACCCGC AGGTGACACT CATGCGGACG ACGCCCGAGG AAAACGCCGA ACTCGGGGAG ATCATCTCCG AGAAGCTCAA CGACGCGACC GGACCCACCG CGCTCGTCCT TCCCCTCGAG GGCGTCTCGG CGATCGACGT CGAGGGAGAG GACTTCTATG ATCCCGAGGC CGACGCGGCG CTGTTCGACG CGCTCCGGTC GTCGCTCGAA GACGACGTCG AACTCCTCGA GATGGAGACC GACATCAACG ACGAGGCCTT CGCGGCGAAA CTGGCGGAGA CCCTCGACGG GTACATGCGA GAGGCTGGAC GAGCCCCGTA A
|
Protein sequence | MSVVIIGTLD TKAEEIGFAR DVLEAQGVDV HVVDAGVMGE PGFEPETTAS EVADAAGTTL EHLREEADRG EAIEAMGDGA AEVAQRLHDE GVLDGVLGLG GSGNTSIATA AMRALPVGVP KVMVSTMASG DTEPYVGSRD VTMMYSVADI EGLNQLSRRI ISNAALAMVG MVSNDPDVDV EERPTIAMTM FGVTTPCVQA ARERLEDMGY EAIVFHATGT GGRAMESLVE EGVVDGVLDV TTTEWADELV GGVLSAGPDR LEAAGDEGIP QVVSTGALDM VNFGPRDSVP EEFEGRQFHV HNPQVTLMRT TPEENAELGE IISEKLNDAT GPTALVLPLE GVSAIDVEGE DFYDPEADAA LFDALRSSLE DDVELLEMET DINDEAFAAK LAETLDGYMR EAGRAP
|
| |