Gene Information |
Locus tag | Htur_5039 |
Symbol | |
ID | 8745845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013748 |
Strand | + |
Start bp | 28084 |
End bp | 29574 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 646515653 |
Product | hypothetical protein |
Protein accession | YP_003406600 |
Protein GI | 284176324 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
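The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon. These assumptions are not stated in the record, though they agree with its values. The function names are illustrative only.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming a terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(28084, 29574)
print(length)                     # 1491
print(protein_length_aa(length))  # 496
```

Both results match the Gene Length and Protein Length fields in the record.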
Plasmid Coverage information |
Num covering plasmid clones | 59 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACCCG ACGACCACGA TCGACCAGCA CAGTCTCCCC GCGAACAGAT CGACGCGCTC GCCGAGGAGT ACGGCGTCGA TGTCGACGAC CTGTTGATCC AGAGTCGCCG CCGCGACCCG ATGTACAAGG GCACAGACGC CGACCACGCG AAAGCCGAGT GGTTCGCCCG CCTCTGGCAG CAAGCCGTCG AGCAACGAGA GAGCGACCGT ATCCACGTCC GCGGCGTCCA CTACACAGTC TACATGTCCG ATATGGACGT CGAGCCACCG ACGAACTGTT CGTGGGAGAG CTACGACAAC ACCCAGCGCT GTTATGACTA CCTCGAAGAG TGTGCTGTCC TCGCGCGAAT CCTCGGCTAC ATCCCGCTGG ATGGGATTAT CGACAAGCGC GCGGATACGC GAACCGTCAC GGAGTATGGA ACCCACACCC TCGAGCCCGA CCCCGAGGGT GTGAGTGCCC CGACCGGGGT TGCAACGCCG ACGATTCCCC ATCCAGAGGC TCGCGCCGGC CTTGTCTTCG ATCCTGCGGA AATCGACTAC TCTCAGTGGG TCGGCGGCCG CGTGGCCTCG AGCGCCCGCG AGCAACTGTC GTTTGACGAG GCCCGCCAGT CGCCGTATCA CATCGAACTG TGGTCTGAAA AGACGCTTCC CGATTACATC CGCGGTCCCG GTGGGCTGGC CGCCGAGTAC GGCTGCAACG TCATCGTCGA AGGCGAGGGC GACCTATCGT TGACCGTCGC GAACGAGCTG GCCCAGCGAA TCGAGGCCGC CGGGAAGCCC GCGGTGATTC TCTATCTTGC GGACTTCGAT CCGAAAGGCT ACGATATGCC GGCGAACATG GCGGGCAAGC TGGCGTGGCT TCACCAGCGC GGCGATCTCG AGCAACGCGT CGCCATCGAG CGGCTGGCCG TGACGAAAGA CCAGATCGAA CAGCTGGAAC TCCCGCGAAA ACCCATCGAG GAGAGTACGG CGACCGGCAC CGGCGGCGTC GCGTACAACC GCCGCGTGAC CGAGTGGGAA GAACAACACG GCGCCGGGGC GACCGAGTTG AACGCTCTCG AGCAACAGCC CGAGGAGTTC CGCCGAATCG TTCGGTCGGC GTTGGAGCGA TACACGGACC CCGACCTCGA GTCCAAGAAC GAACGCCGCG GCGACGAGTG GGAGGACGAC GTCGAATCAC GGATCGAGGC GCGGCTTCGC GAGGCTGGCG CCAATGACGA TCTCGATGAC CTGGAGGCGT GGATCGACGA TTTCAACGAC GCCTATGCGG AGGTCGCGGA CGTATTCGGG CGCTTACGCG GGATGATGGA CGACGAGTCG GCGCTCGGGG CGTGGGAATC GATGGTCGAC GAACTGCTCG CAGACACCGA GTTTCCCGTC GCGACCGTTC CCAAGGGCGA CGCGGCGTTG CCCGATGATC CGATCTACGA CTCGGGGCGT TCCTACGCGG AAAATAAGAT GCGGATCGAT CGGTATCGGG CGTCGGAGTA G
|
Protein sequence | MPPDDHDRPA QSPREQIDAL AEEYGVDVDD LLIQSRRRDP MYKGTDADHA KAEWFARLWQ QAVEQRESDR IHVRGVHYTV YMSDMDVEPP TNCSWESYDN TQRCYDYLEE CAVLARILGY IPLDGIIDKR ADTRTVTEYG THTLEPDPEG VSAPTGVATP TIPHPEARAG LVFDPAEIDY SQWVGGRVAS SAREQLSFDE ARQSPYHIEL WSEKTLPDYI RGPGGLAAEY GCNVIVEGEG DLSLTVANEL AQRIEAAGKP AVILYLADFD PKGYDMPANM AGKLAWLHQR GDLEQRVAIE RLAVTKDQIE QLELPRKPIE ESTATGTGGV AYNRRVTEWE EQHGAGATEL NALEQQPEEF RRIVRSALER YTDPDLESKN ERRGDEWEDD VESRIEARLR EAGANDDLDD LEAWIDDFND AYAEVADVFG RLRGMMDDES ALGAWESMVD ELLADTEFPV ATVPKGDAAL PDDPIYDSGR SYAENKMRID RYRASE
|
| |
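The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how such a display string could be cleaned and its GC fraction computed is given below. The helper names are hypothetical, and only the first two blocks of the gene sequence are used as sample input.

```python
def clean_sequence(display: str) -> str:
    """Strip the whitespace used to show the sequence in blocks."""
    return "".join(display.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two display blocks of the gene sequence in this record.
head = clean_sequence("ATGCCACCCG ACGACCACGA")
print(len(head))                  # 20
print(head[:3])                   # ATG
print(gc_fraction("ATGCCACCCG"))  # 0.7
```

Running `gc_fraction` on the full cleaned gene sequence gives a value that can be compared with the GC content field of the record.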