Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_5209 |
Symbol | |
ID | 8745757 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013747 |
Strand | + |
Start bp | 99106 |
End bp | 100575 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646515566 |
Product | hypothetical protein |
Protein accession | YP_003406513 |
Protein GI | 284176236 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGGA CAGCGACGAC CGATCGATCG GGCCCCTCGG CGCGGCGGCT GCTCGGCGTG CTCTTCCGCG AGGAGTGGCG GCTCCACACC CAGCTGTTCG GCGGCTGGCG GTTCGCGCTC TTCCCGGAAG TCATCGCCGT GCTCGCGGTG GGTGCGACGG TGGCGTTGCG GGAGACGGGG ACTGCGGACG GAACGATCCT GATAGGGCTT CACGTCCTCG CGCTCGGTTT CGGCCTCTAC AGCGGGACCG CCGGGTTCGC GGGTTCGGAT ATGCTCGAGA ACGTGTTCGG CGAGCTCTCG TTGCTCCTCT CGTCGTCGAC GACGCTCCCG CTGTCCCGAC GGCGCCTGCT GGGCGTCTTT CTGCTGAAGG ACGCGCTGTT CTACGCGGTC GCGTTCGTCC TGCCGATGGC GCTCTCGAAC GCCGTACTGG CCGACCGTCT CGCGGGGGCG CCAGTCGCGG TGGGAGCGCT GTGGCTCTCG CTGTCACTGG TGTTCGCCGC CGGGATGGCC CTGACCGTCG CGCTGATCGC GGTCCGGACC CGCGGCGTGC CGACGTGGGC GATTACCGGT GCGACCGTCG TCGTCGCCGG CGCGGCGTGG CTGACGGGAA CCGGCGGGGC CGTCTGGAAC GCGTTCGTGC CGATCGAGGG GGAACCAGCC AGTGCGCTCG GACTGGCGGT CGGAACCGGA GTCGTCGGCG CGGCGTCGCT CGCCCTGTAC GATCCGACCT ACGGCCGGCC GTCGCGGACC GCCAGCGATC GGTTCGCCAG CCTCAGCGAC GCGTTGCCGG ACGAGGTTGT CGGCGCCGAC AGCGCGCTCG TCTCGAAAAC GCTGCTCGAT CTGGCCCGTT CCTCGGGCGG CGTCATGAAG CCGTTCGTCT CCGCAGCCAT CCTGCTGGCG CTGGTCGCCG CGCTCGTCGG CGTCGTCGAC TCGATCACCG GGATCGCACC GGCACCGGGC GTCTTCTTCG GCGGCGTGCT GGGGCTGACC GCATTTACCA CGTACAACTG GCTGACCCAG TTCGACTCGC TCGAGGCCTA CCTCGCCTAT CCCGTCTCGA TCGACGACGT CTTCCGGGCG AAACGGATCG CATTCGTCCT CGTCGGCGCG CCGACGGTCG CGGTGCCGTA CCTCGCGGCC GTGCTCTGGT TCGAGGCGAC GCTGGTCGAC GCAGTCGTCG GCGCGATCTT GCTCGCGGGC TACGCGCTGT ACTACTACGG GCTGACCGTC TACATCGCCG GCTTCGATCC CAACGAGTTC CTCTTCGACG CCGTGCGGTT CTCGCTGTTC ACCCTCGGCG TCGCTGTCGC ACTCGTGCCG ACGCTCGTCG CCGGGTTCGT CGTGGTGCCG CCTACCGGGG CCGTCGCGGC GGCGCTGGGT GTTGGTGGGG TCGGATTCGG TGTCGTCGGG TTGGTTCTCT CGAGTCGGGC CGGACCGCGG TGGGAGCAGC GGTCTCGAGA TGGCGACTGA
|
Protein sequence | MSGTATTDRS GPSARRLLGV LFREEWRLHT QLFGGWRFAL FPEVIAVLAV GATVALRETG TADGTILIGL HVLALGFGLY SGTAGFAGSD MLENVFGELS LLLSSSTTLP LSRRRLLGVF LLKDALFYAV AFVLPMALSN AVLADRLAGA PVAVGALWLS LSLVFAAGMA LTVALIAVRT RGVPTWAITG ATVVVAGAAW LTGTGGAVWN AFVPIEGEPA SALGLAVGTG VVGAASLALY DPTYGRPSRT ASDRFASLSD ALPDEVVGAD SALVSKTLLD LARSSGGVMK PFVSAAILLA LVAALVGVVD SITGIAPAPG VFFGGVLGLT AFTTYNWLTQ FDSLEAYLAY PVSIDDVFRA KRIAFVLVGA PTVAVPYLAA VLWFEATLVD AVVGAILLAG YALYYYGLTV YIAGFDPNEF LFDAVRFSLF TLGVAVALVP TLVAGFVVVP PTGAVAAALG VGGVGFGVVG LVLSSRAGPR WEQRSRDGD
|
| |