Gene Information |
Locus tag | Htur_4780 |
Symbol | |
ID | 8745370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | - |
Start bp | 394304 |
End bp | 395635 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 646515278 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003406225 |
Protein GI | 284172843 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
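The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 394304
END_BP = 395635


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1332, matches "Gene Length"
print(protein_length(length_bp))  # 443, matches "Protein Length"
```

The strand does not affect this arithmetic: for a gene on the minus strand the start and end positions are still reported in ascending order.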
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000717269 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGTCG GCGCAGTCGC CGGCGGCCTG CTCCCATCGG ATCTCGTGGC TCCCGCCGGC GTCGCGGCGC TGCTGGTGTT GCTGGTCCTG TCAGGGTTCT TCTCCTCGGC GGAGATCGCG ATGTTCTCGC TGGCCCAGCA CCGCATCGAG GCGCTCGTCG AGGACGGCAC TCCCGGCGCC GAGACCGTCC AGGCGCTCAA GGACGATCCC CATCGGCTGC TGGTGACGAT CCTCGTCGGG AACAACCTCG TCAACATCGC GATGTCGTCG ATCGCGACCG GACTGTTCGC GATGTACATG AGCCAGGGCC GAGCGGTGCT GGCGGCGACG TTCGGCGTGA CGGCCGTCGT CCTGCTGTTC GGCGAGAGTG CTCCCAAGTC CTACGCCATC GAGAACACCG AATCGTGGGC GCTGTCGGTC GCTCGTCCCC TCAAAATCTC GGAGTACGCG CTGTTTCCGC TCGTGATCAC GTTCGACTGG CTGACCCGCG TAATCAATCG GCTGACCGGC GGCGGTACGG CCGTCGAAGA GTCGTACGTG ACCCGCGAGG AACTCCGGAA CCTGATCCGG ACCGGCGAGA GCGAGGGGAT CATCGAGACC GACGAGCGCG AGATGCTCCA GCGCGTGTTC CGGTTCACCG ACACCATCGC CAAAGAGGTG ATGACGCCGC GACTCGACGT CACCGCGGTC GCTCGAGAGG CCAGCGTCGA CGAGGCCGTC GCGAAATGCG TCGAGAGCGG CCACACCCGC CTGCCGGTCT ACGACGGCGA CCTCGACACC GTCGTCGGCG TCGTCGAACT CGGCGACCTC GTCCGCGACC GTCAGTACGG CGAGACCGAA GACGAAACCC TCGAACTGTA CCTCGAGGAG ACGCTGCACG TACCGGAGAG CAAGCAGGTC GACGAACTGT TCCGCGAGAT GCGCCAGCAG CGCGTCGAGC AGGTCGTCGT CATCGACGAG TTCGGAACGA CGGAAGGGAT CGTCACGACC GAGGACATCG TCGAGGCGAT CGTCGGCGAG ATCCTGGAGA CGCAGGAGGA CGAACCAATC GAGGTCGTCG ACGACCGAAC CGTTCGCGTC AACGGCGAGG TCAACATCGA GGACGTCAAC GACGCCCTCG AGATCGACCT GCCGGAGGGC GAGGAGTTCG AGACGATCGC CGGCTTCGTC TTCAATCTCG CCGGCCGACT GGTCGAACCC GGCGAGACGT TCACGTACGA CGGTGTCGAT CTCACGGTCG AAACCGTCGA TACGACGCGC ATCAAACGCG TTCGGATCGT CGAACCGGAA CCGTCGGCGA CTGATGACTC CGGTATCTCC GCGACGAGTT GA
|
Protein sequence | MLVGAVAGGL LPSDLVAPAG VAALLVLLVL SGFFSSAEIA MFSLAQHRIE ALVEDGTPGA ETVQALKDDP HRLLVTILVG NNLVNIAMSS IATGLFAMYM SQGRAVLAAT FGVTAVVLLF GESAPKSYAI ENTESWALSV ARPLKISEYA LFPLVITFDW LTRVINRLTG GGTAVEESYV TREELRNLIR TGESEGIIET DEREMLQRVF RFTDTIAKEV MTPRLDVTAV AREASVDEAV AKCVESGHTR LPVYDGDLDT VVGVVELGDL VRDRQYGETE DETLELYLEE TLHVPESKQV DELFREMRQQ RVEQVVVIDE FGTTEGIVTT EDIVEAIVGE ILETQEDEPI EVVDDRTVRV NGEVNIEDVN DALEIDLPEG EEFETIAGFV FNLAGRLVEP GETFTYDGVD LTVETVDTTR IKRVRIVEPE PSATDDSGIS ATS
|
| |
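The gene sequence above begins with ATG and ends with TGA, which suggests it is already given in coding orientation even though the gene lies on the minus strand. Under that assumption, the sketch below shows one way to strip the 10-base grouping, compute GC content, and translate the sequence. It uses the amino-acid assignments shared by NCBI translation tables 1 and 11 and ignores the alternative start codons that table 11 allows; the helper names are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
prefix = "ATGCTCGTCG GCGCAGTCGC CGGCGGCCTG"
print(translate(prefix))  # MLVGAVAGGL, the first 10 residues of the protein sequence
```

Passing the full gene sequence to `gc_fraction` and `translate` should allow a comparison with the GC content and protein sequence fields in this record.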