Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_5119 |
Symbol | |
ID | 8745667 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013747 |
Strand | - |
Start bp | 16292 |
End bp | 18106 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646515476 |
Product | hypothetical protein |
Protein accession | YP_003406423 |
Protein GI | 284176146 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0474435 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCGTT CATCCGACTC CGCGACCGTC TCCCGCGTCG GCACGACGGA GATCAGGGAC GGCGCCGGAA TTACGGACGA ACGCTCCGAT ACCGCGCCCG ATGGCTGGCG CGCGACGCTC GTTCGTTCGC TCGCCACACT CTATCCCGCC GCCGTCGAAC CGTCCGACGA CCTCGAGGAA GCGCTATCGT TCGTCGGTTC AGCCCACAAC GCCGAGACGA TCGTTCGAGC GGGATACGGG GCCGGTATCT TAGCAGTAGT GCCGCCACTC CTGTTGCTCC CACTGGGAGC GCCGCTCTCG TTCGTTCTGT TTTTCACTCT CGTGGCGCCG GTCGCGACGA TCTACACAGT TCGGTCGCTC CCGCGTCTCC AGGCCGCGTT CCGGCGGACG GAAGCGCTCG GCGAGACGCC GAACCTCATC GGTCGCGCCG TCCTCCGCAT GCAGGTCCAG CCGTCGCTCG AGAGCGCCGT CCGGTTCGCG GCCGACACCG GAACGGGCCC GCTCGCCGCC GATCTCGCGG CACACATCGA CCGCTCGATC GGGACGCCGG AGACGGGGAT ACTCTCGTTC ACCGAGGTGT GGGCCGATCG ATTCCCGGCG CTTCGGCGCT CGGCGCATCT GCTCGCGGCG GCCCAGGACG CGCCGGCAGG CGAACGCGAG CGGACGCTCG ATCGGTCGCT CGCAGCGGTT CTCGACGGAA CACGCAACCA GATGGCGGAA TTTACGGCCT CGATTCGGAC GCTGACGACC GGGCTGTTCG CGTTCGGGAT CATGATCCCG CTGGCGCTGA TCGCGCTCGT CCCGACGGTT CCGATGGTCG GCGTCTCGAT CAACATCTGG GTCCTGGTCT TCCTCTACAA CGTCGTGTTG CCAGCCTGTC TGGTCGTCGC GGGGCTGTAC CTACTCGTTC GTCGACCGGT CGCGTTTCCG CCGCCGACGA TCGGGCGTGA CCATCCCGAC GTTCCCGACC GGCTGTGGCT CCGGGCACTG TGGGGCGTCC TCGCCGGGAT CGGCGTCTAC GCGATCGTAG ACGCGGTCGG CCCCGCACAT CTCGCGCCGA TCGTCGCCGG CGGCTTCGCG GTCGGTGTCG CGCTGCTGGC CGTCTACGGC CCGATCCTCG CGGTCCGTCA CTACGTTCGC GAGGTCGAGG ACCACCTCAC CGACGCCCTC TACATCGTCG GCCGGCAGGT CTCCGACGGC GAGTCCGTCG AGTCGGCGGT CGACCTCGCG GCGACCCGCG TCCCGGGGGA GACCGGCGCC GTCTTCGAAC GCGCCGCCGG ACTGCAGCGA CGCCTCCAGA TCGGCGTCGA ATCGGCGTTC CTCGGCCCCC ACGGCGCGCT CGAGGACGTT CCCAGTCCCC GCGCTCGAGG CACGGCTGCG CTGTTGGCGA TCGCGTCGAA GGAGGGCAAG CCTGCTGGAC GGGCGATCGT CTCTATGGCC GACCACTTAG AGGAGCTATC GGAGGTCGAA GCCGAGACGA AACGGAATCT CGCAAAGGTG ACGGGAACGC TCGACGCCAC GGCCGCGTAC TTCGCGCCAA TGGTCGCTGG TGTCACCGTC GGCATGGCCG CGATGATGGC CAGCCAGAAC GTATTCACCT CGAGCGAAGT CGACGCGGCC GCGTTCCCCG CCGAACCCCT CGCGATCGTG ATCGGGATCT ACCTCGTCAT GCTGTGTTTC ATCCTGCTGC CGCTCTCGAT CGCCCTGCGC CACGGCGTCG ATCGGGCGCT GATCGGCTAC CACGTCGGCC GCGCCCTGAC GACCTCGATG GTGCTGTACG CGGTGACCGT CGGACTGATA GACGTCTTCC TCTAG
|
Protein sequence | MARSSDSATV SRVGTTEIRD GAGITDERSD TAPDGWRATL VRSLATLYPA AVEPSDDLEE ALSFVGSAHN AETIVRAGYG AGILAVVPPL LLLPLGAPLS FVLFFTLVAP VATIYTVRSL PRLQAAFRRT EALGETPNLI GRAVLRMQVQ PSLESAVRFA ADTGTGPLAA DLAAHIDRSI GTPETGILSF TEVWADRFPA LRRSAHLLAA AQDAPAGERE RTLDRSLAAV LDGTRNQMAE FTASIRTLTT GLFAFGIMIP LALIALVPTV PMVGVSINIW VLVFLYNVVL PACLVVAGLY LLVRRPVAFP PPTIGRDHPD VPDRLWLRAL WGVLAGIGVY AIVDAVGPAH LAPIVAGGFA VGVALLAVYG PILAVRHYVR EVEDHLTDAL YIVGRQVSDG ESVESAVDLA ATRVPGETGA VFERAAGLQR RLQIGVESAF LGPHGALEDV PSPRARGTAA LLAIASKEGK PAGRAIVSMA DHLEELSEVE AETKRNLAKV TGTLDATAAY FAPMVAGVTV GMAAMMASQN VFTSSEVDAA AFPAEPLAIV IGIYLVMLCF ILLPLSIALR HGVDRALIGY HVGRALTTSM VLYAVTVGLI DVFL
|
| |