Gene Information |
Locus tag | Htur_3876 |
Symbol | |
ID | 8744504 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | + |
Start bp | 107114 |
End bp | 108829 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646514460 |
Product | Glycoside hydrolase 97 |
Protein accession | YP_003405407 |
Protein GI | 284167129 |
COG category | |
COG ID | |
TIGRFAM ID | |
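
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. Both assumptions fit the values in this record, but the page does not state them. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 107114
END_BP = 108829
GENE_LENGTH_BP = 1716
PROTEIN_LENGTH_AA = 571


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for this record under the stated assumptions.
    assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print(gene_length(START_BP, END_BP), protein_length(GENE_LENGTH_BP))
```

Under these assumptions, 108829 - 107114 + 1 gives 1716 bp, and 1716 / 3 = 572 codons, one of which is the stop codon, leaving 571 residues.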
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0907495 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCAGC TCACGGATCC CGACGGTACC ATCACGATCG ACGTGGACCC GAACGGGACG TTGCGCGTCG ACCGATCCGG AGAGACGGTT TTCGAACCGT CGCCCTACGG GCTTCCGACG CCGTACGGCT CGTTCCCCGA GGAATTCGAA CTCGACGACG TGCGAACCCG AAGGATCGAC GAATCGTACG AACTCGTTCA CGGGAAACGG TCGAGCCCGC GCCACCGAGC GGTCGAGAAG ACGCTCTCGT TCGAGGGCGA CGGCGGCTCG GTGGACCTCC AGGTCCGCGC CGCAGACGAC GGCATCGCGT ACCGATACCG GTTGCGAGGC GAGCGAGACG AGTATCTCCA CCCGGGCGAC GAGTCCGGCT TTCGGTTCCC GCCGGGCGCC GTCGCGTGGC TCTCCGACTA TCAGGCCAAC CACGAGGGAC ACAGCCGGCA GGTCCCGGTA ACAGCCGTCG ACGGAGAGTA CAATCTCCCC GGCCTCTTTC ACGTTGACGA CACCTGGGCG CTCGTCTGCG AGGCCGGCGT CGACGGCGAC TGGATGGCCG GCCGACTCGT CGGGACGCGC GACGACGCGC CGGGCGTCGA CATCGGATTT CCCGAGTCCC ACCCGACGTC GCACGTGTGG GCCGACGATC CCGCGACGAC ACCGTGGCGC GTCGCGATCG TCGGCGACCT CGCGACCGTG GTCGAGTCGA CGCTCCCGAC CGACCTCGTC GACGGGCCGC GGATCGACGG CGACTGGGTC GAACCCGGTC GCGTCGCGTG GTCGTGGTGG GCGAGCGGCT CGAGCGTCCG GTCGCTCGAG GAGGAGCGCG AGTACGTCGA CTACGCCGCG GAGCGAGGGT GGGAGTACGT GTTGGTCGAC GCCGGCTGGG ACGACGAGTG GCTCCCCGAC CTCGTCGCGT ACGCGAACGA CGCCGGCGTC GACGTCGAGC TCTGGTCCCA CTTCATCGAC CTGAACACGG AGTCCAAGCG CGAAGAGCGC CTCTCGAGGT GGGCCGAGTG GGGCGTCGCC GGCATCAAGG TCGACTTCAT GGACAGCGAC GATCAGGGAC GGATGCAGTT CTACGACGAC CTCGCTCGCG CCGCCGCCGA ACACGAACTG ACCGTGAACT ACCACGGGTC GGCCGTCCCG ACGGGGCTCC GTCGGCGCTG GCCCCACGTC ATGACGTACG AAGGCGTTCG CGGCGCCGAG TACTACAAGT GGACGACGAA CACGCCCGAG CACAACGCGA CGCTCCCCTT CACCCGCAAC GTCGTCGGCC CGATGGACTA CACGCCCGTG ACGTTCTCCG CCGAGCGACG CGCGACGTCG GCGGGTCACG AACTCGCGCT GTCGGTCGTC TACGAGTCCG GACTGCAGCA CTACGCCGAC GGGATCGAGA GCTACGAGAC GTATCCGATC GCAGAGCGCG TCCTCGAGTC CGTTCCGGCG GCCTGGGACG AGACGAGGTT CCTCCGCGGA CGACCCGGGT CGGAGGCCAC GTTCGCGCGA CGGAAAGGCG ACGGCTGGTT CGTCGGCTCG ATCACCGCCG GTCCGGCGGA ATCGATCGAG GTTCCGCTCT CGTTTCTCGA CGGGGAGACG ACGGCCGTCG TCGCGACCGA CGCCGACGAG GGAGACGGTC TCGAGGAGTA CGAACGCGCG GTATCGCCGG ACGAATCGCT TCGCGTGTCG GTCGCGGAGA ACGGCGGCTT CGTCGTCCGG CTCTGA
|
Protein sequence | MVQLTDPDGT ITIDVDPNGT LRVDRSGETV FEPSPYGLPT PYGSFPEEFE LDDVRTRRID ESYELVHGKR SSPRHRAVEK TLSFEGDGGS VDLQVRAADD GIAYRYRLRG ERDEYLHPGD ESGFRFPPGA VAWLSDYQAN HEGHSRQVPV TAVDGEYNLP GLFHVDDTWA LVCEAGVDGD WMAGRLVGTR DDAPGVDIGF PESHPTSHVW ADDPATTPWR VAIVGDLATV VESTLPTDLV DGPRIDGDWV EPGRVAWSWW ASGSSVRSLE EEREYVDYAA ERGWEYVLVD AGWDDEWLPD LVAYANDAGV DVELWSHFID LNTESKREER LSRWAEWGVA GIKVDFMDSD DQGRMQFYDD LARAAAEHEL TVNYHGSAVP TGLRRRWPHV MTYEGVRGAE YYKWTTNTPE HNATLPFTRN VVGPMDYTPV TFSAERRATS AGHELALSVV YESGLQHYAD GIESYETYPI AERVLESVPA AWDETRFLRG RPGSEATFAR RKGDGWFVGS ITAGPAESIE VPLSFLDGET TAVVATDADE GDGLEEYERA VSPDESLRVS VAENGGFVVR L
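
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field might be turned back into a contiguous string and summarised follows. The helper names `join_blocks`, `gc_fraction` and `ends_with_stop` are hypothetical. The stop codons listed are those of translation table 11 (TAA, TAG, TGA). The example uses only the first three blocks and the final codon of the gene sequence, so it does not reproduce the 68% GC figure reported for the whole gene.

```python
# Stop codons for translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(blocked: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # First three blocks of the gene sequence field, copied from the record.
    prefix = join_blocks("ATGGTTCAGC TCACGGATCC CGACGGTACC")
    print(len(prefix), gc_fraction(prefix))
    # The gene sequence field ends in "...CTCTGA".
    print(ends_with_stop("CTCTGA"))
```

For the 30-base prefix, 18 bases are G or C, giving a fraction of 0.6. Passing the complete gene sequence field through `join_blocks` and `gc_fraction` is how the whole-gene value could be checked.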