Gene Information |
Locus tag | Htur_5037 |
Symbol | |
ID | 8745843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013748 |
Strand | + |
Start bp | 24498 |
End bp | 26849 |
Gene Length | 2352 bp |
Protein Length | 783 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 646515651 |
Product | hypothetical protein |
Protein accession | YP_003406598 |
Protein GI | 284176322 |
COG category | |
COG ID | |
TIGRFAM ID | |
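The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS includes its terminal stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that includes one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(24498, 26849)
print(length)                     # 2352
print(protein_length_aa(length))  # 783
```

Both values agree with the Gene Length (2352 bp) and Protein Length (783 aa) fields in this record.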
| |
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTCAC AACATCACCT TCACCAACCG AGTGACAACA CCGAGACGCG AGTTGCGGTC CGGAACAAGC TCTCGGTCCG GATCGAACGC CAACGGCGTA AAGAGACGCT GAATACAGAG GCCCATGTCC GCGAGTTAAT CCGCGACGCC GAGGGCGCGA TCGTCGGCGA CGACGAGACT GACCGCCGCG CCGAGGAACT GCTCGAGGAG TTGCACCTCT ATCTGTGGCA TGACCCCGAA CGAAAGGCAC CAACCGACGG CGATACGCTC CCGGCAGACG CTCGGTACCG GTTCAAACAG ATTCACCGCC AATTCCAGGC CGGCCACGAT ATCGAATCGC GGACATGGTT CGAAACGCTG GTGTCGGTCG CGAAAGGCCT CAACTACGTC GAAGCGTTTA CCGATCTGAC CCAGTACGCA CCGGTCCGTC TCGAGACGTT AAACGAACAG GGCCGCCAGG GTAACGTCGA GACGGCGACG CCGATCGGTC GCCGTCGGAT CGGGGCCGAC GACGCCGCCG ACCTCGAAGA CCGAGCCGTC GAGATTACAC ACTCGTCGTG TGACCACATT CTCGCGGTTG CACTCCCGCG GTCGGGGAAG GACTCGACGA TCACCAGTAT CGGGATGAAC CTCTGGAAAG AGCACAACTA CTCGTATTTC AGCATTCTCG ACGACGGCCG GATGGAGACA CCAATGGTCT CGATACCCAA CGACGAGGAT GCGATTCAAC GAAATCTCGA GCGGATGGGG CAGGAACCAG AGGCGTTCGA CGCCGAGGTG TTCGTCCCGG CGATGGACGG CATTCCGTCA CACCTCCCGG CGAACTTCAA GCCGTTTACC ATCGGGATTG ACGACCTCAC ACCCCATCTG ATTCTCCGAC TCGCGGGGAT TACGAAATCC GACGCGAACA CGGAACAGCG GATCAAACAG GCACTCGATA AGACCCTCGA GCGCACCGGC GAAGTGACTG AGTTGGTCTC GCGGTTGCAG GTCTACGCGA AGGAAATGGA CGCGACAATC GAGTGGGTCG AGAAAGGCGA CGACTCGGTC GAATCCAAAA GCGTTCAATA CCACATGGAG GCGGAGGATG CACTAAACAA GGCCGCCCAG CGGCTTGCTC ACCTCGCTGG CGAGGGGCTC GTTGCGTCGC CCAGCGCCGA GACGAACATC GATATGGCCT CTGTCATCGC GAACCAGGAA CAGGCCGCCG TCCTCTGTTG TAACTTTCTC ACGCAAGGGC AAGAGGCGCT GAAATACACG ATTATGGACC TCTGGTTGCG GTTGATCTAT CAGGCCCGAG ACGGTGATAT GCGACTCCCG CGTGTCTGCC TCGAGATTCG CGAACTGAAG AACATCGCCC CGAGTAAGCT GGCCGACGTT CGGTACAAAG AGACCATCAA GACGCTTCGA CAGACCATCT TCTTTATTTC AACACAGGGT GGCAGCCGCC GAATCTTGAT GCTTGGCTCG ACCCAGAAAC TGAACGACGT CTACAAGCCG GTCCGGTCGA ACATGGCGAC GAAGATTCTG TTACGACTCG GCGAGGAGGA AATCGAGACG CTGGACCGGT CGTACAATTT CAGTAACGAA CAGATGCGAC AGCTCTCCGA GTTCGATATC GGACAGGGGA TGATCCTTGC CGGCGGCGAG GCCTACTGGC CGATCGAACT CCGCGGGGCG CCGTGTGGGC TCGGACTCGG CGATCGCCAT TGGCTCGACC GGTATGGCCT CGCGTGGGGC GCTCGCGTTC GCGAGAGCGA GTACGACGGC TGGCGGACGA AACACGGCGA CTGTGCGTAC 
TGGGTGGATC TAACCGAGAA CACGGTCGTC ACCGACGGCT CGGTCCCCGA GGTCGGGACG TGGCACCTAC TCCCCGAGGA CTTCGATGCT GACCTCGAGC CCGAGGCCGT CGACCAGGAG GCGATCGACG CCGCCCTCGA GCGGCGCCGC GAGTATCCCG TGAAGTCCGA CCTCTCACTC GAGCCGACCA GCTTTGGCGA CCGGCAGCGC GACCTCTCCT TACAGCAACA GGACCGAGAC ACGACGCTCT CGGAGGTGGT CGAACAGCAC AACATCCCCG AGGCCGTCGC GCCGTGGCTC TCGAAAGAAG CGCCGACGCG AGAGCAACTG GTTGCGGCCT GTCGGGCGAT CGACGAGCAC GACGACCTCG CACGACAGGC GGATATCGCC GAGCATATCG ACTGGTCTCG TAGCAACCTC GCGACGCATC TGAGCAAGAG CGAGAGTCTG AAGAAATGTG TCACGAAAAG CGGCGGCACC TACGAGTTGA CGCCGATCGG GAAGCGAGCG GCAGAGGTCA AATGGAAAGT CGTGATGGAG GAACTGAAAT AG
|
Protein sequence | MSSQHHLHQP SDNTETRVAV RNKLSVRIER QRRKETLNTE AHVRELIRDA EGAIVGDDET DRRAEELLEE LHLYLWHDPE RKAPTDGDTL PADARYRFKQ IHRQFQAGHD IESRTWFETL VSVAKGLNYV EAFTDLTQYA PVRLETLNEQ GRQGNVETAT PIGRRRIGAD DAADLEDRAV EITHSSCDHI LAVALPRSGK DSTITSIGMN LWKEHNYSYF SILDDGRMET PMVSIPNDED AIQRNLERMG QEPEAFDAEV FVPAMDGIPS HLPANFKPFT IGIDDLTPHL ILRLAGITKS DANTEQRIKQ ALDKTLERTG EVTELVSRLQ VYAKEMDATI EWVEKGDDSV ESKSVQYHME AEDALNKAAQ RLAHLAGEGL VASPSAETNI DMASVIANQE QAAVLCCNFL TQGQEALKYT IMDLWLRLIY QARDGDMRLP RVCLEIRELK NIAPSKLADV RYKETIKTLR QTIFFISTQG GSRRILMLGS TQKLNDVYKP VRSNMATKIL LRLGEEEIET LDRSYNFSNE QMRQLSEFDI GQGMILAGGE AYWPIELRGA PCGLGLGDRH WLDRYGLAWG ARVRESEYDG WRTKHGDCAY WVDLTENTVV TDGSVPEVGT WHLLPEDFDA DLEPEAVDQE AIDAALERRR EYPVKSDLSL EPTSFGDRQR DLSLQQQDRD TTLSEVVEQH NIPEAVAPWL SKEAPTREQL VAACRAIDEH DDLARQADIA EHIDWSRSNL ATHLSKSESL KKCVTKSGGT YELTPIGKRA AEVKWKVVME ELK
|
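The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to strip the spacing and compute a GC percentage. The helper names are hypothetical, and how the database derived its reported 61% GC content is not stated in this record, so the calculation is an assumption about the usual definition (G plus C over total bases).

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence in this record, used as a small input.
example = clean_sequence("ATGAGTTCAC AACATCACCT")
print(example)              # ATGAGTTCACAACATCACCT
print(gc_percent(example))  # 40.0
```

Applying the same two functions to the full gene sequence would give a value to compare against the GC content field.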