Gene Information |
Locus tag | Htur_3899 |
Symbol | |
ID | 8744527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 139858 |
End bp | 142590 |
Gene Length | 2733 bp |
Protein Length | 910 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646514483 |
Product | alpha-L-rhamnosidase domain protein |
Protein accession | YP_003405430 |
Protein GI | 284167152 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.019543 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCAGGGG ACCAGCCAGT AGACCTGCGA ACCGAGTACG CGTCGGACCC GATCGGGCTC GATACCCCGA CACCGGACCT ACGCTGGCGT GTCGATACTG ATCGCCGCGG AGCCGGACAA ACCGCGTTCA GAGTGCTCGT CGCGTCGACT CCCGACCGGC TCTCGCCCGG CGAGGCCGAC TGTTGGGACA CCGGCAAGCG ACCGTCGGAC CGGCCGGCGG TGACCTACGA CGGCGAGCCG CTCGAGTCCG GAACCGAATA CCACTGGGCG GTTCGCGTGT GGGACGAGGC CGACGAACCG AGCGAGTGGA GTGAGCCAGC CCGATGGGAG ACGGGGCTGC TCGAGCCCGC CGATTGGGAG GCCGCCTGGA TCCGCCGGCC GGGAGACGGC GCGTTCGAGC GCGGTCGATT CACCTACCTG CGCCGTGAGG TCTCCCTCGA GGGGGAAATC GAGCGAGCCA GAGCGTACGT CTCCGCGAGC CACCGGTACG AGCTCTCGGT CAACGGACGT ACGGTCGACT GCGGCCCGTC GGCGTCCTAT CCGGACTTCC AATACTACAG GACCGTCGAC CTCACGGACG ACCTCGAGAC GGGCGAGAAC GCGGTCGGCG CGCTGTGCAA CTGGAACGGC GCCGGACAGG GGCGACCCGC CGCCGAACCA GGGTTCGTCT GCCGGCTCGA GGTGACGTTC GCCGACGGGA CCGAGCGCAC TGTCGTCACC GACGAATCGT GGCGCGTTCG CGAGGCCGAA TGGCGAGAGG ACGCGCCGCT TCGAAACGAC GAGATCGCGG AGCCGATCGA GATAATCGAC GGCCGTCGAT CTCCGGACGG GTGGGACGAA CCGGGATTCG ACGCCGGAGC GTGGGATCCC GCGACCGTCG TCGGCACCCA TCCGACCGAG CCGTGGGAAC GGCTCGTCGC CCAGCGATCC GAAGTCGTTC GGTTGCCGAT CGAGCCCGTC TCGATGGACC GACTCGAGGA CGGCTCGTAC GTGGTCGACT TCGGACGCGT CTACGCCGGG CTCCCCGAGG TGCACTTTGC CGACGGAACC GACGGACACC GGGTCGCGCT CAGAGCGGGT TACCGGCTCG CGGACGACGG CTCGGTCGAC GAGACGGAGG GGACCCAGTG GACGGACATG CGCTACGAGT ACGTCCAGCG CGAGGGCGAG TGTCGGTTCC GTCCGTTCAA CTACCTCGGC GTCCGCTACC TGCAGATCGA CGATCCGGGC GAGGTACTCG AGCCCGAACA GGTGGGGCTG CTGGCGACGC GCAACGCCGT CCCCGACGAG CGGGCGGCGA CGTTCGAGTC CGACGACGAG ACGGTCGACG CGGTCTTCGA CCTCGCGCGC CACTCGGCGC TGTACGGCTG TCAGGGGCAG TTCGTCGACA CGCCGACTCG CGAGAAGGGC CAGTTCCTGA TGGACGGCTT CAACGTCTCG CGGACGACGA TGCGAGCGTT CGCGGAGCGC GCGCTGAGCC GACAGGCGAT CGAGGAGTTC CTCCGCTCGC ACTACCGGTA CTGGGCGGGC GAGGGCCGGC TGAACGCCGT CTATCCGAAC GGCGACGGCA AGCGCGACAT CCCCGACTTC ACGGTCTCGT TCCCCGAGTG GGTCTGGCGG TACTACCGCA CGACCGGCGA CCGGACGATC CTCGAGCGCG CCTATCCCGT CATCACGGCG ATCGCGGCGT ACGTCGAGCG CCACGTCGAC GGCGAGACGG GACTCGTCGC GAACCTCTCG GGCGGCGAGG GCGGCCCTTA CGAGGAGGGG ATCGTCGACT GGCCGCCCGA GATGCGCTAC 
GGCTACGATC GGGACTGGCC CGTCAGGACC ACGGTCAACG TCCTCTGTAC GAGCGCGCTC GGGCGGGCGG CCGATATCGC GGCCGAACTC GACCGACCGG ACCGCGAGCG GGCGCACTAC CGCGACCGCC AGCGCGCGCT CGAGTCGGCG ATCGACGACC GTCTCCGTCG GGGCGACTAC TACGTCGACG GCTGTGACGC GACCGAGGCG AGCGACTCAG CCTCCCAGCA CGCCAACGCA CTGCCGCTGG CGTTCGGTCT CGTTCCCGAC GCGCACGTCG ACGCCGTCGC CGAGCACGCG GCCGAGCAGG GAATGGCGAT GGGACCGATG ATGGTGCCGT GGCTGCTCGA GGCCCTCGAG ACCGCCGGTC GCCCTGACGC AGTCGTCGAC CTGCTGACGA ACGCCGCGGA CGACGGCTGG GCGAACATCC TCGCACAGGG CGGCACGTTC ACTTGGGAGA CCTGGCACTG CCGCGACTCC TCCCTGCCGG ACGATCAGCG GCACAATCGC AGCGAGTCCC ACGCCATGGG CGCGACGGTG CTGGTGTACA TTCTGCGGAC GCTTCTGGGC GTTCGTCCCG ACGGCGTCGC AGGCGAACAC CTCGAGATCC GCCCGCCCGA TGCGGGACTC GAGTCGGCGT CCGGCCGGAT CCCGACTGAG CGGGGCACCG TCGAGGTCTC GTGGAGTCGC GAAGCGGACG AGGACCGCGA GGACGGCGCG TCGTTCCGAC TCGAGGCGAC GATCCCGTGG AACGCGTCGG CGACGGTCGT CCTCCCAACG AGCACCGGCG ATGCCGTCGC GATCGTCGAC GGCGAACCGG TTCGGGGCGA CGAGGGCGCG GCGACGTTAC CCGACGGCGT CTCGGCCGTT CGGGAGGGCG ATCGGCTCGA GATCGACGTC GAATCGGGTA CCTACCGGTT CGCCCTCGAA TGA
Protein sequence | MSGDQPVDLR TEYASDPIGL DTPTPDLRWR VDTDRRGAGQ TAFRVLVAST PDRLSPGEAD CWDTGKRPSD RPAVTYDGEP LESGTEYHWA VRVWDEADEP SEWSEPARWE TGLLEPADWE AAWIRRPGDG AFERGRFTYL RREVSLEGEI ERARAYVSAS HRYELSVNGR TVDCGPSASY PDFQYYRTVD LTDDLETGEN AVGALCNWNG AGQGRPAAEP GFVCRLEVTF ADGTERTVVT DESWRVREAE WREDAPLRND EIAEPIEIID GRRSPDGWDE PGFDAGAWDP ATVVGTHPTE PWERLVAQRS EVVRLPIEPV SMDRLEDGSY VVDFGRVYAG LPEVHFADGT DGHRVALRAG YRLADDGSVD ETEGTQWTDM RYEYVQREGE CRFRPFNYLG VRYLQIDDPG EVLEPEQVGL LATRNAVPDE RAATFESDDE TVDAVFDLAR HSALYGCQGQ FVDTPTREKG QFLMDGFNVS RTTMRAFAER ALSRQAIEEF LRSHYRYWAG EGRLNAVYPN GDGKRDIPDF TVSFPEWVWR YYRTTGDRTI LERAYPVITA IAAYVERHVD GETGLVANLS GGEGGPYEEG IVDWPPEMRY GYDRDWPVRT TVNVLCTSAL GRAADIAAEL DRPDRERAHY RDRQRALESA IDDRLRRGDY YVDGCDATEA SDSASQHANA LPLAFGLVPD AHVDAVAEHA AEQGMAMGPM MVPWLLEALE TAGRPDAVVD LLTNAADDGW ANILAQGGTF TWETWHCRDS SLPDDQRHNR SESHAMGATV LVYILRTLLG VRPDGVAGEH LEIRPPDAGL ESASGRIPTE RGTVEVSWSR EADEDREDGA SFRLEATIPW NASATVVLPT STGDAVAIVD GEPVRGDEGA ATLPDGVSAV REGDRLEIDV ESGTYRFALE
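
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the source database: the function names are hypothetical, coordinates are assumed to be 1-based and inclusive, and the CDS is assumed to end in a single stop codon that is not counted in the protein length. The GC helper is shown on the first two 10-base blocks of the gene sequence only, not on the full gene.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """Percent G+C over the A/C/G/T characters of seq; spaces and other symbols are skipped."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values copied from the record (Start bp, End bp).
length = gene_length_bp(139858, 142590)
print(length)                              # 2733, matching "Gene Length"
print(expected_protein_length_aa(length))  # 910, matching "Protein Length"

# First two blocks of the gene sequence, used here only as a small sample.
print(gc_percent("ATGTCAGGGG ACCAGCCAGT"))  # 60.0
```

Because the gene lies on the minus strand, the start and end coordinates refer to the replicon's forward strand, while the listed gene sequence reads 5' to 3' along the coding strand; the length arithmetic is the same either way.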