Gene Information |
Locus tag | Htur_4460 |
Symbol | |
ID | 8745089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | + |
Start bp | 47407 |
End bp | 49605 |
Gene Length | 2199 bp |
Protein Length | 732 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646514997 |
Product | alpha-L-arabinofuranosidase domain protein |
Protein accession | YP_003405944 |
Protein GI | 284172562 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3534] Alpha-L-arabinofuranosidase |
TIGRFAM ID | |
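The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length. Neither assumption is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 47407
end_bp = 49605

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 2199
print(protein_length)  # 732
```

Under these assumptions the result agrees with the listed Gene Length (2199 bp) and Protein Length (732 aa).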
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.551805 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGCTC TCTTTACGCA TACCAGCGAC GAGTCCGAAT CGCAGGCGAC CGTTCGTCTC GACCCCTCGA GACAGGGCGA GACGGTAAAC CCCGAACTCT ACTCGAAGTT CGGCGAACAC CTCTACACGC CCCGGAACGT GACGAACGTT CTCGAGGCGC AGGCGCTGTA CAATCCCACG TTCGGATCCT GGAAGTTCGG CAGCCAGCAG TACGGTGTGG ACGGCGGCAA CAGCGCTGCC TTCGACCCCG ACGAGATCGA CGACCGGATC GAAACGTACG CCGAGACGCA CGGAATTCCG GACGCCGAAC GGCTCCGGAC GGCCTATCGA GACGGGACGG CGCTGTGGTG GTTTCCGTAC GGGGACGCCC GGACGAGCCC CGATGTCGGA ACGGCCGACG ACCGCGCACA GCGCTTCGAG GTCGGGTCGA CGGACGACTA CGCCGGGCTC GCGCAGTGGT GTCACCTGCC GGTCCACCGT ACGAACGCGT TCGAGGGGCA GATAACGCTG CGCGCCGCCG CGGAGACGCC GGTTCGACTC GCAGTTCACG ACGTCGATCC CGACGACGGA GCGATCGGTG AGGTGCTCAC GGAGACCGAC GTGACCGCTC GAGACGCGAA CCGAACCGTC GACTTCTCGT TGAAACTGCC ACGGGACGAA CTCGAGGACG ACGAGCGGCT GTTCGGGTTC AGCGTGACCA CGCGGGTTTC CGACGCGAAC GTCGTCCTCG ACCGCGTCTG CTGTTATCCC GACGATCACG TCGCCACGGC CGATCCCGAG ATCGTCGACC TGCTCGCCGA CCTGAATCTC TCCGTGCTCC GGTGGCCCGG CGGGAACTTC GTCTCGGGCT ATCACTGGGA GGACGGCGTC GGCCCGATCG AAGAGCGGCC GACGAAGCCG AATCCGGCGT GGGACGCCCT CGAGACGAAC CTCTTCGGGA CCGACGAGTT CGTCGCGCTC TGTGCGGCGG TCGGCTGCGA GCCGATGATC TGCGTCAACG CCGGCAGCGG AACGCCCGAG GAGGCGGCCC GGTGGGTGGA GTACTGCAAC GGCTCGACCG ACACCGAGAT GGGCGCCCTG CGGGCCGAAC ACGGCCATCC CGAACCGTAC AACGTGACCT ACTGGGAAAT CGGCAACGAG CTCTACGGCT CGTGGCAGAT CGGCTGGACG ACGCCGACGG GCAACGCCGA TCGCTTTCGC CGATTCAAGG CGGCGATGGA GGCCGTCGAC GACTCCATCG AGGTGATGGC CTGCGGGAAC CGCCACACCG ACTGGAACGA GCCGCTTCTC GGGGAACTCG AGCCGAGCGA CTGGCTCACC GACCACGTCC TCATGGAGTG TCACGCCGAT CCGGAGACCG ACCCGGTCGA ACTGTTCAAC GCTCACTCGG GCGTTGCGAG CCAACTCCGC GAGGAGTACG AGGCGGTCGC CGCCGACTGC CGGGACGCCG GCCTCGAGGA CGTCCGACAG GCGATCACGG AACTGCAGCT GTTCACCCGA TTCGACGGAC GCGAGGACGA CGAGGCGGCC GATGGTGAGT CGGCGAGCGA GGAAAGCGCG GACCAGCGGC TGAGCCGAGA GACGCTTCCG ACGAACAAGA GCGTCACCGA GGCCGTCTTC GACGCCACGA TCATCCACGA GTGCATCCGG AGCGGAACCG TCGACATGGT GACCCACTCC GGCGTCGGGA ACCACGGCGG CGGCCTCCGG AAGTCCCAGG GACGAGTGTG GGCCGATCCC TGTTACTACG GCCAGCGACT CGAAACGGGG CTCGTCGGCG GCACCCCGAT CGGCGTCGAC 
GTGACGTGTC ACTCTTTCTC CACGGAGACT GCGTGGGGGA CGGACACGAG CCAGTGGTTC GGCGAACTCG AGCCCGTGAC CGACGCGCCG GCCGTCGACG CGATGGCGGT GACTGACGCC GACGAACACG ATTTCGCAGT CGTGCTGGTC CACCGCGACG CCGGGGCGGA CGCCATTGAC GTTACGCTCT CGGGTGAGCC GCTGGAATCG CTCGACGTGG TGACGGTCGA TCGGCTCTCG GCGGAGACGA TGTACGATAC GAACACGCTT GAGGACCCCC GGCGGATCAC GCCGACGACC GACAGGACGG CGGTCGACGA CGGGACCGTC ACGGTGTCAC TTCCGCCGTA CTCGTTGATT CGGGTGACTG GCGATCAGGA GCCGGCCGTC GGCGAGTAG
|
Protein sequence | MTALFTHTSD ESESQATVRL DPSRQGETVN PELYSKFGEH LYTPRNVTNV LEAQALYNPT FGSWKFGSQQ YGVDGGNSAA FDPDEIDDRI ETYAETHGIP DAERLRTAYR DGTALWWFPY GDARTSPDVG TADDRAQRFE VGSTDDYAGL AQWCHLPVHR TNAFEGQITL RAAAETPVRL AVHDVDPDDG AIGEVLTETD VTARDANRTV DFSLKLPRDE LEDDERLFGF SVTTRVSDAN VVLDRVCCYP DDHVATADPE IVDLLADLNL SVLRWPGGNF VSGYHWEDGV GPIEERPTKP NPAWDALETN LFGTDEFVAL CAAVGCEPMI CVNAGSGTPE EAARWVEYCN GSTDTEMGAL RAEHGHPEPY NVTYWEIGNE LYGSWQIGWT TPTGNADRFR RFKAAMEAVD DSIEVMACGN RHTDWNEPLL GELEPSDWLT DHVLMECHAD PETDPVELFN AHSGVASQLR EEYEAVAADC RDAGLEDVRQ AITELQLFTR FDGREDDEAA DGESASEESA DQRLSRETLP TNKSVTEAVF DATIIHECIR SGTVDMVTHS GVGNHGGGLR KSQGRVWADP CYYGQRLETG LVGGTPIGVD VTCHSFSTET AWGTDTSQWF GELEPVTDAP AVDAMAVTDA DEHDFAVVLV HRDAGADAID VTLSGEPLES LDVVTVDRLS AETMYDTNTL EDPRRITPTT DRTAVDDGTV TVSLPPYSLI RVTGDQEPAV GE
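The sequences above are printed in space-separated blocks of ten residues, so the grouping has to be removed before lengths or base composition are computed. The following is a minimal sketch of that cleanup and of a GC-percentage calculation. The helper names `strip_grouping` and `gc_percent` are hypothetical, and the short input string is a made-up fragment, not part of this record.

```python
def strip_grouping(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks."""
    return "".join(seq.split())


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace and case."""
    bases = strip_grouping(dna).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Made-up fragment for illustration only (not taken from the record).
example = "ATGC GGCC"
print(len(strip_grouping(example)))  # 8
print(gc_percent(example))           # 75.0
```

Passing the full Gene sequence field to `gc_percent` and the Protein sequence field to `strip_grouping` would allow a comparison with the GC content and Protein Length fields listed in the record.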