Gene Information |
Locus tag | Htur_4495 |
Symbol | |
ID | 8745124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | - |
Start bp | 92258 |
End bp | 94576 |
Gene Length | 2319 bp |
Protein Length | 772 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646515032 |
Product | alpha-L-arabinofuranosidase domain protein |
Protein accession | YP_003405979 |
Protein GI | 284172597 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3534] Alpha-L-arabinofuranosidase |
TIGRFAM ID | |
|
|
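The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(92258, 94576)
    print(length)                  # 2319, matching the Gene Length field
    print(protein_length(length))  # 772, matching the Protein Length field
```

Under these assumptions the values reproduce the Gene Length (2319 bp) and Protein Length (772 aa) fields in the record.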
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.183163 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGTGG CCGGGGCCGC GACCGGAACG GCGGCGGCCG ATTCGCACAC TGGAGCCGAA TCCGGTGACG GCATCCGGAA CACCGTCTCG CTGGACCTCT CGGAACGGAC CGAAGACGAG GTGTCCGACC AGTTGTTCGG CCGGCTCTGC GAGCACTACG AGTCGGGGAC GATTTACCCC GGCGTCTACT CCGAACACGT CAAGAACAAC TCGTTCTATC CGAGAACGTG GTCGGAAGAC GATTACTTCG GCCCGAAGAC GCGCTTCGAT CCCGAATCGA TAGCACGCCA CGAGAACGTC CCGTTCCCGT GGGAACCCGT CGATAACTCC GGGGTCTCTT TCGAGCAGCG CGAGGGCGGC GTCGCCGCCG TGGATACGAC GAACTACCAG CGGGTCTCCC TCGAGGACGC CCGTGGAGGG ATCTCACAGA AGATCGTCCT GCCAGACTTC CGCACGCTGG GCTACGACCT CTCGTTCTCG GTGCGCGGCG ACGGTCTCGA GTCGGTCACC GCCGCCATCA CGACGCTTGA CGGTGAGACC CTCGCGACCG CCGACGTCGA CGTCACCGAC GACTGGACCC GCCACGAGCT CGCGCTCGAA TTGAGCGAGG CCAGCGGCGA TCAGTACGTC GCCGGCTCGG TCGCGAACGT CGACACCCCC TACGGGGAGT ACGTCCTCGA GTTCACCGCC GAGGGAAGCG GTCACGTCGA TCTCGACTGG ATCACCCTCG GGGCCGACGA CGCGATCAAC GGCAAGTTCA ACCCGTCGAC CGTCGAGTTG ATGCGAGAGC AGAACACGAC CTGGCTGAAG TGGCCGGGCG GGAACTTCAC GAGCCAGTAC AACTGGCGCG ACGGCATCGG TCCGCTCGAC GAGCGGCCGA TGCGCTTCAA TCACGCGTGG GGCGGCGTCA ATCCGAACTA CTTCGGCATC GACGAGTACC TCGAGCTGTG CGAGATCGCC GACCTCACGC CGCGACTGAC TATCGGCTGG TGGGACAACC CCGGCGAGTG GGCCTCGGAG CGCCAGATCC TCCCCGAAGA CGCCGCCGAC TGGGTCGAGT ACTGTAACGG GTCGACGGAG ACGGAGATGG GCGCGCTGCG CGCCGAGAAC GGCCACCCGG AGCCGTACGA CGTCAAACAC TGGGAAGTCG GCAACGAGGT CTGGGGGCCG TGGCAGCGCG GCCACACCGC CGATCCCTCC GAGTACGCGA GCGGCTCCGA GGAGCGGATC GGGTTCAACG AGTACTACGA CGCGATGATG GCAGTCGACG ATTCCATTAC CGTCCTCGCG GACGGGATGG ATCCGGGCTA CGACGAGGCG GAGACGCCGG ATCCCGACGA GTGGAACGGG ACGCTGTTCG AGGAGTCCGG AGACCGCATC GACGGGCTGG ACCTCCACCG CTACAACTGG GGTATCGAGG ACCAAGAGGC GCGAGACGCG TGGTTCGACG AGAACGACGC GGACGCCATC GACTACAACG AGGTGCTGGT CATGTTCCCG ACGCAGTTCG GGGTGTTGAT GGACGACCTC AGCGCCGAAG CCGCCGACGC GGGGATCGAG GACTTCCGGA TCAACGTCGG CGAGTACGGG CTCTTTCCGT CGGTCAACGA GGGCGATCCG TACCCCGGCC CGGAGACGAT GCCGGGCGGC TCCTATATCG CGGGCATGCT CAACTCGTTC ATCCGGCAGA GCGAGACCGT CGTCGAGGCC TCTCAGACGT GGGTGCCCGT GCGCATGTTC CCGCCGGAGT TCACCGAGGC GCCGCCGGAT CCGAACCCGC TGGCGCCGGC GGGATCGGTG 
TTCGGTCTCT ACTCGGCGGT GTTCGAGACC AACGCCGAGT GGCACGCGGT CGATACCGAG ATCGACGGCG ACGGCCGTGA CATCCCCGAC ACCGGACCGC GCATCGACCC CATGGAGGAC GTTCCCTACG TCGACGCCGC CGCCATGCAG AACAAGCGGG GCAAAGAACT GTGCGTCTTC CTCACGAATC GGAACCTCCG GGAGCACGGC GAGGTCACGA TCGATCTCCC CGAGAAGTAC GCCGGCAAGT CTGTGGCGAT CACTCGCCAC CGCGCGACCG CGTCCGAGCG ACCGCTTCCG CACGACTTCC AGGACTCGTG GGAGGAACCG GACGTCTACG CGGTGGACAC CGCCATCGAA TCAGTCGACC GTGACGGCTC GCTCACGCTC GAGGTCGGCC CTGCCTCGGT CGTCCGGTTA CTCGTCGATA ACGACCACGG ACGTCCCGCG ACGATCGGCG ACGACGGCGT GTGGTCGGGG CTCAACGGTA ACGAGTGCGA CCGCCGTCCC CGAAAGTGA
|
Protein sequence | MVVAGAATGT AAADSHTGAE SGDGIRNTVS LDLSERTEDE VSDQLFGRLC EHYESGTIYP GVYSEHVKNN SFYPRTWSED DYFGPKTRFD PESIARHENV PFPWEPVDNS GVSFEQREGG VAAVDTTNYQ RVSLEDARGG ISQKIVLPDF RTLGYDLSFS VRGDGLESVT AAITTLDGET LATADVDVTD DWTRHELALE LSEASGDQYV AGSVANVDTP YGEYVLEFTA EGSGHVDLDW ITLGADDAIN GKFNPSTVEL MREQNTTWLK WPGGNFTSQY NWRDGIGPLD ERPMRFNHAW GGVNPNYFGI DEYLELCEIA DLTPRLTIGW WDNPGEWASE RQILPEDAAD WVEYCNGSTE TEMGALRAEN GHPEPYDVKH WEVGNEVWGP WQRGHTADPS EYASGSEERI GFNEYYDAMM AVDDSITVLA DGMDPGYDEA ETPDPDEWNG TLFEESGDRI DGLDLHRYNW GIEDQEARDA WFDENDADAI DYNEVLVMFP TQFGVLMDDL SAEAADAGIE DFRINVGEYG LFPSVNEGDP YPGPETMPGG SYIAGMLNSF IRQSETVVEA SQTWVPVRMF PPEFTEAPPD PNPLAPAGSV FGLYSAVFET NAEWHAVDTE IDGDGRDIPD TGPRIDPMED VPYVDAAAMQ NKRGKELCVF LTNRNLREHG EVTIDLPEKY AGKSVAITRH RATASERPLP HDFQDSWEEP DVYAVDTAIE SVDRDGSLTL EVGPASVVRL LVDNDHGRPA TIGDDGVWSG LNGNECDRRP RK
|
| |
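The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any downstream use. The following is a minimal sketch, assuming plain string input copied from the record; the helper names are hypothetical, and the short strings in the usage lines are toy inputs rather than the full sequences.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a formatting example.
    print(clean_sequence("ATGGTCGTGG CCGGGGCCGC"))  # ATGGTCGTGGCCGGGGCCGC
    # Toy input: 6 of 8 bases are G or C.
    print(gc_fraction("ATGC GGCC"))                 # 0.75
```

Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content field of the record.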