Gene Information |
Locus tag | Htur_4470 |
Symbol | |
ID | 8745099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | - |
Start bp | 59258 |
End bp | 60763 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 646515007 |
Product | alpha-L-arabinofuranosidase domain protein |
Protein accession | YP_003405954 |
Protein GI | 284172572 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3534] Alpha-L-arabinofuranosidase |
TIGRFAM ID | |
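The coordinate and length fields in the table above are consistent with one another, and a short check makes the arithmetic explicit. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based, inclusive coordinates and that the reported protein length excludes the terminal stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Htur_4470 record above.
length = gene_length(59258, 60763)
print(length)                  # 1506
print(protein_length(length))  # 501
```

Both results match the Gene Length (1506 bp) and Protein Length (501 aa) fields of the record.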
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0994522 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAACG CACGGATTAC GGTTCATACT GCAGCGGATA TCGACCGCAT CGCCCCCGAG GTTCACGGTC ACTTCTCCGA ACACCTCGGA CGGTGCGTCT ACGAGGGACT CTGGACCAGT GACAGCTCCG AGGAGAACGG ATTCCGGGAG GACGTCGTCG AGCTCCTTTC CGATCTCGAG ATTCCCGTTC TTCGCTGGCC CGGCGGCTGT TTCGCCGACG ACTACCACTG GGAAGACGGC GTCGGTCCCC AGGAGGAGCG ACCGCGGCGA CGGAACCTCT TCTGGGCGCA GGGCCCCGAG GACCTCCCCG AGGAGTCCAA CGCCTTCGGC ACCGACGAGT TCCTGCAACT CTGCGAGCGA ATCGACACGG AGCCGTACCT CGCGGCCAAC GTCGGTTCCG GCACCCCGCA GGAGGCTGCC AACTGGGTCG AGTACTGCAA CTACGACGGC GACACCGAGC TCGCCGACCG ACGACGCGAT AACGGTCACG AGGAACCCTA CGGCGTCAAG TACTGGGGCC TCGGTAACGA GAACTGGGGC TGTGGCGGTC AGATGTCGCC CGAACAGTAC GCTCGCGAGT ACCGCCGGTA CGCCACCTAC GTCGGCACCC AGTCGAACCT GATGTTCGAC CACGACATCG AACTCATCGC CTGCGGCTTC GAGGGTCACG AGTGGAACCG CCGCTTCCTC GAGGAGATCA ACCAGTCCCG GTGGGGCGTG GAGTTCCCGC TCGACCACCT CACGCTGCAC CACTACTACG GCCGCGGGAT GAACGTCGAC GAGGCCGACG AGGACCAGTA CGACCGCATG CTCGTCGAAG CCCTCGAGAT GGAACGCCAC ATCGAGCGGA TGGCCGGCGC GATCAACGCC GTCGCGACGA CCCGCGACAT CGGCGTCATC ATCGACGAGT GGGGGACCTG GCATCCCGAG GCGACGGCCG ACAACGGGCT CGAGCAGCCG GGGACCGTCC TCGACGCGCT CTCCGCGGCG GCCGTTCTAG ACGTCTTCAA CCACCACAGC GACGTTCTGA CGATGACGAA CATCGCGCAG ACGGTCAACG TCCTGCAGTG TCTCATCGAG ACCGACGAGG ACGAGGCCTG GGCGCGTCCC ACCTACCGCG TCTTCGATCT GTACGCGCCG CACAAGGGTA GCGAGGCCGT GCAGACGTCG GTCGACACGC CGACGCGCGA ACTCGACGAC GACGAGGACA GCGAACTGCC GCTCGTCGGC GCGTCCGCGT CGGTCGACGA CGACGAGACC TACGTTACCG TCACGAACCT CGACTGCCGC GAGGAGAAGA CGATCGAAGT CGCCCTCGAG GGCGTCGACC TCGACTCGGC GACTATCGAG GGCGAACTGC TGTTCGCGGA TCAAGAGCCC GACCTGGAAG TCGACGCAGA CAACGCCGAC GAGTTCGCCG CCGAGGAACT CGACGTGTCG GTCGACAGCG ACACCCTGAT CGCCGAGCTG CCGGCGTCGA CGGTCGCTGG AATCTCGATC CAGTAA
|
Protein sequence | MANARITVHT AADIDRIAPE VHGHFSEHLG RCVYEGLWTS DSSEENGFRE DVVELLSDLE IPVLRWPGGC FADDYHWEDG VGPQEERPRR RNLFWAQGPE DLPEESNAFG TDEFLQLCER IDTEPYLAAN VGSGTPQEAA NWVEYCNYDG DTELADRRRD NGHEEPYGVK YWGLGNENWG CGGQMSPEQY AREYRRYATY VGTQSNLMFD HDIELIACGF EGHEWNRRFL EEINQSRWGV EFPLDHLTLH HYYGRGMNVD EADEDQYDRM LVEALEMERH IERMAGAINA VATTRDIGVI IDEWGTWHPE ATADNGLEQP GTVLDALSAA AVLDVFNHHS DVLTMTNIAQ TVNVLQCLIE TDEDEAWARP TYRVFDLYAP HKGSEAVQTS VDTPTRELDD DEDSELPLVG ASASVDDDET YVTVTNLDCR EEKTIEVALE GVDLDSATIE GELLFADQEP DLEVDADNAD EFAAEELDVS VDSDTLIAEL PASTVAGISI Q
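The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before any calculation. The following is a minimal Python sketch of how GC content and the translation could be recomputed from the listed gene sequence. It assumes the sequence is given in coding orientation (it begins with ATG and ends with TAA) and contains only A, C, G and T. The helper names `clean`, `gc_fraction` and `translate` are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for amino acids (table 11 differs in the start codons it permits).

```python
from itertools import product

# Standard codon assignments, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that splits the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and final 15 nucleotides of the gene sequence above.
print(translate("ATGGCTAACG CACGGATTAC GGTTCATACT"))  # MANARITVHT
print(translate("ATCTCGATCCAGTAA"))                   # ISIQ
print(gc_fraction("ATGGCTAACG"))                      # 0.5
```

The two translated fragments agree with the first block and the last four residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` would allow the GC content field to be checked in the same way.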