Gene Htur_4470 details


Gene Information

Locus tag: Htur_4470
Symbol:
ID: 8745099
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013745
Strand:
Start bp: 59258
End bp: 60763
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 65%
IMG OID: 646515007
Product: alpha-L-arabinofuranosidase domain protein
Protein accession: YP_003405954
Protein GI: 284172572
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3534] Alpha-L-arabinofuranosidase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0994522
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTAACG CACGGATTAC GGTTCATACT GCAGCGGATA TCGACCGCAT CGCCCCCGAG 
GTTCACGGTC ACTTCTCCGA ACACCTCGGA CGGTGCGTCT ACGAGGGACT CTGGACCAGT
GACAGCTCCG AGGAGAACGG ATTCCGGGAG GACGTCGTCG AGCTCCTTTC CGATCTCGAG
ATTCCCGTTC TTCGCTGGCC CGGCGGCTGT TTCGCCGACG ACTACCACTG GGAAGACGGC
GTCGGTCCCC AGGAGGAGCG ACCGCGGCGA CGGAACCTCT TCTGGGCGCA GGGCCCCGAG
GACCTCCCCG AGGAGTCCAA CGCCTTCGGC ACCGACGAGT TCCTGCAACT CTGCGAGCGA
ATCGACACGG AGCCGTACCT CGCGGCCAAC GTCGGTTCCG GCACCCCGCA GGAGGCTGCC
AACTGGGTCG AGTACTGCAA CTACGACGGC GACACCGAGC TCGCCGACCG ACGACGCGAT
AACGGTCACG AGGAACCCTA CGGCGTCAAG TACTGGGGCC TCGGTAACGA GAACTGGGGC
TGTGGCGGTC AGATGTCGCC CGAACAGTAC GCTCGCGAGT ACCGCCGGTA CGCCACCTAC
GTCGGCACCC AGTCGAACCT GATGTTCGAC CACGACATCG AACTCATCGC CTGCGGCTTC
GAGGGTCACG AGTGGAACCG CCGCTTCCTC GAGGAGATCA ACCAGTCCCG GTGGGGCGTG
GAGTTCCCGC TCGACCACCT CACGCTGCAC CACTACTACG GCCGCGGGAT GAACGTCGAC
GAGGCCGACG AGGACCAGTA CGACCGCATG CTCGTCGAAG CCCTCGAGAT GGAACGCCAC
ATCGAGCGGA TGGCCGGCGC GATCAACGCC GTCGCGACGA CCCGCGACAT CGGCGTCATC
ATCGACGAGT GGGGGACCTG GCATCCCGAG GCGACGGCCG ACAACGGGCT CGAGCAGCCG
GGGACCGTCC TCGACGCGCT CTCCGCGGCG GCCGTTCTAG ACGTCTTCAA CCACCACAGC
GACGTTCTGA CGATGACGAA CATCGCGCAG ACGGTCAACG TCCTGCAGTG TCTCATCGAG
ACCGACGAGG ACGAGGCCTG GGCGCGTCCC ACCTACCGCG TCTTCGATCT GTACGCGCCG
CACAAGGGTA GCGAGGCCGT GCAGACGTCG GTCGACACGC CGACGCGCGA ACTCGACGAC
GACGAGGACA GCGAACTGCC GCTCGTCGGC GCGTCCGCGT CGGTCGACGA CGACGAGACC
TACGTTACCG TCACGAACCT CGACTGCCGC GAGGAGAAGA CGATCGAAGT CGCCCTCGAG
GGCGTCGACC TCGACTCGGC GACTATCGAG GGCGAACTGC TGTTCGCGGA TCAAGAGCCC
GACCTGGAAG TCGACGCAGA CAACGCCGAC GAGTTCGCCG CCGAGGAACT CGACGTGTCG
GTCGACAGCG ACACCCTGAT CGCCGAGCTG CCGGCGTCGA CGGTCGCTGG AATCTCGATC
CAGTAA
 
Protein sequence
MANARITVHT AADIDRIAPE VHGHFSEHLG RCVYEGLWTS DSSEENGFRE DVVELLSDLE 
IPVLRWPGGC FADDYHWEDG VGPQEERPRR RNLFWAQGPE DLPEESNAFG TDEFLQLCER
IDTEPYLAAN VGSGTPQEAA NWVEYCNYDG DTELADRRRD NGHEEPYGVK YWGLGNENWG
CGGQMSPEQY AREYRRYATY VGTQSNLMFD HDIELIACGF EGHEWNRRFL EEINQSRWGV
EFPLDHLTLH HYYGRGMNVD EADEDQYDRM LVEALEMERH IERMAGAINA VATTRDIGVI
IDEWGTWHPE ATADNGLEQP GTVLDALSAA AVLDVFNHHS DVLTMTNIAQ TVNVLQCLIE
TDEDEAWARP TYRVFDLYAP HKGSEAVQTS VDTPTRELDD DEDSELPLVG ASASVDDDET
YVTVTNLDCR EEKTIEVALE GVDLDSATIE GELLFADQEP DLEVDADNAD EFAAEELDVS
VDSDTLIAEL PASTVAGISI Q
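
The coordinates, lengths, and sequences in this record can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record. The helper names `translate` and `gene_length` are hypothetical, and the sketch assumes 1-based inclusive coordinates and that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons, which this sketch does not model).

```python
# Codon table built from the standard genetic code in TCAG order.
# Assumption: table 11 shares these codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    # Walk the sequence codon by codon, dropping any trailing partial codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGGCTAACG CACGGATTAC GGTTCATACT"))  # MANARITVHT
print(gene_length(59258, 60763))                       # 1506
print(gene_length(59258, 60763) // 3 - 1)              # 501 (stop codon excluded)
```

Passing the full gene sequence to `translate` in the same way would allow a comparison against the listed protein sequence.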