Gene Htur_4094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4094 
Symbol 
ID8744722 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp353742 
End bp355964 
Gene Length2223 bp 
Protein Length740 aa 
Translation table11 
GC content63% 
IMG OID646514654 
ProductBeta-fructofuranosidase 
Protein accessionYP_003405601 
Protein GI284167323 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1621] Beta-fructosidases (levanase/invertase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.481915 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGCTG AGGACTCAGA GTCGAAAGAC TCTCCCAGCC TTGGGTTTCT GACCGCGGGT 
GAACTGACGG ACGAACAGTG CGCAGCGCTT CGAGCTGTAC GGGACATTGC GGGCGTGGAC
TGTGTTTCAC TCGAGGATCT ACCGACGGAA CAATCGCTGT CGTCGTTCGA CGCGCTCTGG
TGGCACCGCG ATCTCTCTTT CGCGGACGAA ACGTTCCCTT CCGGTGCGGT CGACGCACTG
ATGGCGTATC TCGACGGGGG CGGCGGACTT CTGTTGAGTC TACACGGAGT CACGACCGTC
AAGGAGCTGG GGATCGATCC GCAGCCGCCG GACCTCATCG ACGAAACGTA CAAACCAGTC
CACAAGTGGG GTCAGCGCCC GGCGGGGTTC CTGATCGGTT CGCGGTTCGC CGACGCCGAC
CTGTTCGAGT CATGTGACGG AAACCGCGTT CACACGCAGC CGTCCCGTAG CGAATCGACA
CCTCGAGTGG CCTACGGACG CCGGATCCCG AAGCGTGGAA CGGTTCTCGC GAGCGCCGCT
GTCGGTGAGA CGGACGTCCC CCGAGAGAAC AGCGTCATCG GTTGGCAAGT CGGGGCGGGA
ACCGTCGTCG GTATCGGCCA ACACTTCACG TTCGATGGCG TACCGTCGGA GTACCTGACG
ACACTGCGGA CGATTCTTCG CGGGACGATC AAGCACATCG CCCGCGGGGA GGCCGCGATG
TACACTCGAC CGCGGAGTGG CGCGGAACTG ACACAGGTCC GGACAGAGAT CGACGACGAT
CCTCACCGCC CCTCGTACCA TTTCACGCCG CCGGCAAACT GGATGAACGA TCCGAACGGA
CTAGTCAAAT GGAACGGCGA GTATCACCTG TTCTACCAGT ACAACCCGGC CGGACCGTAC
CACGGATCGA TCCACTGGGG GCATGCGGTG AGCGACGACC TCGTTCACTG GGAGGATAGG
CCCATCGCGC TAGAGCCCGA CACAGGGGGA CCGGACCGGC ACGGGTGCTG GTCGGGCTGT
ACGGTACTCG ATGACGACGT CCCCACGTTC GTCTACACCG GCGGCGACGG ACATGATCAG
CTCCCCTGTC TCGCTCGCGC GGCCGATGAC GACCTCGATA CCTGGCAGAA GTCCCCACAG
AATCCGATCA TCACCGACCC TCCCGAGCGG CCGCAGATCC TCGCGAACGA CGACTGGAAC
GCCGAGTTTC GCGACCACGA CGTCTGGAAG GAGGACGGAA CCTGGTATCA CCTCATCGGC
TCCGGGACCG AGGATGCCGG TGGAACGGCG CTGTTGTATC AGTCGGACGA CCTTCTCGAT
TGGGCGTACG TCGGTCCAAT CCTAGTCGGA GACCGCGACG AAGACGGGCC GATCTGGGAG
TGCCCGGAAC TACTCGACTT CGGTGACCTG CAGCTGCTGC AGGTCTCGAA CTACGACAAG
GTCGCGTACT TTCTCGGAAC GTTCGACGGT CAGACGTTCG ACCGAAAGGA CTCGGGGACG
CTCGATCACG GGAACTACTA CGCCGCCCAA TCGATCCCCG ACGGCGACGG GCGGTACCTC
TCGTGGGGCT GGATCCGCGA GGATCGCAGT GCGTCCGCGC AGTGGGACGC GGGGTGGTCT
GGTGCCATGT CTGTCCCTCG TTCGCTCTCC CTTTCGTCCG ACGGTACTCT TGTAGTGCAG
CCGGCCGAGG AACTCACTCG TCTTCGGGGA GAACGCGAGA CGATCGACCG CCAGACGCTC
TCCCCGGACG ATCCGTCACC GTGTGACGGC GTCTCCGGCG ACGCGCTGGA AATTCAACTC
GAACTGGAAC TCGACGGTGC TGACGCGTTC GAGCTCGTCG TCGCGTGCTC CGATGACGGC
GAGGAACGAA CATCGATCCG GTACACGGAC GGCAATCGTC TGATCGTCGA TCGTGAACAC
TCCAGCCTCT CGGACGCCGC AAACTCCGAT CCGCAGTCAA TCGACGAGGT ACCCCAGTCC
GACGACGGCA TCGTTCACCT GCACGTGCTC ATCGACGCTT CGGTCATCGA AGTGTTCGTC
AACGATCGAA CCAGTGTGAG CAGCCGAATC TACCCGACTC GGGCCGATAG CACCGGCGTC
TCGCTCGAGG CGGTCGGCGG AGCCGTCGAA CTCTACTCTG CAGACATGTG GTCGCTGGAG
TCCGCGTTCA CAGACGGCTC GAGCGCCGAG GCGGCCTCAA ACCCCGAAAT AAACCTTGAT
TGA
 
Protein sequence
MQAEDSESKD SPSLGFLTAG ELTDEQCAAL RAVRDIAGVD CVSLEDLPTE QSLSSFDALW 
WHRDLSFADE TFPSGAVDAL MAYLDGGGGL LLSLHGVTTV KELGIDPQPP DLIDETYKPV
HKWGQRPAGF LIGSRFADAD LFESCDGNRV HTQPSRSEST PRVAYGRRIP KRGTVLASAA
VGETDVPREN SVIGWQVGAG TVVGIGQHFT FDGVPSEYLT TLRTILRGTI KHIARGEAAM
YTRPRSGAEL TQVRTEIDDD PHRPSYHFTP PANWMNDPNG LVKWNGEYHL FYQYNPAGPY
HGSIHWGHAV SDDLVHWEDR PIALEPDTGG PDRHGCWSGC TVLDDDVPTF VYTGGDGHDQ
LPCLARAADD DLDTWQKSPQ NPIITDPPER PQILANDDWN AEFRDHDVWK EDGTWYHLIG
SGTEDAGGTA LLYQSDDLLD WAYVGPILVG DRDEDGPIWE CPELLDFGDL QLLQVSNYDK
VAYFLGTFDG QTFDRKDSGT LDHGNYYAAQ SIPDGDGRYL SWGWIREDRS ASAQWDAGWS
GAMSVPRSLS LSSDGTLVVQ PAEELTRLRG ERETIDRQTL SPDDPSPCDG VSGDALEIQL
ELELDGADAF ELVVACSDDG EERTSIRYTD GNRLIVDREH SSLSDAANSD PQSIDEVPQS
DDGIVHLHVL IDASVIEVFV NDRTSVSSRI YPTRADSTGV SLEAVGGAVE LYSADMWSLE
SAFTDGSSAE AASNPEINLD