Gene Htur_4088 details

Gene Information

Locus tag: Htur_4088
Symbol:
ID: 8744716
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013744
Strand:
Start bp: 344316
End bp: 346580
Gene Length: 2265 bp
Protein Length: 754 aa
Translation table: 11
GC content: 62%
IMG OID: 646514648
Product: Glycosyl hydrolase family 32 domain protein
Protein accession: YP_003405595
Protein GI: 284167317
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1621] Beta-fructosidases (levanase/invertase)
TIGRFAM ID:
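
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 344316
end_bp = 346580


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2265 754
```

The printed values match the Gene Length (2265 bp) and Protein Length (754 aa) fields.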


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGACC GAGACGATAG CGATGACTGC CGTGTCGTAG CGTACTGGCC GTTCGACGAA 
AGCGAGGGCT CGTCCGCCGA GGAAGTCGTG TCCGGACGCC GGGATGCGAT CGAGCACGCG
TTCGCGGACG CACGGTTCAA ATCGGATAGT GATCCGCGGT GGATCGACGG CGTCACGGGA
AGCGGACTGT TGTTCGACGG TTACTCCACG CAAATCGAAC ACGACCGGCA GATTCTCGAC
GGAGAGACGA CCCGGCTCAC GGTCGAAGCG TGGATCGCCC CCCGCGCGTT CGAGGGTCAG
TCGTCAGAGC GTCTCTCCCC GATTATCAGC AACCACTCGG TCGACGATAA CCGCGGATTC
GAGTTTGGAC TCGACGGACA CGGCCGGTGC TCGTTCCAAG TCGGACTGGG AGACACGTGG
GTGGCTGTCC AGACGGAATC ACCGCTTCCC AAATATTCCT GGTCGCATGT CGCAGCAGTC
TTCGAGGGCG ACGACGGTAC ACTCCGACTC TACGTCGACG CCGCATGCAC TGCAGTGGAA
CGAGTTCCCG AGGGGAGCGT GATCTTCCCG GCCGACGTCC CGGTTCTGAT CGGAAAGAAC
AACCGAACCG AACGCGTCGA TGATACGTTC GCTTTACACA ACTTCGCAGG AGCGATCGAC
GAACTCAAAA TCTACGACTG CGCGTTCACC GCCTCCGACG TCCCCGAGCG GTACGAGACG
CCGTGTCACG ACGAACACCC CTCGATGAAC TACGAAACGA TCGCCCTCGA TCCGTCGCGA
TATCGCGGAG ATCGTCATCG TCCGCAGTAC CATCCGATCC CGCCCGGTCA CTGGATGAAC
GAACCGCATG CACCGCTGTA TCACGACGGT CAGTATCACC TGTTCTACCA GCACAACCCC
AGCGGACCGT ACTGGGGCAA TATCCATTGG GGGCACTGGG TGAGTGACGA TTTAGTCCAT
TGGCGGCACC TTAAACCGGC ACTGGCGCCC GAACGCGACG GACTGGCTCC CGACGGGATC
TGGTCCGGTG GGTCGACTCA CGACGCGGAC GGGGACCCTG TTCTCCTCTT TACCGCGGGA
GAAATAGACC GCACTCCCGA TCAGCGCGTC GTGGCAGCGT CTCCTGTTGA TCCCGACGAC
CCCGAATTGA CGTCTTGGCG TCAGGACGAC GAGTCCGCGA TCGAGCGCCC GCATTCGATC
GGGCTTCGGG ACAACGACTT TCGCGACCCG TTCGTCTGGC GGGAGAACGG GACGTGGTAC
TGTCTCGTCG GGTCCGGATT CGCTACCGGT GGCGGTGCGG CCCTCGTATA CGAGTCTGAG
AGCTTGGCGG AGTGGGTGTT CCGGGGCTGT CTCCATCGAA CGGATCACGA CGAGTACCCG
GAGCTCGGGC TCGTCTGGGA GTTGCCCGTC TTGCTGCCGA TCGGCGAAGA CGAGACCGAC
GCCGAGAAGT ATGCGTTCAT CGTCAGTCCG ATTGAGGGCG CCGCGGAAGT AGAAGTGTAC
TATTGGCTCG GCGAGTGGGA CCCCGACGCC TGTCGGTTCG TTCCGGACCA CGAGGACCCG
CGACGTATCG ATTACGGGGG GTTTCACTTC ACTGGTCCTC ACGGTATCGT CGACCCCGAA
ACGGGTCGGA GCCTCCTATT CACGATCGCA CAGGACGACC GTCGTCCGCG GGATCACTAC
GACGCCGGAT GGGCCCACAA CGGCGGTCTC CCGGTACACC TGTTTCTTCG CGACGACGGT
CGGCTCGGCA TCGAACCGAT CGAGGAACTG CGATCGCTCC GAGCCGAGCG ACTCGCCGAG
ATCCGGAACG CGCAGGTCTC AAATGCGAAC GACGAACTGG ACGGCGTTGG TGGAACCGCA
GTCGAGATTC GAGCAACAAT GGCGTCCGAC GGCGCTGAGA AGTACGGGCT GAAGGTAAGA
GCGAACCCTG ATGGGTCCGA GGAGACGCTG ATATATTACG ACGAACGGAC GGAGCGGATC
GTCGTCCACC GCGAACACAG CACTCGAAAC GCCGAAACAC GGGCGACTGT TTCGGAGCGA
AGCTCGCTCG TCCACCGTGG CGAGGTCAAC CGCGACGGCG AGGACCTCGA ACTACGAGTG
TACCTCGACG GGTCGATGCT CGAGGTATAC GTGAACAGCC TAAAGAGTGT CACTACTCGC
CTGTATCCGG AGGATGAGCG GTCGACCGGA ATCGAGGCGT GGGCGGACGG TGATGTGACC
GTCCAGCGAT TGGACGTCTG GGAACTGGAT AGCGCATACG AGTGA
 
Protein sequence
MTDRDDSDDC RVVAYWPFDE SEGSSAEEVV SGRRDAIEHA FADARFKSDS DPRWIDGVTG 
SGLLFDGYST QIEHDRQILD GETTRLTVEA WIAPRAFEGQ SSERLSPIIS NHSVDDNRGF
EFGLDGHGRC SFQVGLGDTW VAVQTESPLP KYSWSHVAAV FEGDDGTLRL YVDAACTAVE
RVPEGSVIFP ADVPVLIGKN NRTERVDDTF ALHNFAGAID ELKIYDCAFT ASDVPERYET
PCHDEHPSMN YETIALDPSR YRGDRHRPQY HPIPPGHWMN EPHAPLYHDG QYHLFYQHNP
SGPYWGNIHW GHWVSDDLVH WRHLKPALAP ERDGLAPDGI WSGGSTHDAD GDPVLLFTAG
EIDRTPDQRV VAASPVDPDD PELTSWRQDD ESAIERPHSI GLRDNDFRDP FVWRENGTWY
CLVGSGFATG GGAALVYESE SLAEWVFRGC LHRTDHDEYP ELGLVWELPV LLPIGEDETD
AEKYAFIVSP IEGAAEVEVY YWLGEWDPDA CRFVPDHEDP RRIDYGGFHF TGPHGIVDPE
TGRSLLFTIA QDDRRPRDHY DAGWAHNGGL PVHLFLRDDG RLGIEPIEEL RSLRAERLAE
IRNAQVSNAN DELDGVGGTA VEIRATMASD GAEKYGLKVR ANPDGSEETL IYYDERTERI
VVHREHSTRN AETRATVSER SSLVHRGEVN RDGEDLELRV YLDGSMLEVY VNSLKSVTTR
LYPEDERSTG IEAWADGDVT VQRLDVWELD SAYE
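
The sequences above are printed in blocks of 10 characters, so they need the whitespace removed before any analysis. The sketch below shows one way to do that, compute a GC fraction, and translate a coding sequence. It is illustrative only: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, the input is assumed to contain only A, C, G, and T, and the codon-to-amino-acid mapping used is the standard one, which translation table 11 shares (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard mapping, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence listed above.
first_line = "ATGACCGACC GAGACGATAG CGATGACTGC CGTGTCGTAG CGTACTGGCC GTTCGACGAA"
print(translate(first_line))  # MTDRDDSDDCRVVAYWPFDE
```

The translated residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.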