Gene Htur_4649 details

Gene Information

Locus tag: Htur_4649
Symbol:
ID: 8745252
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013745
Strand:
Start bp: 229586
End bp: 230893
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 58%
IMG OID: 646515160
Product: glycoside hydrolase family 4
Protein accession: YP_003406107
Protein GI: 284172725
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases
TIGRFAM ID:
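
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming 1-based inclusive coordinates and a gene length that includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon occupies one codon but adds no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(229586, 230893)
print(length, protein_length_aa(length))
```

With the coordinates of this record, the sketch gives 1308 bp and 435 aa, matching the listed values.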


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAAAGG TCGCGTTCAT TGGAGCGGGA AGTATGGTCT TCGCCAAGAA CCTAGTCGGA 
GACATCCTCT CGTTCGAAGC GCTCAAGGAC AGCACGATCG CGCTCATGGA TATCGACGAG
CACCGCCTCG CTCAGACAAC CGAGGTTGCC GAACAGATAG TCGAGAACAG TCAGATCGAC
GCAACGATCG AGTCGACGAC TGATCGCCGC GAGGCACTCG ACGGTGCTGA CTATGTGCTC
AACATGATCA ATGTCGGTGG GACGGAACCA TTCGAAAATG AGATCCGCAT TCCCGAGGAG
TACGGCGTCA AGCAATCGAT CGGGGATACA CTGGGACCAG GGGGAATCTT CAGGGGACTC
CGGACGATTC CCACGATGCT CGACATCGCT CGAGACATGG AGGAGCTCTG TCCTAACGCA
TTGCTTATGA ACTACACCAA TCCCATGGCG ATCGTCTGCT GGGCTGTAGA CGAGGCCACA
GATATAGATA TCGTCGGACT CTGCCATAGC GTCCCCCACA CCGCGGAGGC GATTGCTGAG
TACGCCGACA TCCCGCTGGA GGAACTCAAT TACTGGGTCG CCGGAATCAA CCATATGGCG
TGGTTCCTTG AGTGTGACTG GGACGGACAA GACATCTATC CGCTGCTCGA GGATGCGACA
GACGACGAGG AGACATATCG GAAGGACACC GTCCGGTTCG AGTTGCTGAA ACACTTCGGT
GCTTTCGTCA CTGAATCGAG CCACCACAAC TCGGAATACC TTCCCTACTT CCGCACGGAC
GAAGATCTCA TCGACGAGTT GACGGGGACG AACTACGCCG AGCGCATGTC AACGGCGACG
TACCTTGAGG GTTGGAAGAA ACGCTCTGAG GAACGGGACG ACGCGCTGAC CGGCGTCAAT
CCCGACGATG TCTCAATCGA GCGCTCCGAG GAGTACGCCT CGCGGCTGAT CCACTCGATC
GAGACAGACA CGCCGCGACG GCTCAACTTG AACGTGCGAA ACGAAGCAGG TCACATCCAG
AACTTGGAGA ACGACGCCTG CATTGAAGTG CCCTGTCTGG TGGACGGCAC GGGAATTCGT
CCGTGTTCAG TCGGCGAGCT GCCACCGCAG CTCGCCGCGC TCAACCGAAC GAACGTGAAC
GTTCAGCGCC TCGCGGTCGA GGGTGCGCTT AAGGGCGACC GCGACGTCGT TCACCAGGCC
GTCAAACTTG ATCCGTTAAC GGCGGCCGAG CTCGACCTCG ATGAGATTCA CGAGATGACC
GAGGAACTGA TCGCAGCGAA CAAAGCATAT CTGCCGGCCC TCGACTAA
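
A GC content figure such as the one in the gene information is the share of G and C bases in the sequence. The following is a minimal sketch of that calculation, assuming the sequence is pasted in the 10-base block layout used here, so whitespace is stripped first. The function name `gc_content_percent` is hypothetical, and the rounding used by the source database is not known.

```python
def gc_content_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    dna = "".join(sequence.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = dna.count("G") + dna.count("C")
    return 100.0 * gc / len(dna)


# Toy input in the same blocked layout: 4 of 8 bases are G or C.
print(gc_content_percent("GGCC AATT"))
```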
 
Protein sequence
MPKVAFIGAG SMVFAKNLVG DILSFEALKD STIALMDIDE HRLAQTTEVA EQIVENSQID 
ATIESTTDRR EALDGADYVL NMINVGGTEP FENEIRIPEE YGVKQSIGDT LGPGGIFRGL
RTIPTMLDIA RDMEELCPNA LLMNYTNPMA IVCWAVDEAT DIDIVGLCHS VPHTAEAIAE
YADIPLEELN YWVAGINHMA WFLECDWDGQ DIYPLLEDAT DDEETYRKDT VRFELLKHFG
AFVTESSHHN SEYLPYFRTD EDLIDELTGT NYAERMSTAT YLEGWKKRSE ERDDALTGVN
PDDVSIERSE EYASRLIHSI ETDTPRRLNL NVRNEAGHIQ NLENDACIEV PCLVDGTGIR
PCSVGELPPQ LAALNRTNVN VQRLAVEGAL KGDRDVVHQA VKLDPLTAAE LDLDEIHEMT
EELIAANKAY LPALD
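
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it does not treat alternative start codons specially, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative only.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()  # drop block spacing and newlines
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGCCAAAGG TCGCGTTCAT TGGAGCGGGA"))  # MPKVAFIGAG
```

Applied to the final line of the gene sequence, the same function yields EELIAANKAYLPALD, the C-terminal end of the protein sequence.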