Gene Huta_1049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_1049 
Symbol 
ID8383323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp1015469 
End bp1016956 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content68% 
IMG OID644972114 
Product4-alpha-glucanotransferase 
Protein accessionYP_003129965 
Protein GI257052132 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1640] 4-alpha-glucanotransferase 
TIGRFAM ID[TIGR00217] 4-alpha-glucanotransferase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCTTCG ACCGGCAAAG CGGCGTTTTC CTGCATCTCA CCTCGCTGCC CAGCCCGCAC 
GGGATCGGCG ACCTCGGTGA CGGCGCGCGG ACGTTTTTGG ACTTCCTGGA GCGTGCCGAG
CAATCGCTGT GGCAGTTCTG TCCCGTCACG CCGACCCGTG GCGTCCACGG CCACTCGCCG
TACGCCTCTC CCTCGGCGTT CGCCGGCAAC CCGCTCCTCG TCGACCTGAC CGCCCTCGTC
GAACGGGGAT GGCTCGACGA GGAAACACTC GAGAACCCGC CGGGCGACCC ACGTACAGTC
CAGTACGACA CTGTGACTGA TTTCAAGCGT GAACGTCTCA GCGCGGCCTT CGACGGGTTC
GAGGCGAGCG CCGAGGCGGA CGACCGGGCG GCCTTCGAGG CGTTCCGTGA GCGCGAAGCC
ACGTGGCTCA GGGACTATAC CCTGTTCACC GCGCTGAAAG CGGCCTACGA CGGGGTGCCC
TGGCCCGAGT GGCCGGCCGA CCTCGCCGGA CGCGACCCTC CGGCACTCGA GGCCGCGCGG
GAGACCCATG CCGAGGCGAT CCGGTATCAC GCGTTCGTCC AGTGGCTCTT CGACGAGCAG
TGGCGCGCGC TGCGGGCGGC GGCCGACGAG CGTGGTATCT CACTCGTCGG CGACCTCCCG
ATCTACGTCG CCTGGGACTC GGCGGACGTC TGGGCGAACC CCGAGGCCTT CGAACTCGAC
GACGAGGGGG GGCCGACCGC GGTCGCGGGT GTCCCGCCGA ATCCCGGCGA CGATGGTCAG
CGCTGGGGCA ACCCGGTCTA CGACTGGGAG ACGCTCCGGG CCGAGGACTA TGGCTGGTGG
CGCGACCGGC TGGACCGACT GCTCTCGCTG GTTGATATCG CCCGCATCGA CCACTTCAAG
GCCTTCGACG AGTACTGGGC CATCCCGGCC GACGCCGACG ACCCTGCCGC CGGCGAGTGG
CGACCCGGAC CCGGCGCGGA CTTCTTCGAG ACGATCCGGG CCGAACTCGG GGAGTTGCCG
TTCGTCGTCG AGGATCTGGG CTTTCTCGAC GAGAGCATGG TTGCACTCCG GGATCGCTTC
GAGTTTCCGG GGATGCGCGT CCCGCAGTAC GCCGACTGGT GTCGGGAGGG CCACCGCTAC
AAACCGACGG TCTATCCGGA CCACTGCGTC GGCTACACGT CGACGCACGA CACGGACACT
GCGGTGGGAT TCTACGAGAA GCTCTCGGCC GAGCAACGCG ATTGCCTCGA ATACGCGCTG
GCGACCGACG GGGATTCGAT CGCCTGGGAT CTGATCGAGG CCGTCTGGCA CTCCGACGCG
GCCCTGGCGA TGACGACAGT GCCGGATCTG CTCGAACGCG GGAGCGATGC CCGACTGAAC
CAGCCGGGTA CCGGCGAGGG CAATTGGACC TGGCGGGTGA CTGCCGACGA ACTCGACGCG
GACACCGCTG ATCGGCTGGC AGCAGTCACG CGCGCGTCCC TCCGGTAG
 
Protein sequence
MSFDRQSGVF LHLTSLPSPH GIGDLGDGAR TFLDFLERAE QSLWQFCPVT PTRGVHGHSP 
YASPSAFAGN PLLVDLTALV ERGWLDEETL ENPPGDPRTV QYDTVTDFKR ERLSAAFDGF
EASAEADDRA AFEAFREREA TWLRDYTLFT ALKAAYDGVP WPEWPADLAG RDPPALEAAR
ETHAEAIRYH AFVQWLFDEQ WRALRAAADE RGISLVGDLP IYVAWDSADV WANPEAFELD
DEGGPTAVAG VPPNPGDDGQ RWGNPVYDWE TLRAEDYGWW RDRLDRLLSL VDIARIDHFK
AFDEYWAIPA DADDPAAGEW RPGPGADFFE TIRAELGELP FVVEDLGFLD ESMVALRDRF
EFPGMRVPQY ADWCREGHRY KPTVYPDHCV GYTSTHDTDT AVGFYEKLSA EQRDCLEYAL
ATDGDSIAWD LIEAVWHSDA ALAMTTVPDL LERGSDARLN QPGTGEGNWT WRVTADELDA
DTADRLAAVT RASLR