Gene Huta_1052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_1052 
Symbol 
ID8383326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp1018643 
End bp1020667 
Gene Length2025 bp 
Protein Length674 aa 
Translation table11 
GC content69% 
IMG OID644972117 
Productalpha amylase catalytic region 
Protein accessionYP_003129968 
Protein GI257052135 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATCACC CCGGCCCACC CCACTTCGTC GCCGTCGGCG AAGCAATCGA ACTCGCCCCC 
CGCGATCCGG ACCCGGAAGC GACCTACGCG TGGGACGTGA CCGCCCGACC GGACGGATCG
ACAGCGACCG TCGGCGACGA CCCCGTCGAG CATCTCGAAC CCGACGTCGC GGGAACCTAC
GTCGTTCACC TCGCGGCCCC CGACGGCGGC CACGACCTCA CCGTTCGCGC GTTCGCGTCG
GAACTGACCC CCTCGACGGG CGGCGTCTCC GGGGCATCGG GCGTTTCGGC GGGCGAGAGT
GGATTCGAGA GCGGGGGATC CGGATCGGGC GGTCGATCGG GCAGTGCTCG CGCCGACGCC
GTAACCGGTG ACGGGGGCCG ACCCCGGCTC ACCCTCGAAC CCGCGATCGA GGGGGACGAA
GCCGTCGTCC GGGCCGATCC GCTCCCGCAT CCCGACGGAC CGGAGACGTC GGCCGACCTG
GCCGTCGAAT TTCTGCTCGA CGACCGGGAC AATGTCGACC GCGAGGCGGT GACCATCGAT
GAGACCGAGC TCCGGGTTCC GCTCTCGGCG ATCGACGACA GACTACGGGT GCACGCCGTC
TCGGTCGGGG ATCGAGGCTA CAGCGTCCCG GACGCCGTCG AGTTCGCGCG GGAGGACCCG
ACCGGGTCGG CAGTCACGGA TGGGAGCGTG ACTGCCAGTC ACATCTACGA GCCACCCGCG
TGGGCCGAGG ACACGATCAT CTACGAGATC TACGTCCGGA CCTTCGCGGG CCAGGCAGGT
GATCAGCGGA GCGGTGGCGA TGGAGCGGCT GCGGGCGAGG GGGAAACCGA GCGCTCGGCC
TTCGACGCGA TCGTCGACCG GCTGGACTAC ATCGAGTCCC TCGGAGTGGA TACCCTCTGG
CTGACGCCGG TCCTGGAGAA CGACCACGCG CCCCACGGGT ACAACATCAC GGACTTCTTC
TCGATCGCCG AGGATCTTGG GTCACGGGCC GATTACGAGC GCCTCATCGC GGCCGCCCAC
GACCGCGGGA TGAACGTCCT GTTCGACCTC GTGTGCAATC ACTCCGCGCG AACCCATCCG
CACTTCCAGG CGGCCGTCGC CGATCCGGAC AGCGAGTACC ACGAGTGGTA CGAGTGGCGC
GGCCCCGGCG AGCCCGAGAC GTACTTCGAG TGGGAGCACA TCGCGAACTT CGACTTCACG
CACCTGCCGG TCCGGCGACA CCTCCTCGAC GCGATCGATC AGTGGGCCCC ACTGGTCGAC
GGCTTCCGGA TCGACATGGC GTGGGCCGTG CCGAACAACT TCTGGCGGGA GGTCCACGAC
CGGGCGAAAG CCATCGACAG CGAGTTCCTG CTACTCGACG AGACGATCCC GTACATTCCG
GACTTCCAGG GCGGGTGTTT CGACATGCAC TTCGACTCGA CCACGTACGC GGCGCTGCGT
CGGGTGGGCA ACGGCGCGCC GGCCGCGGAA GTGCTCGATG CTGTCGACGA ACGCGCAGCG
ATCGGCTTTC CGCCACATGC CGGGTTCATG CTGTACGCGG AGAACCACGA CGAGACGCGC
TATCTTGTCG AATGTGGTCG TGCGGCCGCC CGTGCCGCCA CGGGCGCGCT GTTCACGCTG
CCGGGGTCCC CACTGGTGTA CGCCGGCCAG GAGTTCGGCC AGCGCGGCAA GCGCGACGAC
CTCGCGTGGG AACACGCCGA CGAGGACCTC CAGGCGCACG TCCGACAGCT CGCGGCGGCG
CGTCGCGACG TGACGGCGCT GGAATCGGCC GCGACACTCC ATCGCATCGA GTGGACCGTT
CAGTCGGGTG CGGCCGACCG TGTCGTCGCG TTCGGACGCG TTCGTGGCGA CGATGCCGTC
GTCGTCGTCC TGAACTTCGG GCCGGAGACG GCGACAGTCG AACTACCGAT AGCGACCGGG
ACGACCGACG CAGTGTCCGG GAAGGCTGTC GGAACTGGCG AGGGGGGACT GCGGGTCGAC
AACGTTCTCG TCGTGCCGGC CGAGTCGGAG ACGATCAAGG GGTAG
 
Protein sequence
MHHPGPPHFV AVGEAIELAP RDPDPEATYA WDVTARPDGS TATVGDDPVE HLEPDVAGTY 
VVHLAAPDGG HDLTVRAFAS ELTPSTGGVS GASGVSAGES GFESGGSGSG GRSGSARADA
VTGDGGRPRL TLEPAIEGDE AVVRADPLPH PDGPETSADL AVEFLLDDRD NVDREAVTID
ETELRVPLSA IDDRLRVHAV SVGDRGYSVP DAVEFAREDP TGSAVTDGSV TASHIYEPPA
WAEDTIIYEI YVRTFAGQAG DQRSGGDGAA AGEGETERSA FDAIVDRLDY IESLGVDTLW
LTPVLENDHA PHGYNITDFF SIAEDLGSRA DYERLIAAAH DRGMNVLFDL VCNHSARTHP
HFQAAVADPD SEYHEWYEWR GPGEPETYFE WEHIANFDFT HLPVRRHLLD AIDQWAPLVD
GFRIDMAWAV PNNFWREVHD RAKAIDSEFL LLDETIPYIP DFQGGCFDMH FDSTTYAALR
RVGNGAPAAE VLDAVDERAA IGFPPHAGFM LYAENHDETR YLVECGRAAA RAATGALFTL
PGSPLVYAGQ EFGQRGKRDD LAWEHADEDL QAHVRQLAAA RRDVTALESA ATLHRIEWTV
QSGAADRVVA FGRVRGDDAV VVVLNFGPET ATVELPIATG TTDAVSGKAV GTGEGGLRVD
NVLVVPAESE TIKG