Gene Huta_2399 details


Gene Information

Locus tag: Huta_2399
Symbol:
ID: 8384698
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 2459785
End bp: 2462739
Gene Length: 2955 bp
Protein Length: 984 aa
Translation table: 11
GC content: 65%
IMG OID: 644973472
Product: glycoside hydrolase family 9
Protein accession: YP_003131298
Protein GI: 257053465
COG category:
COG ID:
TIGRFAM ID:
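
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The helper names `span_length` and `expected_protein_length` are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2459785
END_BP = 2462739
GENE_LENGTH_BP = 2955
PROTEIN_LENGTH_AA = 984


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the record is internally consistent: the span covers 2955 bp, which is 985 codons, or 984 residues once the stop codon is excluded.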


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.466816
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGAAC ACGACGCGAA ATCAGGGGGA ATGGGGCGGA CGACGACCGA CGGCGACACC 
GATCTGTTCC GTCGGGACCT GCTTGCGGCG ATGGGACTGG GCGCGGGATC GGTCGCGCTC
GGGACGGACG TGGCCACGCC GAGCGTGGTC TCCCGAGCAG CCGCACAGAC TGACCTGGGA
TTCGATTACG CCCATGCGCT CCAGCAGTCA CTGTACTTCT ACGACGCCAA TCGCTGTGGC
GCAACAACGA TGGGCAATCG CCTCCAGTGG CGCGGCGAGT GCCACCACTC GGACACTGAG
ATCCCGCTCG ATGCGGCGAC CGAGGACGGC GGGACCAACC TCTCTGGGAG CTTCATTGAG
GAGTACAGCG ACGTGCTCGA CCCCGACGGC ACCGGGACGA TCGACGTCAG CGGCGGGTTT
CACGACGCCG GCGACCACAT GAAGTTCGGG CTCCCCCAGT CCTACAGCGC CTCGACGCTT
TCCTGGGCGC TCTACGAGTT CGAGGACGCC TTCAGGGATG TCGGCTCCTA CGACCACATG
GTCGACATCC TGCGGCACTT CGCCGATTAC TTCCTGAAGT CGACGTTCCG GGATGACGAA
GGCAACGTCG TCGCCTTCTG CTATCACGTC GGCGAGGGGA GCATCGACCA CAACTACTGG
GGGCCGCCGG AACTCCAGTC GTCCGAGGAG TATCCCCGGC CAGCCTACTT TGCTACGCCG
GAGGATCCAG CCAGCGACCA GTGCGCCGGG ACGGCCGCGG CGCTGACGAT CACCTCGCTC
GTCCTGGAGT CCGAGGATTC GGCGTACGCG GCGGAGTGTC TCGACACGGC ACAGGCGCTG
TATGACTTCG CCGTCGAGAA CCGCGGACTT GGGTACGACG GGGGTTTCTA CGACTCGAGT
TACGACGAGG ACGAACTCTC CTGGGCGGCG GTCTGGTTGC ACATCGCGAC TGAGGACGAC
GCGTATCTGG ATGACATCCT CGCGACCGAC GATTCGGGCA CCTACACCGG GTATCTCGGG
GAGATCATCG ACTCGACCGA CGACGACTGG CAGAACATCT GGGTCCACTC CTGGGACACG
GTCTGGGGCG GTGTCTTCCT CAAACTCGCG CCGATCACCG ACGACCCCGA GCACTGGCAG
ATCGCCCGCT GGAACCTGGA GTATCTCTCT GGCGGTTCGG TCGAGCATGA AGACGACAAC
GATACGAACT ACGCCTCGAC GTCGGATGCT GGGTTCACCG TGCTCAACAC GTGGGGGTCG
GCCCGGTACA ACGCCGCCGC GCAGTTCCAG GCGATGGTCT ACCGGAAGTA CCGCGACACC
GAGAAGGCGG TCGCACTCAC CGACTGGGCG GCGACCCAGA TGAACTACAT CATGGGTGAC
AACTCCTTCG GGTACTCGCT GATCGTCGGG TTCACCGACG ACCACGCCGA GCATCCCCAC
CACCGGGCTG CCCACGGCTC GAAGGAGAAC AGCATGGAGG AACCCGAAGA GCACCGCCAC
ACGCTGTGGG GTGCCCTGGT CGGCGGGCCC GACGAGGACG ACACCCACGT CGACGAGACC
TCGGATTACG TCTACAACGA GGTCGCGATC GACTTCAACG CGGGACTGGT CGGCGCGCTC
GCGGGGTTCA ACACCTTCTA CGACGATACC GGCGAGGCAG TCGCGGAGTT TCCCCCGGGT
GAGGAGCCGA TCGACGCCTA CTACGCCGAG GGGGAGGTCC TCCAGGAGAA CGCCGACCGG
ACACAGGTTC GAGTGACGAT CCACAACGAA TCGATCCACC CGCCCCATCG CGAGGACGGC
CTCAGCGCCC GCTATTTCAT CGACGTCAGC GAGTTGCGCG ACGCCGGCCA GTCGATCGAC
GCCGTCTCGG TCGAGGTCCA GTACGACCAG CAATCAACCA TGGGCGATGG GTCGGCCGAC
GTCTCGGGGC CGATCGCCTG GGACGAGGAT GCCGGCATCT ACTACATCGA ACTCGACTGG
TCGGGCAACC AGATCTATGG CGCACGGGAG ATTCAGATCT CGATGATCGC CGAGCAGGAC
GACAACTGGG AGAGCAACTG GGATCCATCG AACGACCCGA GCTTCCAGGA CATCGGCGAG
GCGGCGACCG TCACCGAGGC GATCTCCGTC TACCTCGACG GCGAACTGGT TTACGGCCAG
CTTCCGGGTG AGTCCGAGTC TGAGCCCGAC GACACGACCG CTCCGACGGC TCCCTCGAAC
CTCTCTGTGG TCGAGACGAC AGCGTCCTCG GCCGAGGTCG AGTGGGAGGC CGCCAGCGAC
GAGGGCGGTA GCGGCCTCGA TCACTACACC ATCTCCGTTG CGGGCGACTT CGACCAGCAG
GTTGGGGCAG GGACGACGAC GGCGACTGTC GAGGAACTGG ACGCCGAGAC GACCTACGAG
ATCGGCGTCT CGGCGGTCGA CGGTGCCGGC AACGAGTCCG ATACGGTAAC CGTCGAGGCG
ACGACTGACG AGGCCGACGA CGGCGAGGAC GACAGCGATG ACGAGGAATC ACCGACGGAT
GCCCTCGTCG TCAACGACTA CGACGGCGAC CCGGCGTGGT CGAGCAACCG GAACGACCTC
GGCCAGTGGT GTGGGGCCGG CTCCTTCGAG AACGGGGCAG GCGAGGTAGC CGACGGGGCG
CTGGTCCTCG AATACGACAA CGGTGGCTGG TACCAGGAAC AGATCAACCG GGATGTGAGC
GACTATTCGA GCGTCGTCCT GGATGTGTGC GGTGCGAACG GTGGCGAGGA GAACGAAATC
CGCTTTGCCA TGGGCGGCGT CTCGGGCCTT CTGGGTGATC TCACCGGCGA TTCGATCGGG
ACGAGTGCCG GCGAGGTACG GATCGACATG GAATCAGCTG GCATCGACCC CACTGCTGAG
GGACTCGCGG TGCGGCTAAA CTTCTGGCAG GGGGGTGAGA GCACCCTCGC AATCGAGGCG
ATCCGTCTCG AGTAA
 
Protein sequence
MTEHDAKSGG MGRTTTDGDT DLFRRDLLAA MGLGAGSVAL GTDVATPSVV SRAAAQTDLG 
FDYAHALQQS LYFYDANRCG ATTMGNRLQW RGECHHSDTE IPLDAATEDG GTNLSGSFIE
EYSDVLDPDG TGTIDVSGGF HDAGDHMKFG LPQSYSASTL SWALYEFEDA FRDVGSYDHM
VDILRHFADY FLKSTFRDDE GNVVAFCYHV GEGSIDHNYW GPPELQSSEE YPRPAYFATP
EDPASDQCAG TAAALTITSL VLESEDSAYA AECLDTAQAL YDFAVENRGL GYDGGFYDSS
YDEDELSWAA VWLHIATEDD AYLDDILATD DSGTYTGYLG EIIDSTDDDW QNIWVHSWDT
VWGGVFLKLA PITDDPEHWQ IARWNLEYLS GGSVEHEDDN DTNYASTSDA GFTVLNTWGS
ARYNAAAQFQ AMVYRKYRDT EKAVALTDWA ATQMNYIMGD NSFGYSLIVG FTDDHAEHPH
HRAAHGSKEN SMEEPEEHRH TLWGALVGGP DEDDTHVDET SDYVYNEVAI DFNAGLVGAL
AGFNTFYDDT GEAVAEFPPG EEPIDAYYAE GEVLQENADR TQVRVTIHNE SIHPPHREDG
LSARYFIDVS ELRDAGQSID AVSVEVQYDQ QSTMGDGSAD VSGPIAWDED AGIYYIELDW
SGNQIYGARE IQISMIAEQD DNWESNWDPS NDPSFQDIGE AATVTEAISV YLDGELVYGQ
LPGESESEPD DTTAPTAPSN LSVVETTASS AEVEWEAASD EGGSGLDHYT ISVAGDFDQQ
VGAGTTTATV EELDAETTYE IGVSAVDGAG NESDTVTVEA TTDEADDGED DSDDEESPTD
ALVVNDYDGD PAWSSNRNDL GQWCGAGSFE NGAGEVADGA LVLEYDNGGW YQEQINRDVS
DYSSVVLDVC GANGGEENEI RFAMGGVSGL LGDLTGDSIG TSAGEVRIDM ESAGIDPTAE
GLAVRLNFWQ GGESTLAIEA IRLE
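
The two sequence blocks above are printed in groups of ten characters, so they need to be de-spaced before any analysis. The sketch below shows one way to do that, compute a GC fraction, and translate the DNA. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code for sense codons; it does not model alternative start codons. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only the first line of the gene sequence and the first two groups of the protein sequence are used as sample data.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Excerpts only: first line of the gene sequence, first 20 residues of the protein.
gene_excerpt = clean(
    "ATGACAGAAC ACGACGCGAA ATCAGGGGGA ATGGGGCGGA CGACGACCGA CGGCGACACC"
)
protein_excerpt = clean("MTEHDAKSGG MGRTTTDGDT")

print(translate(gene_excerpt) == protein_excerpt)
print(f"GC fraction of excerpt: {gc_fraction(gene_excerpt):.3f}")
```

Pasting the full gene sequence into `clean` in place of the excerpt would let the same functions be compared against the listed GC content and protein sequence.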