Gene Huta_2111 details

Gene Information

Locus tag: Huta_2111
Symbol:
ID: 8384405
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 2146574
End bp: 2148937
Gene Length: 2364 bp
Protein Length: 787 aa
Translation table: 11
GC content: 69%
IMG OID: 644973180
Product: glycoside hydrolase family 2 sugar binding
Protein accession: YP_003131011
Protein GI: 257053178
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
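
The coordinates and lengths in this record are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the function and variable names are illustrative and not part of the record.

```python
# Values copied from the record above; names are illustrative.
start_bp = 2146574
end_bp = 2148937


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 2364 787
```

Both values match the Gene Length and Protein Length fields listed in the record.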


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGCCT TCACGGGACC GACCGAGCAG CATCGACGGA CCCGCCGCCT CGAAACGGGC 
TGGCGATTTC ACCATGGGGA CGCCGAGGGG GCAGCCGACC CGGACTTCGC GGACGACGAC
TGGCGTCCCG TCGAGGCCCC TCACGACTGG AGCATCGAGG GACCGTTCGA TCCGGAGAGT
CCGGCCGGCA GCGCGCAGGC GTTCCTGCCC GGTGGCGTCG GGTGGTACCG CCGCGAGTTG
CCGAGCGACG TCGAAGGCGA GACCGTCTAC CTTCGCTTTG ACGGCGTCTA TCGCGACAGC
GACGTCTATC TCAAGGGCGA GCACGTCGGC AACCGGCCGA ACGGATACAC GAGTTTCAAT
CACGACGTCA GCGGGGTCGA GGGTAGCGAG ACACTCGCCG TCCGGGTCGA CAATACCGAC
CTGCCGAACT GCCGGTGGTA CTCCGGGTCG GGTATCTATC GACACGTCCA CCTGATCGAG
ACATCTGCGC TGCACGTGGT CCCGTGGGGC ACAGACGTCC GCACGCCGGC CGTGACGGAG
CGTCGTGCCA GGGTAGACGT CCACACCGAG GTGGCCAACG ACGCCGAGGA GGCGGCCGTC
TGTACGCTCT CGACGACGGT CTACGACCCG GCGGGCGAGG TCGTCGCCGA GGCCGCGACC
GAGCAGCGAT TCGCCGCCGG ACAGCAACAC ACGTTCGAGC AGGAACTCGC CGTCGCGGAG
CCCGCGTTGT GGTCGCCGGA GACGCCCGAA CGGTATCACG TCCGAAGCGT CGTCCACCGC
CAGGACCCGG ACGGGACGGC GGAGACGACG GGCGAACCGG TCGACGACTA CGTGACGCAG
TTCGGCATCC GGACGGTCAC GTTCTCGGCC GACGAGGGGG TGTTGCTCAA CGGCGAGTCG
ATGAACCTCA AGGGAGTCAA CCTCCATCAC AGCGCCGGTG CGCTCGGGGC CGCCGTGCCC
GAGCGCGCGC TCGAACGCCG ACTCGAAACT CTGGCAGCGA TGGGGGCGAA CGCCATCCGG
ACCGCCCACA ATCCGCCCCA GCCCGAGCTA CTGGAACTCT GTGACCGGAT GGGCTTTCTA
GTCATCGACG AAGCGTTCGA CAAGTGGCGT CACGAGAAGA CCGGGAAATT CTTCGAGGAG
TGGTGGCGCG AGGACCTCGC AGCGATGATC CGCCGGGACC GCAACCACCC GTCGGTGATC
GCCTGGAGCG TCGGCAACGA AAGCTACGAC CACGGCGAGG CGGAGATGCT CGACGATCTG
GAAATGCTGG TCGAGGCAGC CAACGACCTC GATCCGACGC GGCCGGCGAC CTACGGCAGC
CCGGCCTGGG GCGACGGCCA CGAGGGGATC CTCAAGAACG CCGAGGCGGT CGCCGAGCGG
GTCGATCTCT TCTCGGGCAA CTACATGGAA CACCACTACG ACGACCTCCG CGAGCGGGGC
GTCGACGTGC CGATCGTCGG TTCGGAGTGC CGGCCGTTCT TCCGTGGGTC GGGCGACGAT
CCGCTGGCGT TCGTCCCGCC GAACCCGTGG TTCGACGTCG CCGAGCGCGA CGACGTTGTG
GGGCAGTTCA TCTGGAGCGG CTTCGACTAT CTCGGAGAGG CGCGCGAGTG GCCGAGCAAG
GGCTGGCCGA CCGGCTTGAT CGACACCTGC GGCGTGCCCA AGCCCCCGGC CGCCTTCCAT
CGGAGCGTCT GGAGCGACGA ACCGATGGTC GAGATCGCGG CGTTCGATCC TGCTCGCGAG
CGCGCGCCGG CCCGACCGGC GTGGTCCTGG CCGGCGCTCG CCGGTCACTG GACGTTCCCC
GACCGCGAGG ACTCCCGCGG GTTCGTCCAC GTCGTGACGT TCACCAACGC CGAGACGGTG
ACGCTCTATC AGAACGACGA GCGCCTCGGC GTCCAGCACC TCGCGGACAA CCCCGACCAC
ATGATCGAGT GGTACGTCCC CTACGAGTCG GGGACGTTGC GGGCCGTGGC GGAGACCGAC
GGCGAGGTCG TCGCGACTCA CGAACTCCAG ACGGCGGGCG ATCCGGCCCG GGTCGAACTC
GATCCCGACC GCGAGGCCAT CACGGCCGAC GGGCAGGATC TGGTGTACGC CGACGCGCGG
ATCGTCGACG ACGACGGCGT CGTCGTGCCG CGGGCCGACC ACGAGATCGA ATGCTCCGTC
AGCGGCGCTG GCGACCTCGC CGGCGTCGAC AACGGCGACC TGGCGAGCAA CGAGTCCTAC
ACCGACTCCC GGCGGTCGGC CTACCACGGA ACGGCGCTGG CGATCGTCCA GGCTGATCGT
TCGCCCGGCG AGGTGACGAT CACGGCCGAC GTCGAGGGCC TCGAGGGCGA CGAGGTGACG
ATCCCGGTTC AGGCCCCGGA GTGA
 
Protein sequence
MTAFTGPTEQ HRRTRRLETG WRFHHGDAEG AADPDFADDD WRPVEAPHDW SIEGPFDPES 
PAGSAQAFLP GGVGWYRREL PSDVEGETVY LRFDGVYRDS DVYLKGEHVG NRPNGYTSFN
HDVSGVEGSE TLAVRVDNTD LPNCRWYSGS GIYRHVHLIE TSALHVVPWG TDVRTPAVTE
RRARVDVHTE VANDAEEAAV CTLSTTVYDP AGEVVAEAAT EQRFAAGQQH TFEQELAVAE
PALWSPETPE RYHVRSVVHR QDPDGTAETT GEPVDDYVTQ FGIRTVTFSA DEGVLLNGES
MNLKGVNLHH SAGALGAAVP ERALERRLET LAAMGANAIR TAHNPPQPEL LELCDRMGFL
VIDEAFDKWR HEKTGKFFEE WWREDLAAMI RRDRNHPSVI AWSVGNESYD HGEAEMLDDL
EMLVEAANDL DPTRPATYGS PAWGDGHEGI LKNAEAVAER VDLFSGNYME HHYDDLRERG
VDVPIVGSEC RPFFRGSGDD PLAFVPPNPW FDVAERDDVV GQFIWSGFDY LGEAREWPSK
GWPTGLIDTC GVPKPPAAFH RSVWSDEPMV EIAAFDPARE RAPARPAWSW PALAGHWTFP
DREDSRGFVH VVTFTNAETV TLYQNDERLG VQHLADNPDH MIEWYVPYES GTLRAVAETD
GEVVATHELQ TAGDPARVEL DPDREAITAD GQDLVYADAR IVDDDGVVVP RADHEIECSV
SGAGDLAGVD NGDLASNESY TDSRRSAYHG TALAIVQADR SPGEVTITAD VEGLEGDEVT
IPVQAPE
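
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch rather than a reference implementation: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Whitespace is stripped first because the sequences are printed in blocks of ten.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to format the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence, copied from the record.
first_row = "ATGACTGCCT TCACGGGACC GACCGAGCAG CATCGACGGA CCCGCCGCCT CGAAACGGGC"
print(translate(first_row))  # prints: MTAFTGPTEQHRRTRRLETG
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.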