Gene Huta_2398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_2398 
Symbol 
ID8384697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp2456668 
End bp2459682 
Gene Length3015 bp 
Protein Length1004 aa 
Translation table11 
GC content67% 
IMG OID644973471 
ProductCarbohydrate binding family 6 
Protein accessionYP_003131297 
Protein GI257053464 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0137535 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGACG AAGCGACCGA ATCGATTGAA GCATCGGCGA CTGATCACAC TGACGAGACA 
GCTGGAAATC GCAAGGACCC CGGTCTCACC TCGTCACGCC GGACGTTCCT CGGGGCGATG
GCGAGTGCTG GGACGATCGG TGCTGGGCTT TCGGCCGCCA CTGGGACCGC TGCCGCCGGT
GTGCCGACGC CACGGCTGCA CACCGAGGGG CGGTGGATCC GCGATCCGGC GGGCAACGAC
GTGACGCTCC GGGGGATGGC ACCCGCTGAC CCCGGTTTCT ACCGGCAGTA CCATCCCAAG
AGCTTCGAGG AAGTGCTGGA GTGGGCGACT GACACGGATC GGGGCTGGCA TCCCAACATT
GTCCGGCTAC CCTGTACGCA GGACTCGATC GACGCGCTGG GCCTGGAAAC GTACGTCACC
GAGGTCCTCC GCCCCGCGGT CGACCTGCTG GCCGCGCGGG ATGTCTACGC GCTGGTGGAC
TTCCACCTCA TCAGGCCCTA CACACAGGAT GCAACCGAGA CGTACAACGA GGAAAACGAC
GACGACCTCG CGCCGATCGA CGACGTGATG ACGACCTTCT GGGATCGGGT CGCCCCGGAG
TTCGCCGAGG ACGAACACGT CATCTACGAG CTGTTCAACG AGCCGACCCA GCCGGCGATG
TACGGCGACG ATGCCGGTGC CTTTCAGGCC TGGCGGGACG CCGCCCAGCC GTGGGTCGAC
CTCGTCCGCG AACACGCGCC GGAGACGCCG ATCATCATCG GCTCGCCGCG GTGGACGTCG
GTGACCCACA TGGCGCCGGA GTATCCCTTC GATGGGGAGA ACCTGATCTA CGCGGCGCAC
ATCTACCCCG ACAACGGCCC GCCCGCGGAC TTCGACCAGT GGTACGGCGA ACCCGCCACC
GAAGTCCCGG TCGTCGTCAC GGAGTTCGGC TGGGAACCCA CCGGGGGCTC CGTCGATCAG
GGCACCACCT CCGGGTGGGG CGAGCCGTTC CGCGAGTGGG TCGAGGGCTA CGAGAACATG
GGGTGGATCT CGTGGTGTTT CGACGACTCC TGGGAGCCGG CCTTCTTCGA GTCGCCGGAC
GCTGGGGCCA ACGAGCCCTG GACGCTCAAG GACGACGCAG ATCAGATGGG GGGGTACATC
AAGACCTGGC TGGAGGCAAC CAAAGATCAG GGCATCCCGG AGAGTGCGAT CGACGACGAC
GTCGCGCCGC CGGTTCCATC CGGCCTCGAG GTGACCCGTT CGACCGAGAT CAGCGTCGAG
ATCGCCTGGA ACGCCGTCAC CGACGAGGGC GAGGCCGGCC TCTCCCATTA CAACGTCTAC
GTCGACGGCG AGCGCCGCGG GCAGGTGATC GACGGGACGG CGACGACGGT CGACGGCCTG
GAGCCGGCTT CGACCTACGA GGTCGGTGTT TCTGCCGTCG ACAGTGCGGG CAACGAGTCC
AATCAGACGA CGACGGTCGC CGAAACGATT GCCACCGACG CCGGCCAGTC GGCGTTCGTC
GAGCACGAAC TCCCGGGCCG CATCCAGGCC GAGGACTTCG ACGAGGGTGG CCAGGGAATC
GCCTATTACG ATACAGGATC CACGAACGAG GCCGGGGCCG ACTACCGCGA GACGGGCGTC
GACATCGGGA CGGCCGTCGA GTCGGGGTAC AACGTCGGCT ACACCGAGAC CGGCGAGTGG
CTCGAGTACA CTGTCACCGT CGAATCCGGT GGTAGCTACG AGGCCACCGT TCGGGTTGCC
AACGGCGCTG ATTCGGGTGG CGACCTCCGG ATCGAGGTCG ACCGCGCCGA GGTGGCGACA
CAGAACGTCT GGCCGACCGG CGGCTGGGAG AACTTCGAGG AGATCCGTGT CGGCGAGGTC
GACATCCCCG AGGGCGAGCA CGTCATCCGG ATCGTCGTCG AGACCAGCGG CTGGAACTTC
GACTGGATCG AGTTCACTGG CGGCGACGGC GGCGGCGAGG ACGTGACCCC GCCGACTGCT
CCCTCGAACC TCTCAGTGAC CACGACGACG CCGTCATCCG CCGAGATCGC GTGGGATGCC
GCGACCGACG AGGGCGGGAG CGGACTCGAT CACTACGCGG TGTACGTCGA CGGGAGTCTC
GATCAGCAGG TTCCGACCGG GACCACGTCG GCGACGATCG CGGATCTCGC GGCCGAGACG
AGCTACGAGA TCGGCGTCTC GGCCGTCGAT GGGGCAGGCA ACGAGTCCGA ATCGGTGACT
GTCGACGTGA CGACCGACGC CGGCGACGAC ACGACCCCGC CGACTGTCCC CGGCGACCTC
TCGGTCGATG GGACGACGGC CACGTCGATC GACGTCGCCT GGAGTGGTGC TTCGGACGCC
GGCACGGGTG TCGACGCCTA CGCCGTCTAC GTCGACGGGA GCCGTGATCA GGCGGTTAAG
GCAGGGACGA CGACGGCGAC GATCGACAGC CTCTCGGCGG TGACGACCTA TGAGGTCGGG
GTTTCGGCGA TCGACGGGGC CGGCAACGAG TCGGCGACGG CGACCGTCGA GGCCACCACC
GACCAGAGCG ACGACGGCGA AGACGATGAG GACGACGAAT CACCGGCAGA CGCCCTGGTC
GTCAACGATT ACGACGGCGA TCCGTCGTGG TCGAGCAATC GCAACGACCT CGGCAAGTGG
TGCGGGGCCG GGTCGTTCCA GAATGGTACT GCCGGTGGCG GTGCGGTCGA GGACGGTGCG
CTGGTCCTCG AATACGACAA CGCCGGGTGG TTCGTCGAAC AGGTCCAGCA AGACGTCAGC
GACTACTCGA CGGTCGTGTT GCGGGTCAGC GGGGCGAACG GCGGCGAGGA GAGCGAGTTC
CTCTTCGACA TGGGCGGTGC GCGCGACCTG CTCGCGAATC TGACCGACGA CTCGATCACG
ACGAGTGTCA CTGACGTCGC GATCGACATG GAGTCGGCCG GGATCGACCC GTCGGGCGGG
GGACTCTCGA TCCGCCTGAA CTTCTGGCAA GGAGGTGCGA GCACGCTCGA AATCGAAGAG
ATCCGACTCG AATAG
 
Protein sequence
MTDEATESIE ASATDHTDET AGNRKDPGLT SSRRTFLGAM ASAGTIGAGL SAATGTAAAG 
VPTPRLHTEG RWIRDPAGND VTLRGMAPAD PGFYRQYHPK SFEEVLEWAT DTDRGWHPNI
VRLPCTQDSI DALGLETYVT EVLRPAVDLL AARDVYALVD FHLIRPYTQD ATETYNEEND
DDLAPIDDVM TTFWDRVAPE FAEDEHVIYE LFNEPTQPAM YGDDAGAFQA WRDAAQPWVD
LVREHAPETP IIIGSPRWTS VTHMAPEYPF DGENLIYAAH IYPDNGPPAD FDQWYGEPAT
EVPVVVTEFG WEPTGGSVDQ GTTSGWGEPF REWVEGYENM GWISWCFDDS WEPAFFESPD
AGANEPWTLK DDADQMGGYI KTWLEATKDQ GIPESAIDDD VAPPVPSGLE VTRSTEISVE
IAWNAVTDEG EAGLSHYNVY VDGERRGQVI DGTATTVDGL EPASTYEVGV SAVDSAGNES
NQTTTVAETI ATDAGQSAFV EHELPGRIQA EDFDEGGQGI AYYDTGSTNE AGADYRETGV
DIGTAVESGY NVGYTETGEW LEYTVTVESG GSYEATVRVA NGADSGGDLR IEVDRAEVAT
QNVWPTGGWE NFEEIRVGEV DIPEGEHVIR IVVETSGWNF DWIEFTGGDG GGEDVTPPTA
PSNLSVTTTT PSSAEIAWDA ATDEGGSGLD HYAVYVDGSL DQQVPTGTTS ATIADLAAET
SYEIGVSAVD GAGNESESVT VDVTTDAGDD TTPPTVPGDL SVDGTTATSI DVAWSGASDA
GTGVDAYAVY VDGSRDQAVK AGTTTATIDS LSAVTTYEVG VSAIDGAGNE SATATVEATT
DQSDDGEDDE DDESPADALV VNDYDGDPSW SSNRNDLGKW CGAGSFQNGT AGGGAVEDGA
LVLEYDNAGW FVEQVQQDVS DYSTVVLRVS GANGGEESEF LFDMGGARDL LANLTDDSIT
TSVTDVAIDM ESAGIDPSGG GLSIRLNFWQ GGASTLEIEE IRLE