Gene Huta_2585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_2585 
Symbol 
ID8384890 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp2649797 
End bp2651608 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content63% 
IMG OID644973660 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003131480 
Protein GI257053647 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTAGTG AACAGTCTTC AGGGAGTTTC GAGGACGTAG TGAATCGCCG CAACTTCATG 
CGGCTGGCAG GCGCGACCGG CGCGACCATG CTTGCCGGGT GTCAAGGTGA GGAGACAGAC
ACGGAAACGG AGAGCGGTGG AAACGGTGAC ACCGAGACGG ACAGCGGTGG CGACACCGAC
GTCTACGACG TCACGGTCGA ATCCGCTGCC GCCCGTGGGA TGGACCTGCT CGACTGGAAC
GCCCAGTTCG CCGGGTGGCC GAACATCTGG GGGCGCTGGC TCGCTTACGA GCGCTATGCC
CAGTACAACA TGACGGAAAA CGAGTGGATC CCCCGGCTCA TCCAGGACTG GTCGGTCGAC
GGGACGACTG TCACCCTCAA CATCCGTGAG GATCACACCT ACGCGAACGG CGACCAGGTG
ACGGCCGACG ACATCAAGGC GAACACCGTG ATGAACCTCG CGACGGGGGC CGCGTTCGCG
GATGTCTTCG ACTCGTTCAA CGCACCTGAC GACAAGACGC TCGAGATCGA GACCACAAAG
GAAGTCAATC CGGATATCCT CGAGTTCACG CTCCTCTCCC AGCTCCAGCA GGCGAAGATG
GCCGAGCCCT ACGACGAGCT CTATCAGCGC TACTGGGAGG ACGAGGAGGA GGGCGTCTCC
AGCGACATTC AGGGTCGTGA GCCCAATTCG CCCGACTACG TCTCCGGGAT CTTCGGGCAG
CAGAGCAAGG ACGACGAGCA GTACATGATG AACCGTAACC CCGAGCACCC TGACGCCGGG
AACGTCAACT TCGAGCGTTA CCGGTTCCCG AACTACCCGG GCAACGAGGG TAATTGGGAA
GCCATGATCG GTGACGATAT TGACACGATC ATGAGCGCGT TTACTCCGTC GAACATCCGG
GCGGAGCTTG CCGACCACTG GCAGGAATAC AACTTCCCCG GCTACTGGGG TGTCGGCTAC
GTGTTCAACC ACGACGAGGA AGCCGCGCCG CACATCTCGA AGCGATCGGT CCGACAGGCG
ATCACGCACG CCATCAACCG GGAGGACGTC GTCACGGCCG CGGGCGCAGA GATCAAGGAG
GCCTTCCCCA CTCCGGCCGC CGTCTCCGCG AACGTCCAGG ACGAGTGGCT CGACGTGGGT
GGCCAGTTCC CCGCGATGCA GGGTGGCGCA GAGCAGGCCG CCGAGATCAT GCAGGACGCC
GGCTACGAGA AGAACGGCAA CGGCAACTGG GCCATGGACG GCGAGACCGT CACGTTCCAG
GTCGTCGTCC CGAGTAGCTG GAGTGACTGG GTCACGGCGA CCCAGGCTGT GGTCCCACAG
CTCCAGGACG CCGGGTTCGA CGCGGAGATG AACCGTGTCG ACAACATCAA CACGGTCGTC
GGAGAAGGGA ACTTCAAGAT GGCGGCCCGT CCGTGGTCGC CCGGCAACGC CCGCTCCTCG
CACCCGTACT TCCCACTGAA CTGGGTCTTC GGGCGCGCGT ACAACAACGC CCACAGCTAC
CCGGGTGCCG AGGCTGGCAC CGAGATCGAG GTTCCGGCGA TGGACGGAGA CGGTACCATG
TCGGTCGACG TCCAGCAGCG CCTCGACGAC CTCTCGACCT CGAGCGGCGA GGAAGCCCAG
TCCATCGTTC AGGAGCTCGC GTGGGTCTCC CACCAGGACC TCCCGTACCT GCCGATGATC
AACAAGGTAG AGCAGTCGTG GATCAGTACG CGGCGGCTGT CGGCTCCCGA AACTGACGAC
CCAGCAGGCA ACGTCAAGTG GCCGACGTTC TACGCCCCGC GTGTCGGAAA GATGCAGTGG
CAGGGCGAGT AA
 
Protein sequence
MASEQSSGSF EDVVNRRNFM RLAGATGATM LAGCQGEETD TETESGGNGD TETDSGGDTD 
VYDVTVESAA ARGMDLLDWN AQFAGWPNIW GRWLAYERYA QYNMTENEWI PRLIQDWSVD
GTTVTLNIRE DHTYANGDQV TADDIKANTV MNLATGAAFA DVFDSFNAPD DKTLEIETTK
EVNPDILEFT LLSQLQQAKM AEPYDELYQR YWEDEEEGVS SDIQGREPNS PDYVSGIFGQ
QSKDDEQYMM NRNPEHPDAG NVNFERYRFP NYPGNEGNWE AMIGDDIDTI MSAFTPSNIR
AELADHWQEY NFPGYWGVGY VFNHDEEAAP HISKRSVRQA ITHAINREDV VTAAGAEIKE
AFPTPAAVSA NVQDEWLDVG GQFPAMQGGA EQAAEIMQDA GYEKNGNGNW AMDGETVTFQ
VVVPSSWSDW VTATQAVVPQ LQDAGFDAEM NRVDNINTVV GEGNFKMAAR PWSPGNARSS
HPYFPLNWVF GRAYNNAHSY PGAEAGTEIE VPAMDGDGTM SVDVQQRLDD LSTSSGEEAQ
SIVQELAWVS HQDLPYLPMI NKVEQSWIST RRLSAPETDD PAGNVKWPTF YAPRVGKMQW
QGE