Gene Huta_1244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_1244 
Symbol 
ID8383519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp1216792 
End bp1218558 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content64% 
IMG OID644972303 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003130153 
Protein GI257052320 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.863592 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACTCCT CGGATACGCC CAGTGGTCTC TCCCGCCGCA GTCTACTTGG CGGTGCCAGC 
GCTGGCTTCG CCGCTTCGAG TGCCGGCTGC CTCCAGTACG CCCGCAGTCT CGTCGATCGA
GAGTCCCCCA AGCAGGTGTC AGTGCGGATC AAGACTGTCC CGGCAGACGA AGACGAAGCA
GCGGTCCAGA TTGCACGGGC CTTGCAGAAA AACATGGAGA CGGTCGGCAT CGACGCGTCG
ATCGTCCTGA TGACCGGAGC AGAGCTGCTT CGGGACGTGC TCCTCAACGA ATCGTTCGAC
ATCTACGTGA GTCAGTACCC CTCACATCAC GATCCTGATT TCCTGCGACC GGCGCTACAC
TCGACGTTCG TCACCGAGCA GGGATGGCAA AATCCGTTCG GCATCTCCGA TCTCGATCTC
GACGAGCAGT TGACCGAACA GCGAACGGCC GTCGGAGCGG AGCGACAGCG TGCCGTCGCC
GGCGTCGTCG GGTCCGTCAC GGAGATCCAG CCGTTCGCGA TGGTGTGTTT CCCCACGCGA
ATCACCGTCG TCCGAAACGA CCGCTTCACC AACTGGAACG GCCTTGAGGA CCCGATCAAC
TACCTGGCGT TGCGACGGAC CGAAGACGCC CCCGACGGCG AACCCACGCT CCGGATCGCG
TTAACGGACG ACCGAATCAC GAAGAACTAC AATCCCCTTG CCGTGGAGTA CAGGCGGCCG
AACCCCCTGA CTGACCTCCT TTACGATCCA CTCGCTCGCC GTTCCGATGG AGAGATCCAA
CCCTGGCTGG CTGGTGACGT CACGTGGCAG TGGACCGAGG ACACAGCCGT GACGGCGACC
GTCAAACTCC GGGACGGGCT GACCTGGCAC GACGGCCAAC CCCTGACCGC AGCCGACGTC
GCCTTTACGT ACCGATTCCT CGACGACACG AGCCTCGGGA CAGGAAACAT GAACGTCCCC
TCGCCCCGGT TTCGGGGGCG GACCTCACTG GTCGAGTCGG TCGAACGCCT CGACGCCCAG
ACAGTCCGGT TGGAGTTCGG TGACACGGCG GAAGCCGTCG CGGCCCGTGC GCTGACTGTC
CCGATCCTGC CGTCGCACGT CTGGGAGGAA AAATCAGCGG CCACGAACAT CGCCGGCATC
AACATTTCCG AAGGAGTCAC GCAAGCACTG ACCTGGGCCA ACCCCGATCC GGTGGGAAGC
GGCCCGTTGC GATTCGAGTC TCGGACGCCG GGGGAGCGGG TCGTATTCTC GCGGTTCGAC
GACCACGTTC TGGTCGGTGG TGGCCCGGAC CGCGTTTCCG TCCCCTTCGA ACGCTTCGTC
GTCCAGATCG CGCCGTCGGA CACGGCAGCA GTCTCACTGG TTACCGACAG CACGGTCGAC
GCAACCGGCG ATCCGATCCA CCCGAAGGTT CTCGACAGGG TGACCGAAAA CGACCCCATT
GAGGTCCTCT TCGGGAACTC ACGGTCGTTC TATCACGTCG GATTCAACAC GCGGCGTGAG
CCGTTCGGCA ACGTCCGGTT TCGGCAAGCA GTCGCCCGAC TCCTCGACGC CGAACACATC
GCCTCGTCGG TGTTCGACGG CCATGCCACG CCAGCAGCGA CGCCCCTGGC CGGCACCGAC
TGGGAGCCAC CGGAGTTCGA ATGGGACGGC ACAGATCCGG TCGTTCCCTT TGCCGGGACG
GACGGTGAAC TCGACATTTC GGACGCAAGA GGCGCGTTCA GGGAAGCCGG GTATAGGTAC
GACGGCGACG GGAGACTCCT GAAATGA
 
Protein sequence
MDSSDTPSGL SRRSLLGGAS AGFAASSAGC LQYARSLVDR ESPKQVSVRI KTVPADEDEA 
AVQIARALQK NMETVGIDAS IVLMTGAELL RDVLLNESFD IYVSQYPSHH DPDFLRPALH
STFVTEQGWQ NPFGISDLDL DEQLTEQRTA VGAERQRAVA GVVGSVTEIQ PFAMVCFPTR
ITVVRNDRFT NWNGLEDPIN YLALRRTEDA PDGEPTLRIA LTDDRITKNY NPLAVEYRRP
NPLTDLLYDP LARRSDGEIQ PWLAGDVTWQ WTEDTAVTAT VKLRDGLTWH DGQPLTAADV
AFTYRFLDDT SLGTGNMNVP SPRFRGRTSL VESVERLDAQ TVRLEFGDTA EAVAARALTV
PILPSHVWEE KSAATNIAGI NISEGVTQAL TWANPDPVGS GPLRFESRTP GERVVFSRFD
DHVLVGGGPD RVSVPFERFV VQIAPSDTAA VSLVTDSTVD ATGDPIHPKV LDRVTENDPI
EVLFGNSRSF YHVGFNTRRE PFGNVRFRQA VARLLDAEHI ASSVFDGHAT PAATPLAGTD
WEPPEFEWDG TDPVVPFAGT DGELDISDAR GAFREAGYRY DGDGRLLK