Gene Huta_2233 details

Gene Information

Locus tag: Huta_2233
Symbol:
ID: 8384527
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 2279629
End bp: 2281836
Gene Length: 2208 bp
Protein Length: 735 aa
Translation table: 11
GC content: 62%
IMG OID: 644973302
Product: extracellular solute-binding protein family 5
Protein accession: YP_003131133
Protein GI: 257053300
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
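
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the record itself. It assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported gene length) and that the CDS ends in a single stop codon (the gene sequence in this record ends in TAG). The function names gene_length and protein_length are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Huta_2233.
start_bp, end_bp = 2279629, 2281836

length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values agree with the Gene Length (2208 bp) and Protein Length (735 aa) fields of the record.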


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.039331
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAGACG AGTTTTCACT CAGTCCAACC CGTCGGCAGT TGCTCGCATC GCTTGGGACT 
GGCGGCGTGC TTGCTGGTGG GGTTGCGGGT TTCAGGACGA TGGCGGATGT CGTCACCCCA
ATAGCAGCCC AATCGCCCCA GCGCGGGACA CTCGTGGGGA CGATGACCGG CCCTACCCAG
GGACTGCACT TCAACCTCTT TGGAACGTCG AGCGCGGATA TCCCCGCTGC GATGGTGTTT
GACCCGACGG TGAAATACCA TCGTGGTCGC GGCGAATTCA TCCCGGCCGC CGTGACGGAG
TGGACAGTCG AGGGCGAAAC CATGACCCTC TCGCTGCGAC CGGATCTTGT CTGGGACGAC
GGCGATCCGG TGACGGCCCG CGATCTGCGA ACCCAGCTAC TGCTCGGCAA GACCGTCGAC
GACGATCTTT GGGAATACGC CACGGCAGTC GAGACGCTCG GGGAAAAGCG ACTCGCGGTT
CACTTCGACG CGTCGTACAA TCCGGACTTG CTCGAACGCA TCGTTCTCGG AGACCGCCGA
CTGTTCGTCA AACATGAGTA TTACGAGCCG TTTCTGGAGA CGTTACGTGC CGACGGCGAA
ATCGCCGCCA GAGAGGAACT CATGGCATTC ACGCCTACGT CGATGGGCGA ATTCGCCGCA
AGCGGCCCCT TCACTGTCGA CTCGATCGAC ACCGAGGGAA TCAAACTCGA ACTGAATCCG
ACCCATCCCG ACAGCGATTC CATCGGGTTC CAGTCCTACG AGTTTCGCGC CTTGCTCGGA
ACAGACACGG GGTTTCAGGC ACTCGAAACC GGCCAGGTCG ACGTGATCGC CTCGATGTCG
ACGCTACCGA ACCAGCCCAT CGAATTGCCA TCATCCGTTG TACAGGTGCG GATCCCCGAT
TATTGGGGGT TGGGCCTCTG GCCGAACCAC GATGTCGAGC CGCTGGACGA CCGGGCCGTC
CGACAGGCAA TCGCCTTTGT CATCGACCGC TCGGCCGTGA CCCAGGCGAG TGGCCGGGAA
ACGAACCAGC CGGTAGAGAC GCCAAGTGGC CTGCCGGCGT CCGCTGTTGC GGACTGGATA
GCGCAGGATT CACTGGAGAC GTATGTCGAT CCGGATAACG AGGCTGCGAC AGACGTCCTG
TCGAGGGCAG GATATACTCG CGAGGATGGA ACCTGGACGG CGGAGGACGG GACCCCCCTC
GGCTTTTCGA TATCGGTCCC CGAGTTCATC ACCGACTTCG TCAACGCGAC GGATTCCGTC
GCCGAACAAC TCCGTTCGTT CGGCATCGAA GCCGACGTTC GGGAAGTGAC GTTTCGGGAC
ATCTTCGCCG GTGACTTCGA CGTCGTGTCG GGGTGGTGGC TACCGGGGAG CATCGACAGT
ACACACCCGT ACGTCGCGTA CAGGTTCGGG TTCGGCATGG GGATGGTCCA GCCCGATCTA
CTCGAGTATC CAGGGTTGGA CAGCACCGTG ACTGTGCCCG CGGACGGGAA CAACGGCTCG
ATAGACGTCA CTCCCCAAGA CGAACTGGAA GCGCTGGCCA CGACTGTCGA CCGCGGCGAG
GCCGAATCGA TCGTGCAGCG ACTCGCAAAG GTGTACAACG CGGACCTTCC GATGCTCCCA
CTCTACCGGA GTCAATATAC GTCGTTCATC GATACGAGCG CCGTCGACGC ACCGGCGGAA
GGTTCACCGA AGTTCCAGAT CACGTGGCCC CCGCACTGGC TTGTTCGAAC CGGCGATCTG
CACAGTCCAG GCTGGGCCGA GAACGAAGCC TCGACACCCA CCGAGACAGT GACAAATACG
CCGACTGCGA CGGACACGGA ACGGGAGACG CCATCGGCAA CCGAAACCGA GACGACACAA
CCGACGGAGA CGGCGACAGA ATCCAGCCCG TCGACACCGA GCGAGACGGC GACGAAAACG
GTGACGCCGA CAGCAACCCC GACAGACCCA AAAACAGCGA CAGTCAGCGA GACAGTCACG
AACACTCCGG CAACGACCGA GACAGACCGA TCGGCCACTG AATCACGGAC GACGACGGTC
ACGGAGATAC CGACCTCGAC GACCGATGCT GAATCAGAAG GTGGCCAGAA CACCCAGGAC
AGCACCGACG GAGCCACCAC GACCAACGGT CCGGGGTTCG GTCTCCTCTC GTCGGTGGCT
GGATTAGGTA GCTACGCGCT CTATCGGGCC AGGAAACGGG ACGGGTAG
 
Protein sequence
MSDEFSLSPT RRQLLASLGT GGVLAGGVAG FRTMADVVTP IAAQSPQRGT LVGTMTGPTQ 
GLHFNLFGTS SADIPAAMVF DPTVKYHRGR GEFIPAAVTE WTVEGETMTL SLRPDLVWDD
GDPVTARDLR TQLLLGKTVD DDLWEYATAV ETLGEKRLAV HFDASYNPDL LERIVLGDRR
LFVKHEYYEP FLETLRADGE IAAREELMAF TPTSMGEFAA SGPFTVDSID TEGIKLELNP
THPDSDSIGF QSYEFRALLG TDTGFQALET GQVDVIASMS TLPNQPIELP SSVVQVRIPD
YWGLGLWPNH DVEPLDDRAV RQAIAFVIDR SAVTQASGRE TNQPVETPSG LPASAVADWI
AQDSLETYVD PDNEAATDVL SRAGYTREDG TWTAEDGTPL GFSISVPEFI TDFVNATDSV
AEQLRSFGIE ADVREVTFRD IFAGDFDVVS GWWLPGSIDS THPYVAYRFG FGMGMVQPDL
LEYPGLDSTV TVPADGNNGS IDVTPQDELE ALATTVDRGE AESIVQRLAK VYNADLPMLP
LYRSQYTSFI DTSAVDAPAE GSPKFQITWP PHWLVRTGDL HSPGWAENEA STPTETVTNT
PTATDTERET PSATETETTQ PTETATESSP STPSETATKT VTPTATPTDP KTATVSETVT
NTPATTETDR SATESRTTTV TEIPTSTTDA ESEGGQNTQD STDGATTTNG PGFGLLSSVA
GLGSYALYRA RKRDG