Gene Huta_0862 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_0862 
Symbol 
ID8383135 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp828509 
End bp829702 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content62% 
IMG OID644971926 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003129778 
Protein GI257051945 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATTTACC TCCACCACAT GGCACAGACC GAACCGGCAC AAAAGCGGTT TGAACTCCTG 
CACCAATTAG TGGGGGCTGA GTGGGGCGGG GCGGCGCTGA ACGCGCTCGT TGATGGCTAC
GAGCGCCGAA CGTCGTATGC CGTCGAGGAG ACGACGACGT CGGCCGACGA TCTCTCTATC
CGGGTGAAGA TCCGGATCCT TCAGGAACGG GCCCCGGACG CCTGGATCGA ATGGCCGGGC
CAGCACATCA CGCCGTACAT CGATACCGGT GCAGTCCGTG ACATCACCGA TGTCTGGGAA
GAGAACGGGC TCGTGGATGC CTTCACCGAG GGGGCCAAAG AACAGGTCCG CTTCGACGGC
AGTTACTACG CAATCCCGCT GAACATCCAC CGGATCAACA ACCTGTTTTA CAACGTCGAG
ATGGTCGAGC GAGCCGGTGT CAATATCGAC GTCAACTCAC CGCAAGCGTT CGTCGACGTC
CTCGAACAAC TCGATGATGC CCTCGACGTC GCGCCGTTCT TGATGGCACT CCGGAACCCC
TGGGGAGCGA TCCACGTCTG GGAGACGATC GTCCTCGGGG AGACCGATCC CCAGACGTAT
CGGGACATCA TCAACGGGGA TGCCGACCGC CACCGCGATG CCATCGCGTC GACGCTTTCG
ATTCTGGCAC GCTATCTGGA ATTCGCCAAC GACGACGCGC AGTTCTCCTC GCTGCCCGAC
GCCAACGCCC ACTTTGTCGA CGACGAGGGG GCGCTGTTCC TGATGGGCGA CTGGGCTGCC
AGCGCGTACG ATCAGGACGA CTACGGCGAG ACTTGGGATA CGATCCCGTT CCCGGGGACT
GCGGGCGAGT ATCCCATCAA CATGGACGCG CTCATCCCGT CGAGTACTGC CGGCGACACG
ACGGCGATCG ACGAGTTCCT CGCCTACGCC GGCTCCCGCG AGGCACAGAC CGCGTTCAAT
CGTCACAAAG GTTCGACCCC ACCCCGGACC GACACCGATC GCTCGGAGTT CACGGACTTC
CTTCAGGATC AGCAGGCGGA CTTCGACGCC GCCACCTCAC AGGTCCCGTC GATGGCCCAC
GGTCTGGCGG TCCATCCCGA GCAACTCATC GAAGTCAAGT CCACGATGGC GGAGTTCGTC
TCCGATCCCG ATCCGGCGAC GACCGCCGAC AGACTCGCCG ATATCCTCTC TTGA
 
Protein sequence
MIYLHHMAQT EPAQKRFELL HQLVGAEWGG AALNALVDGY ERRTSYAVEE TTTSADDLSI 
RVKIRILQER APDAWIEWPG QHITPYIDTG AVRDITDVWE ENGLVDAFTE GAKEQVRFDG
SYYAIPLNIH RINNLFYNVE MVERAGVNID VNSPQAFVDV LEQLDDALDV APFLMALRNP
WGAIHVWETI VLGETDPQTY RDIINGDADR HRDAIASTLS ILARYLEFAN DDAQFSSLPD
ANAHFVDDEG ALFLMGDWAA SAYDQDDYGE TWDTIPFPGT AGEYPINMDA LIPSSTAGDT
TAIDEFLAYA GSREAQTAFN RHKGSTPPRT DTDRSEFTDF LQDQQADFDA ATSQVPSMAH
GLAVHPEQLI EVKSTMAEFV SDPDPATTAD RLADILS