Gene Huta_1022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_1022 
Symbol 
ID8383295 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp987358 
End bp988314 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content71% 
IMG OID644972086 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003129938 
Protein GI257052105 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0725] ABC-type molybdate transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACAAC GGAACGATCA CCCGGGATCC GGGGGCCTGG AGCGGGTTTC GCGCCGCGGG 
TTCCTCGCGG GAGCGGCCAC GCTGGGTGTG GGGACACTTG GAGGGTGTCT CGCCAGCAGC
GCGAGCACGG TGTCGGTCCT CTCGGCGGGG AGTCTGGCGT CGGCTTTCGA GGAGCGGGTC
GGGTCGACCT TCGAGGAAGC GACTGACTTC GGGTTCCAGG GGACGTACTA CGGGTCCCGT
GCAGTCATGC GACTGGTCGA GGACGGCCAG CGTCGCCCGG ACGTGGTCGT CAGTGCCGAC
GCGGAACTGC TTCGTGAGCG ACTCCAGCCG ACACTCGCTG ACTGGGACGT GGTCTTCGCG
ACGAACGCGC TCGTGATCGC GTACAACCCC GAGACCGACA TCGGGGCCCG ACTCGCCGAC
GGCGAACCCT GGCACGCGGT ACTGGCCGCT GCGGACGGAC GGATCGCACG GACCGATCCG
GACCTGGATC CGCTCGGCTA TCGGGCGATC CAGCTGTTCG ACCTCGCCGA ATCGTACTAC
GACGAGCCCG GACTGGCCGG GGCCCTCCGG GCCAACACCG TGATCGAGCC CGAGGAACCG
CAACTACTCG CGGCCGTCGA GAGCGGCGAG CGGGCCGCCG CCGTCGCCTA CCGAAACATG
GCCCACGACT GGGACGTGCC AAGCGTCGAA CTCCCGCCGG AGCTGAACTT CGCCGACCCC
GGGCTGGCCG ACCACTACGC CACCGCGACC TACACGACCG AGGACGGCAC CTCGCTGCCC
GGGCGGCCGA TCCGATACAA CGCGACCGTC CCGGCGAACG CCGAGCACCC CGAGGCGGGC
CGGCGGTTCG TCCGGTTGCT CGCCGAGCGG CCGGCCCCTC TCCGGGAGTC GGGGCTGGTC
GTGCCCGACG GCGTTCCGAA GGGGCACGGA GACGTGCCGG ACGGGGTGCT ACCGTGA
 
Protein sequence
MEQRNDHPGS GGLERVSRRG FLAGAATLGV GTLGGCLASS ASTVSVLSAG SLASAFEERV 
GSTFEEATDF GFQGTYYGSR AVMRLVEDGQ RRPDVVVSAD AELLRERLQP TLADWDVVFA
TNALVIAYNP ETDIGARLAD GEPWHAVLAA ADGRIARTDP DLDPLGYRAI QLFDLAESYY
DEPGLAGALR ANTVIEPEEP QLLAAVESGE RAAAVAYRNM AHDWDVPSVE LPPELNFADP
GLADHYATAT YTTEDGTSLP GRPIRYNATV PANAEHPEAG RRFVRLLAER PAPLRESGLV
VPDGVPKGHG DVPDGVLP