Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_1022 |
Symbol | |
ID | 8383295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 987358 |
End bp | 988314 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644972086 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003129938 |
Protein GI | 257052105 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0725] ABC-type molybdate transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACAAC GGAACGATCA CCCGGGATCC GGGGGCCTGG AGCGGGTTTC GCGCCGCGGG TTCCTCGCGG GAGCGGCCAC GCTGGGTGTG GGGACACTTG GAGGGTGTCT CGCCAGCAGC GCGAGCACGG TGTCGGTCCT CTCGGCGGGG AGTCTGGCGT CGGCTTTCGA GGAGCGGGTC GGGTCGACCT TCGAGGAAGC GACTGACTTC GGGTTCCAGG GGACGTACTA CGGGTCCCGT GCAGTCATGC GACTGGTCGA GGACGGCCAG CGTCGCCCGG ACGTGGTCGT CAGTGCCGAC GCGGAACTGC TTCGTGAGCG ACTCCAGCCG ACACTCGCTG ACTGGGACGT GGTCTTCGCG ACGAACGCGC TCGTGATCGC GTACAACCCC GAGACCGACA TCGGGGCCCG ACTCGCCGAC GGCGAACCCT GGCACGCGGT ACTGGCCGCT GCGGACGGAC GGATCGCACG GACCGATCCG GACCTGGATC CGCTCGGCTA TCGGGCGATC CAGCTGTTCG ACCTCGCCGA ATCGTACTAC GACGAGCCCG GACTGGCCGG GGCCCTCCGG GCCAACACCG TGATCGAGCC CGAGGAACCG CAACTACTCG CGGCCGTCGA GAGCGGCGAG CGGGCCGCCG CCGTCGCCTA CCGAAACATG GCCCACGACT GGGACGTGCC AAGCGTCGAA CTCCCGCCGG AGCTGAACTT CGCCGACCCC GGGCTGGCCG ACCACTACGC CACCGCGACC TACACGACCG AGGACGGCAC CTCGCTGCCC GGGCGGCCGA TCCGATACAA CGCGACCGTC CCGGCGAACG CCGAGCACCC CGAGGCGGGC CGGCGGTTCG TCCGGTTGCT CGCCGAGCGG CCGGCCCCTC TCCGGGAGTC GGGGCTGGTC GTGCCCGACG GCGTTCCGAA GGGGCACGGA GACGTGCCGG ACGGGGTGCT ACCGTGA
|
Protein sequence | MEQRNDHPGS GGLERVSRRG FLAGAATLGV GTLGGCLASS ASTVSVLSAG SLASAFEERV GSTFEEATDF GFQGTYYGSR AVMRLVEDGQ RRPDVVVSAD AELLRERLQP TLADWDVVFA TNALVIAYNP ETDIGARLAD GEPWHAVLAA ADGRIARTDP DLDPLGYRAI QLFDLAESYY DEPGLAGALR ANTVIEPEEP QLLAAVESGE RAAAVAYRNM AHDWDVPSVE LPPELNFADP GLADHYATAT YTTEDGTSLP GRPIRYNATV PANAEHPEAG RRFVRLLAER PAPLRESGLV VPDGVPKGHG DVPDGVLP
|
| |