Gene Information |
Locus tag | Huta_2361 |
Symbol | |
ID | 8384660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2408594 |
End bp | 2409664 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644973434 |
Product | hypothetical protein |
Protein accession | YP_003131260 |
Protein GI | 257053427 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0163765 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGACG CGAGCGGGGA CGTCGGCGAG GACCGGCCGA GCGACACGAT GCGCGAGCGG GTGCCGGCGA GCAGCCGCAA GTTCTGGCTG TTGCTCGTCG TGGATCGCTC GGCCGTCGCG GCCGGGATCG TGGGCGCGAT CTTTCTCGCG CTGGTCGCCA TCGGTCAGTT TCATCCCGCC GGCACGCCGG CCCTGTTCAC CCAGGGCGAT CCACTGGAGA CGCTGTTTCA GGGGCTGCTC ACCGCGATCA TCACCGGCGT CACGCTCGTG CTCACGCTGA GCCAACTCGT CCTCTCACAG GAACAGGGGC CGATCGGCGA CCAGCGCGAG CGCATGGAGG GCGCGATGGC GTTCCGTCGC GACGTCGAGG ACGTGATCGA GGAACCGGTG AGTCCGGCCC AGCCCTCGGC ATTCCTGCGG TCGCTGGTCA CACTCACGCG CCGCCGGGCC GAGGCCGTCG GGGACGCCGT CGCGGCCATC GACGATCCCG CGCTCACCGA ACAGGTCGAC CCGTTCGTCG ACAGCGTCGT GGAGAACGCG GATACGGTGG CCCACAACCT CAAGGGGTCA CAGTTCGGCG AGTTCGACGT CGTCTTCTCG GCGCTCAACT ACAACTACTC CTGGAAGCTC TACGCTGGGC GGCGAATTCG GGCCGAGCAC GACGACGTTC TCACCGAGGC TGCCGACGAC GCCCTGGCGG AGTTGATCCA GGCGCTCGAA CTGTTCGGCC CGGCGCGCGA GCACTTCAAG ACGCTGTACT TCCAGTGGGA GCTCATCGAT CTCTCGCGGG CGATGCTGTA CGCCTCGATC CCGGCGCTTC TCACCGCTGT CTCCGGTGTG CTGTACCTCG ATCCGACGGT GTTGTCGGGG ACGCTGCTGG GCGTTCGAAC CGGTGTGCTC GTCGTGAGCG GGGCCGTCGC GGTCTCGCTG CTCCCGTTTG CGCTCCTGCT CTCCTACGTC CTCCGCATCG TGACGGTTAC CAAACGAACC CTCTCGATCG GCCCGTTCAT CCTGCGAGAA ACCGACCGTA CCCATGACCT CACGGCTGAC ACCGAGACGG ATCGTGAGTA G
|
Protein sequence | MGDASGDVGE DRPSDTMRER VPASSRKFWL LLVVDRSAVA AGIVGAIFLA LVAIGQFHPA GTPALFTQGD PLETLFQGLL TAIITGVTLV LTLSQLVLSQ EQGPIGDQRE RMEGAMAFRR DVEDVIEEPV SPAQPSAFLR SLVTLTRRRA EAVGDAVAAI DDPALTEQVD PFVDSVVENA DTVAHNLKGS QFGEFDVVFS ALNYNYSWKL YAGRRIRAEH DDVLTEAADD ALAELIQALE LFGPAREHFK TLYFQWELID LSRAMLYASI PALLTAVSGV LYLDPTVLSG TLLGVRTGVL VVSGAVAVSL LPFALLLSYV LRIVTVTKRT LSIGPFILRE TDRTHDLTAD TETDRE
|
| |
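The length fields in this record can be cross-checked against the coordinates with simple arithmetic. The sketch below is a minimal illustration, not part of the source database. It assumes the start and end positions are 1-based and inclusive, which matches the listed gene length, and that the final codon of the unspliced CDS is a stop codon, which matches the trailing TAG in the gene sequence. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp < 3 or gene_len_bp % 3 != 0:
        raise ValueError("CDS length must be a positive multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    length = gene_length(2408594, 2409664)
    print(length)                  # 1071, matching "Gene Length"
    print(protein_length(length))  # 356, matching "Protein Length"
```

Passing the full gene sequence from the record (spaces included) to `gc_percent` gives a value that can be compared with the listed GC content.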