Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_1121 |
Symbol | |
ID | 8383396 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 1095930 |
End bp | 1097090 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 644972182 |
Product | glycosyl transferase group 1 |
Protein accession | YP_003130032 |
Protein GI | 257052199 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.606067 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGACG TGTCTTTAGT GGGATACGCG GATATTGGGT CACAGCCTGG ACAAGATTTT ACCGAATTAG CTAACGCCTT ATTCGATGAA AGAATTCTAT CGCAGGCATA CTGTAGAGGT ATTGAGGGTA CTGAGTTGCC CGAGGAGCGT GTTACCACAC CAATTCCATT CGGTCGACGT TTCCCGAGAT TAATCAATGG TATCGGGAGA CATGTTTACG ATATACAGAA CAGATATTAT TCTGAGTATA TGTTTGACTA TTATTCAGCA AAAAGGATCT CAAAGACGGA CGATATTCAA CTCCATTTTT CTCCAGGATA TCCAAAAACA TTAGAAAAAG GAAAAGAACA CTCTGATCAA GTAATAGTAC GAACTGCTAC AGAATATGAA AGAGAAAAAA AGAAGAGATT GGCCCCAGAA TATAAAAAAT ATACAATTAA TAGCTACCCA ATACCCAAAA AGAGAATAAA AAGAAGAGAG GAAACGATAA ACAAGGCAGA CAAAGTCATA GCTATCTCAA AATTTGTTAA AGAATCGCTT ATAAACGGAG GTGTACCGGA AAATAAGATA GATCTTGCTC CGTTTGGAGT GAATTCAGAT GACTACCCCA CCAAAACTGA AGACATAGAT AAATTCACAG TTTTATTTAT TGGAAGTATA AATATAAATA AGGGGGTTCC TTACCTTCTT GATGCGTGGC AAAAAAATGG ATGGGAATAT GATAACGAAG CCCAACTCAT TCTTGGTGGT AGGATCTCTC CAGAATTGGA AGAGATCATT GAGGAGAAAA ATATAAACAA CCTAAAGACA CCCGGGTATG TTGAACCTCG AAATTACTAC CAAAATGCGT CCGTGTTTTG CTTTCCTTCC CTTTCGGAAG GATTCGGAAA AGTAATACTT GAAGCGATGG CCTCAGGACT CCCTGTTATT TCTACAGAAT ATACCGGAGC ACGAGATGTA ATGACTGACG GTGAGGAGGG ATATATTGTG GAAACTAGAG ATTCAGACGT GATTGCCAAT AAACTTCAAT ACTTGCGAGA TAATCCAGAA GAAAGAAAAC AGATGGGAGA TAAAGCATTA CAAACTGCTA AAGAAAACCC ATGGGATAAG CACACTAATA AAATTATTGA TATAATTTCC GACGATAACA ATTCTAAGTA G
|
Protein sequence | MTDVSLVGYA DIGSQPGQDF TELANALFDE RILSQAYCRG IEGTELPEER VTTPIPFGRR FPRLINGIGR HVYDIQNRYY SEYMFDYYSA KRISKTDDIQ LHFSPGYPKT LEKGKEHSDQ VIVRTATEYE REKKKRLAPE YKKYTINSYP IPKKRIKRRE ETINKADKVI AISKFVKESL INGGVPENKI DLAPFGVNSD DYPTKTEDID KFTVLFIGSI NINKGVPYLL DAWQKNGWEY DNEAQLILGG RISPELEEII EEKNINNLKT PGYVEPRNYY QNASVFCFPS LSEGFGKVIL EAMASGLPVI STEYTGARDV MTDGEEGYIV ETRDSDVIAN KLQYLRDNPE ERKQMGDKAL QTAKENPWDK HTNKIIDIIS DDNNSK
|
| |