Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HY04AAS1_0861 |
Symbol | |
ID | 6743672 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | - |
Start bp | 811074 |
End bp | 812474 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 642750667 |
Product | glycosyl transferase family 39 |
Protein accession | YP_002121526 |
Protein GI | 195953236 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00631433 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTATT TGACATCGCG TAGGCATTAT CTTTTGCTTT TTTTAGTAAT AGCACTTTAT TTTGTAAACC TGGGCACAAA TCAAGTGTTT GCTCCCAACG AGTCTTTTTA TGCAGATGCT GCTTTAAACA TGTTGAAAAC TGGCAATTAT CTAGTTCCCA TCTACAACGA CCATATAAGG CTTGCCAAAC CACCCCTTTT ATACTGGCTT ATAGCGTTAT CTTTTAAAAT GTTTGGGGTT TCAAGCTTTT CTCTGAGATT ACCCTCCGCC CTTGCCGGGG CTTTTTGTGT TTTATTTACT TATTACTTTG GTGAAAAAAT CACAAAAAAC CTTGGGTTAT ACGGCGCTAT TGCCACAGGT ACGGCTCTTG AGTTTGTATC AAACACAAGG TACGCTTCTC CCGAAGTACT GTTTTGCTTT TTCATAAGTA GCTCTATATA TTTTCTTTAT TTTTATCTTA AAAGCCAAAA GCTTATATAT CTTATCGTGA GTATCGTTAT GTCTTCTTTG GCTATGCTTA CCAAAGGTCC TGTGGGGGTT CTGATAGTGG GTTTTGTGGG CTTTTGGTAT ATTGTGTTTG AAAATCCTAA GCTTTTAAAA GATTATAAGC TATATATGGC GTTTGTTATA GCTCTTTTTA TAGGTGGGCT TTGGTATATA GAAGTCTTAA ATTCACCTTA CAAAGAGCTT CTTATACACA AATTTTATGT AGAAAACATA AAGAGAATAA ACGGTTTAGA ATCTGACCCT TGGTATTTTT ACTTAAAAGA TACGATAGTG TCTTTTGCCC CTTTTTCTAT TATACTTTAT TTGTTTTTTC TTAATCTTAA AAGCCTAAAA ACTATAAAAC TTGCAAGTGT ATGGTTTTTT TCACTTTTTA TCGTGTTTTC CTTGATAAAG ATGAAGATTC CAACATATCT TATACCAGCT TATCCAGCGA TGGGTTTTAT AGTGGGTATA AACGCCATGA CTAAAAAATA CTTTAGATAC ATGTTTTGGA TACTATTTTG TATTGGGCTT TTGATATACT ATGTTTTAAT TATTTTTGTT TTATCATCTA TAGAACCTTA TAGACCATAT AAAGAAATGG GCGATGTTAT AAGGCTTTAT AAAGATAATA AACCGGTTTA TTATGAAGGG TATTTCATCC ATCAGCTACC GTTTTATGCT GATGCTACGA TTAAAAAATT TACTAAAAAT TCAAAACCAG GTATTTTAAT AACGGAAACC CCATGTAAAA AACCTTTATG GGAAGGTTAT GTGTATACAG ATTCTGAGTC AAGGTTTTTG GTGTTTTTAA AAGATATTTA CAAATTTAGA AAAGGTAATA TCAAATATTC TAAATTTAAA AAATTTTATA TATGTGATTA TAAAGGCCCC ACCAAAATGG GGAGGCCTTA A
|
Protein sequence | MDYLTSRRHY LLLFLVIALY FVNLGTNQVF APNESFYADA ALNMLKTGNY LVPIYNDHIR LAKPPLLYWL IALSFKMFGV SSFSLRLPSA LAGAFCVLFT YYFGEKITKN LGLYGAIATG TALEFVSNTR YASPEVLFCF FISSSIYFLY FYLKSQKLIY LIVSIVMSSL AMLTKGPVGV LIVGFVGFWY IVFENPKLLK DYKLYMAFVI ALFIGGLWYI EVLNSPYKEL LIHKFYVENI KRINGLESDP WYFYLKDTIV SFAPFSIILY LFFLNLKSLK TIKLASVWFF SLFIVFSLIK MKIPTYLIPA YPAMGFIVGI NAMTKKYFRY MFWILFCIGL LIYYVLIIFV LSSIEPYRPY KEMGDVIRLY KDNKPVYYEG YFIHQLPFYA DATIKKFTKN SKPGILITET PCKKPLWEGY VYTDSESRFL VFLKDIYKFR KGNIKYSKFK KFYICDYKGP TKMGRP
|
| |