Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HY04AAS1_0797 |
Symbol | |
ID | 6743602 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | + |
Start bp | 738395 |
End bp | 739666 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 642750597 |
Product | glycosyl transferase family 39 |
Protein accession | YP_002121462 |
Protein GI | 195953172 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000000309537 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTAAAA AAGAATTATT TACTTATACG CTGCTTTCTT TTTATTTTTT CTTTCTTGGC AACAACATAC TTTCAATCAC CTCTCCAGAT GAAGGAAGAA ACCTATACGC CGCTTCTTAC ATGCTTCAGA CTGGACATTT TCTAACGCCT ATGTTTAACT GTCATTTTAG GTTTGAAAAG CCCCCTATGC TATACTGGGT TAGTGATTTG TTGTTTTTAA TATTTGGAGT TCATGATTGG CTTGCAAGAG CTGTTTCTGG TATTAGCGCT TTTATAGTAG GTATTTTTAC ATATAAAATA GCAAAGGATA TATATAAGCT TGAAAATCCT TTTTACGCAT CTTTGGTGCT TTTTAGCATT ACACATCTTT GGATAGAATC AAGAGCCTAT GTCCCAGAGA TGCTTCTTAC GGCTTTTGAA ATAATAGCCA TATACTATTT TCTAAAAGAT AAAATAAGCT TAGGTTATCT TTTTATGGGT CTTGCTACAC TCACAAAAGG ACCAGTTGGT ACCGCTGTAG TGCTTATGGT GATTATTACT CTAAAAAGAG ATATTAAGTA TTTTAAAAAG CTCTTAGATC TAAAAAGCAT ACTACTTTAT ATCTTGGTGG GCTGGAGTTG GTATTTTTAT ATGGTTTATA AATATGGATT TTACTACATA TATAAATTTT TTATAAGAAA CAATATAGAT GTCTATACAG GGCAAAAAAA TATACACCTT TACCCGTTTT ATTATTATGT AGTTGTGCTT CTTATAGCCC TAATCCTATG GGTCCCTTGG TTTTATAGCT TTTTTAAAAA ATATATAAAA GATAAATCAA AAACTGCCAC AGATATGCTA TTATGGTCTT TGGTGGTACT CTTATTTTTT ACTCTGTCCA AAAACAAACT ACATCATTAC ATAATAAGTA TATATGTCCC TATCTCGATA TTTATATCTA TTTATGCACC AAAAAAGCTA ATAAAGTTTA ACGTGTTTTT ATCAAGCATC TTGCTTTTCA TACTTTTTAT ACTTGCTTAT CGCTATGAAC AAGAACGCTT TGTCCCAAAA GCTGTATCTA TACTAAAAGC CCAGAAACTA CCAGTTTATT ACGACAACGC TCATCTATCA GCTATGGTAT ACTATCTAAA CACCTGCATA GACTCTTTAC CAAAATACAC ACCAAAACAC TACTTTGTAA TATCAAAAAA CCCCCCAGAT AAAATGCCAT CTTTTAAGCT TTTAACAAAA GGCATAGAGT TTGACGGCAA GTATTATCTT TACGAGAGAT AA
|
Protein sequence | MSKKELFTYT LLSFYFFFLG NNILSITSPD EGRNLYAASY MLQTGHFLTP MFNCHFRFEK PPMLYWVSDL LFLIFGVHDW LARAVSGISA FIVGIFTYKI AKDIYKLENP FYASLVLFSI THLWIESRAY VPEMLLTAFE IIAIYYFLKD KISLGYLFMG LATLTKGPVG TAVVLMVIIT LKRDIKYFKK LLDLKSILLY ILVGWSWYFY MVYKYGFYYI YKFFIRNNID VYTGQKNIHL YPFYYYVVVL LIALILWVPW FYSFFKKYIK DKSKTATDML LWSLVVLLFF TLSKNKLHHY IISIYVPISI FISIYAPKKL IKFNVFLSSI LLFILFILAY RYEQERFVPK AVSILKAQKL PVYYDNAHLS AMVYYLNTCI DSLPKYTPKH YFVISKNPPD KMPSFKLLTK GIEFDGKYYL YER
|
| |