Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HY04AAS1_1576 |
Symbol | |
ID | 6744407 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | + |
Start bp | 1491306 |
End bp | 1493546 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 642751395 |
Product | glycosyl transferase family 2 |
Protein accession | YP_002122235 |
Protein GI | 195953945 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGAAGT TGTTGAAATA TAATATTTAT CTACAAAGAT ATATTATTAA AAAAAGTGGC TTGTTTGACG AAAGATACTA TCTTAAATCC TATCCAGATA TAAGGTTAGG AGATATAGAT CCAATAATAC ATTATATAAA GCATGGTGCA AGAGAAGGAA GAAATCCAAA CCCATTTTTT GACACAACTT TTTACTTAAA CAGGTATGAA GATGTTGCTA AAAGCAGAAT AAACCCTCTT TTACACTATA TCTTGTATGG CGCAAAAGAA GGAAGATGGG CAGGGCCTAA TTTTAACTCT GGATTTTATT TGCTTAGCAA TCCAGATGTC AAAAAAGCAG GACTAAATCC ACTTTTACAC TACATTTTAC ATGGGCAATA TGAAGGAAGA AAAGCGAAGA TAGAGACAGA TAGAGAGCTA AAGTTGTATA ATTATAGGAA GTTGTCTAGC TTATTTTTGT TTATTAATAA ATCAAATATC AAAAAAAGTA TGGATTATAT AAGAAAATAC GGTTTTAGGG TTTTTATAAA TAAAGCAAGT ACTAAACTAA ACTCGGTTTA TCAATATGAG GTTTGGATTA AGATAAATGA GCCGACAGAG GAAGAGTTAA CTGCTCAGAA AAATATAAAA TTTAAGTATG AGCCGAAAAT AAGTATAGTG GTACCAGTAT GGAATACGCC AAAAAAATTT TTAATTGATA TGATAGAATC TGTCTTAAAT CAAACGTATT CAAATTGGGA ACTTTGCATT GTTGATGGCA ACAGCAAAGA AAAACATGTT AAAGAGACTT TAGAACACTA TACTCTTAAA GATAAAAGAA TAAAAGTAAA ATATCTAAAA GAGAACAAAG GGATAGCTGG AAATTCAAAT GAAGCAATTG CTTTAGCTAC TGGCGAGTAT ATAGCTTTTT TAGATCACGA TGATGTCTTG GCTCCTTTTG CTTTGTACGA AGTGGTAAGA GCCATAAACG AAAATGAAGA TGTTGATTTT ATATATTCAG ATGAGGACAA GATAACAGAA GATGGACTTA AGAGATTTGA TCCATTTTTC AAACCGGACT TTAGCCCAGA TACACTTAGA AGCTACAACT ACATAACACA TCTTTCGGTA GTAAAGAAAG AGCTTTTAAA TGAAGTGGGG TGGTTTAGAG AAGGATATGA TGGCTCTCAG GATTATGACT TGATACTTAG ATGCACAGAA AAAGCAAAAA AAATAGTTCA TATCCCAAAG ATTTTGTATA ATTGGAGAAT CAATGATAAT TCCGTAGCTC AGGACCCTAA AAACAAAATG TATGCTTACG ATGCTGCTAA AAAAGCTTTA CAAGATCACT TAGACAGAGT GGGATTAAAA GGAAAAGTTA GAGACGGTGT ATTTCTTGGT TCATATAAAA TAGACTATGA TATAAGCTAT CATCATAAAG TGTCTATCAT AATACCAAAC AAAGATCATA AAGAGGATTT GGAGAAATGT ATTACATCTA TTATAAACAA ATCCACTTAT AAAAATTACG AAATAATAAT TGTTGAAAAC AACAGTAAAG AAAAAAAGAC TTTTGAATAC TACAAATATT TACAGAATAA ATATAATAAT ATTGTATTAC TAGAGTGGAA AGATAAATTT AACTATTCAG CTGTGAATAA CTTTGCATCT AAATATGCAA ATGGTGATAT ATTACTATTT TTAAATAATG ATACAGAAGT GATAAATGAA AATTGGATAG AAGAAATGCT TATGTATGCT CAAAGAAAGG ATGTAGGTGC TGTAGGTGCA AAATTGTATT ATCCTGATGA CACTATTCAG CACGGAGGTG TTATATTGGG TATAGGTGGA AAGGTAGGGC ATTCTCATAG GTTTTTCCCA AGAGTTTCTT ATGGAAATGT TGGAAGGTTG GTTGTTGTAC AAAACTTATC TGCAGTAACT GGAGCATGCT TAATGATGCG GAAAGATATA TTTAACGAAG TTGAGGGCTT TGATGAAAGA TATCCTTTGG CATTAAGTGA TATAGATATT TGTTTAAAAG TAAGAGAAAA AGGATATCTA GTTGTTTGGA CACCCTATGC TGAACTTTAT CATTATGAGT CTAAATCTCG TGGTTATGAA GATACACCTG AAAAACAAGA GAGATTTAAA AAAGAAATTG AGCTTTTTAA AAAGAAGTGG GGACATATTT TAGAAAAGGG TGATCCTTAC TACAATCCAA ATCTTACTTT GGATAGAGAA GATTTTTCTA TTAAAATATA G
|
Protein sequence | MWKLLKYNIY LQRYIIKKSG LFDERYYLKS YPDIRLGDID PIIHYIKHGA REGRNPNPFF DTTFYLNRYE DVAKSRINPL LHYILYGAKE GRWAGPNFNS GFYLLSNPDV KKAGLNPLLH YILHGQYEGR KAKIETDREL KLYNYRKLSS LFLFINKSNI KKSMDYIRKY GFRVFINKAS TKLNSVYQYE VWIKINEPTE EELTAQKNIK FKYEPKISIV VPVWNTPKKF LIDMIESVLN QTYSNWELCI VDGNSKEKHV KETLEHYTLK DKRIKVKYLK ENKGIAGNSN EAIALATGEY IAFLDHDDVL APFALYEVVR AINENEDVDF IYSDEDKITE DGLKRFDPFF KPDFSPDTLR SYNYITHLSV VKKELLNEVG WFREGYDGSQ DYDLILRCTE KAKKIVHIPK ILYNWRINDN SVAQDPKNKM YAYDAAKKAL QDHLDRVGLK GKVRDGVFLG SYKIDYDISY HHKVSIIIPN KDHKEDLEKC ITSIINKSTY KNYEIIIVEN NSKEKKTFEY YKYLQNKYNN IVLLEWKDKF NYSAVNNFAS KYANGDILLF LNNDTEVINE NWIEEMLMYA QRKDVGAVGA KLYYPDDTIQ HGGVILGIGG KVGHSHRFFP RVSYGNVGRL VVVQNLSAVT GACLMMRKDI FNEVEGFDER YPLALSDIDI CLKVREKGYL VVWTPYAELY HYESKSRGYE DTPEKQERFK KEIELFKKKW GHILEKGDPY YNPNLTLDRE DFSIKI
|
| |