Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HY04AAS1_1518 |
Symbol | |
ID | 6744349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | + |
Start bp | 1430608 |
End bp | 1432128 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 642751339 |
Product | glycosyl transferase family 39 |
Protein accession | YP_002122179 |
Protein GI | 195953889 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAAAAG CCATAAAACT AAATCCCTAT ATGATTTCAC AATTAAATCA GTCTTATTGC AATATAATAG TTTTGATGAA CTTTAAAAAT GTCTTCTTAA TAAATGTGTT TTTTCTGATC TTTAGGATAA CGTACGTGCT TTTTTACCCT ATATCCTTGG ACCCAGAAGA AGTTCAGTAT TGGGACTGGG CTAGACATAT AGATTTTAGC TACTACTCAA AACCTCCTTT GGTGGCATAT TTAAACTTTG TCTCGATGCA TCTTTTTGGT ATAAACAATT TAGCTGTTAG AATATGGCCC ATTTTGTTTG CTTTTATAAT GTTAAACTTT TCTTATTTTT TTGTTAAAAA AATATATGGT CCATCTGCTG CCTTCTGGTC TTCTGTTATA CCAAACACCG TCATAGGCTT TAACATAAAC GCAATACTGA TGACCACCGA TGCGCCTTTT TTGTTTTTCT GGGGACTTAG TGTCATAGCA ATGTACGAAG CCCTTCATAA GAATAAATTA AGAGACTACA TATACCTTGG CATATTTGCA GGACTTGCTT TTTTAGCAAA GTATACCGCT GTTTTTTTGG TGCTTGGTTT TATTTATGCT TTTATATATA AAAAAGATGT TCTTAAAAGC TTTAAACCTT ATCTTTCAAT GTTAATAGCT ACAGTACTTG GCATTCAAGT GATAATATGG AATGCTTTTC ATCACTTCGA TGGGTTTAAG CATGTGGGGG CTTTGGCGGG AATAGAAGAT GATGCTAGTG TTAAAGCCCT AAGGGTTTTG AATTTCCTAG GAGGGCAAAT TGGAGTGTTA TCAATAGTCT GGTTTTTTGT ATTTTTATAC GCTTCTTTTA AAAGCCTAAA ACAAAAAGAC CACAATGATA TGTATTTTAT CATTTTATCT TGGCCTATTT TGCTATTTTT TGCTATTCTT AGTATAAATA CAAACGTACA AGCCAATTGG CCAGATTTTG CTTACTTTAG TGCGTTTATT TTAATATCAA AATATTTTGA GGCTTTTAGA AAAAAGATAC TTTATATAGG TTTTTCTATG CTTATAACAA TACTTGTGAT GTTTACACCA ATTCTTGATT TAATAGGTCT TGGTAATATT TTAAAACCCA TTCACGATCC TACAAAATTT TTGGCAGGCT GGGATAAGCT TGGTGCTTTT GTATCAAGAT TTTACAAACC TGGGGATTTG GTGTTTAGCG ATTACTATCA AATAGCTGGG GAGCTTGCTT TTTATATGAA AGACCATCCT GAAGTGTTTT GTATAAATTT AGGTACAAGG ATGAATGAGT TTTATCTATG GCAACCTCTT ATGAAAAACG ATATAGGGCA CGATGGGATC TTTGTAAGCG TTCATCCGAT AGATGACAAA GTTTTATCTG GCTTTAAAAA AATAATATAT CAAACCACTT ACACCGTTTA CTGGAGATCT AAACCTGTAG AAACCTACTA TATATATGTA TTAAAAGATT ACAACGGGCA TATAAAACAG GTAAAGACAC GTAGTTATTA A
|
Protein sequence | MPKAIKLNPY MISQLNQSYC NIIVLMNFKN VFLINVFFLI FRITYVLFYP ISLDPEEVQY WDWARHIDFS YYSKPPLVAY LNFVSMHLFG INNLAVRIWP ILFAFIMLNF SYFFVKKIYG PSAAFWSSVI PNTVIGFNIN AILMTTDAPF LFFWGLSVIA MYEALHKNKL RDYIYLGIFA GLAFLAKYTA VFLVLGFIYA FIYKKDVLKS FKPYLSMLIA TVLGIQVIIW NAFHHFDGFK HVGALAGIED DASVKALRVL NFLGGQIGVL SIVWFFVFLY ASFKSLKQKD HNDMYFIILS WPILLFFAIL SINTNVQANW PDFAYFSAFI LISKYFEAFR KKILYIGFSM LITILVMFTP ILDLIGLGNI LKPIHDPTKF LAGWDKLGAF VSRFYKPGDL VFSDYYQIAG ELAFYMKDHP EVFCINLGTR MNEFYLWQPL MKNDIGHDGI FVSVHPIDDK VLSGFKKIIY QTTYTVYWRS KPVETYYIYV LKDYNGHIKQ VKTRSY
|
| |