Gene Information |
Locus tag | HY04AAS1_0055 |
Symbol | |
ID | 6742836 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | + |
Start bp | 49700 |
End bp | 51844 |
Gene Length | 2145 bp |
Protein Length | 714 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 642749838 |
Product | methyl-accepting chemotaxis sensory transducer |
Protein accession | YP_002120725 |
Protein GI | 195952435 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
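
The coordinate and length fields above are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS length includes the stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(49700, 51844)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2145 714
```

Under these assumptions the computed values match the Gene Length (2145 bp) and Protein Length (714 aa) rows.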
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000000759141 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACAAATA TATCAGTGAA GACAAAGCTT TATATCTTGG GTGCAATAGT GGCTTTTTTT GGTGTTGTTA TAGGGGCTAT CGTCTTTTAT AATACCCATA AAGCAGTAGC TCAAATGGAC CAAGAAAAGT ATTGTCTTAA AGTAGCCGAT CCTCTTTATG AGATGCTTAT AGAACTTCAA ACGGCTAGGA TGTATAATAA GTCTTTGGCG GTGGGTGATA AAAGCTATAA AACACCTTTG GAAAATACTT TTAACAAAAT AGACAAGATT TTATCAAAAC TTTCTTTATT GGATAAAAAA TATGGGAAAA AACTTGATAT GAACAAAAGT TTTGCTGAGC TTTTATCTAT GTATGACAAC CTTAAGCAAA ATGCTTATAG TTTTAATCCG TATCAAGTGA ATAAAAATTA TGATACCATA TCTAACTTTT TAACAGATAA ACTTATTATA AGAGATTGTT ATATAAGAGG AAAACTTATC AAAGATCCTG ATTTTGCGGA TTCTACTGCC ATAACTGGAT TTTTTGGAAA TCAGCCTCAT ATTATTTTAA AAGTCGGTAA ACTTCAAAGT AAATCTTTTG AGCTTATAAA TATATCTAAA CAAAACGCAA CTTCAAACAC TGCTCAAAGT ACAAATTCAA CATCAAACCT AAACAATAGC TTAAACCAAC AGAGCGCTTC TAATCAACAA AACACATATA TACACCATTT GGTTATGGAT ATCTCAAGTT TGTTGAGCAG CATAAGAGAA CATATTCATT ACGCTTGTTT ATTTTACAAC GAGAGCGCCA GAGCCAATAA GTTTTACAAC GAAATACATC TGTCTGCCAA CGCTTATAAA AGATATATAA ACGATACTTT TAAACCCATT ATAGAAAAAA TATTGGATAA TCCAGAGAGT GCCTATATTT ACAAAGAGCA GTTTGTGAGC TCTCTTTCAA AGCTACAAAC TCTTTCTAAC GCATTATCAA GAGATACAAT TCATACGTTA GAAAAATCTG TAAATATTAG AGATGCAAAA GACCATGAAG TACTTTATGT AAGTCTATTT GTGGTAAGTA TAGGCATAGT ATTATCTATG TGGTTTGTAT ATTTGACGAT AAAGAGTATA GATAACAACG TAAGTATATT AAATAGAGTT GCAAATGAAT TTAAAAACGG TAATCTAAAT ATAAAAGCTG AGATACGTTA TCAAGATGAA ATAGGTAAGG CAATAGAAAG TTTGGTAGAT GGAGTTGGTA CGGCAAACAG CATATTGCAA GATATAAAAT CTACTATCGA GAGAATGGGA AGCTTAGATT TTACTAAAAA TGTAGAGACT GATGCGGTAG GAGATTTTGA AGCTATCAAA AATGATGTCA ACAAGTCTTT GGACGCTTTG AGAAAACTAT TGCAAGCTAT AACAGAAAGC GTTGTAAAAC TAGGTACAAG CATGGAAGAA ACATCTGCCA CTACAAACGC TTTGGCTTTA GACAATAAAA ATCTCAACGA GCAGATAAAT GCGTTGGCAA ATTCTATAGA AGAGATATCT GCTACTGTTA ACTCTATAGC CTCAAACATG ACAGATACAA AAAATATCAT AAACAAACTT TTTGAAATAG TAAACAAAGG AAAATTGGCG ATAAATCAAA CAAGAGTTGA TGCTGATATG ATGAGTGAGC TGGCAAAAAA GGCAGTCTCC ATTGTAGATT CTATAGTGTT TATAACAGAA CAGACAAATC TTTTAGCTTT AAACGCTGCC ATAGAAGCGG CCCGTGCAGG GGAAGCTGGG AGAGGATTCG CCGTTGTGGC AGATGAGGTG 
AAGAAACTGG CTGAAAAAGC CGGTGGTTTT GCAAAGAACG TATCAGATAT TATATCTGAT ATAACAAAAG GTGTAAATTC AACTGTTAAG TCTATATTGA TGGTAGACGA TTATTATAAA GATATAGAAA ATTATACATC TAACGTACAA GAAGCTTCTC AATCCATATC TTCTGCCATA GAAGAGCAAA ACGCTACTTT GAACATGCTA AACAACTCCA TGTTAGATGT GAGGACGTTT TCTGACAAAT TGGCGGCAGC TATAGAAGAG CTTTCAGCTA CAGCAAAGTC ATTGGCTGAT GTAGCTCAAG AGCTTAAAAT AGAAGTTACT AAATTTAAGT TTTAG
Protein sequence | MTNISVKTKL YILGAIVAFF GVVIGAIVFY NTHKAVAQMD QEKYCLKVAD PLYEMLIELQ TARMYNKSLA VGDKSYKTPL ENTFNKIDKI LSKLSLLDKK YGKKLDMNKS FAELLSMYDN LKQNAYSFNP YQVNKNYDTI SNFLTDKLII RDCYIRGKLI KDPDFADSTA ITGFFGNQPH IILKVGKLQS KSFELINISK QNATSNTAQS TNSTSNLNNS LNQQSASNQQ NTYIHHLVMD ISSLLSSIRE HIHYACLFYN ESARANKFYN EIHLSANAYK RYINDTFKPI IEKILDNPES AYIYKEQFVS SLSKLQTLSN ALSRDTIHTL EKSVNIRDAK DHEVLYVSLF VVSIGIVLSM WFVYLTIKSI DNNVSILNRV ANEFKNGNLN IKAEIRYQDE IGKAIESLVD GVGTANSILQ DIKSTIERMG SLDFTKNVET DAVGDFEAIK NDVNKSLDAL RKLLQAITES VVKLGTSMEE TSATTNALAL DNKNLNEQIN ALANSIEEIS ATVNSIASNM TDTKNIINKL FEIVNKGKLA INQTRVDADM MSELAKKAVS IVDSIVFITE QTNLLALNAA IEAARAGEAG RGFAVVADEV KKLAEKAGGF AKNVSDIISD ITKGVNSTVK SILMVDDYYK DIENYTSNVQ EASQSISSAI EEQNATLNML NNSMLDVRTF SDKLAAAIEE LSATAKSLAD VAQELKIEVT KFKF
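
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation routine. This is a sketch only: it assumes the space-separated 10-base blocks are purely a display convention, and it builds the codon map by hand rather than using a bioinformatics library. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in which codons may act as start codons), so the standard assignments are used here. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the display spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGACAAATA TATCAGTGAA GACAAAGCTT"))  # MTNISVKTKL
print(translate("AAATTTAAGT TTTAG"))                  # KFKF
```

The two fragments reproduce the first ten and last four residues of the protein sequence listed in the record.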