Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0733 |
Symbol | |
ID | 7408427 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 824014 |
End bp | 825021 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 643715105 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_002572621 |
Protein GI | 222528739 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.224262 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGATTAT GGCAAAAAAG TTTTAAATAT TATTTGGTTG TATTACTCTT CTTAGTCTTA ATGGTTTCAC TTATTTATGT ATTCTTCAAA CCTCAAAAAG AAAAAGTAAT CGAAGCTATG GTTGAGATTC CACAAAATAC ATCCACAAAA GATGTTGCTA TGATTTTAAA GAAAAATGGA ATTATTGAAA ACCCATACTT TTTTATGTTT TACGTCAAAC TCAACAACTA TAAAATAGCA GCAGGAAAAT ACAAACTTTC ATCTGATATG ACATATAGAG AGCTTTGCAA AGTTCTTGAA AAAGGTTTTG TTCCAAAAGT TGCTATTAAG TTTACTATTC CAGAAGGATT TACGGTCCAG CAAATTGCCA AAAAACTTCA AAGTCTTGGA CTTGTAGATG AAAACAAGTT TTTGGAAACT GCTAATAGTT ACGACTTTAA TTTTAAGTAT AAATACAGTT CGAAAGAAGT AAAGTATAAG CTTGAAGGGT TTTTGTTTCC AGATACATAT GAAGTATATC CCGGCGCTTC TGAAAAGGAT ATTATAAAAA TGATGCTAAA TAGATTTTTA GAAGTATATG AAAACATAAA AATTAAAAAG ACAACAAATT TAGATGATAT TCAAACAGTT ATACTTGCTT CAATTGTTGA AAAAGAGGCA AAAAAAGATA GCGAAAGAGG GATTATTGCT GGTGTGTTTT CAAACAGGCT ACAAAGAGGC ATAAAACTTG AAAGCTGTGC AACGGTAGAA TATGTGTTGC CTGTTCACAA AGAGGTTCTT TCTTTGCAGG ATGTTAGAAT AGAATCTCCG TACAATACAT ACCTAAAAAA AGGACTGCCG CCTTCTGCTA TCTGCAGCCC TGGCAGAAAA AGTCTTCTTG CAGCTTTAGC TCCTGAAAAA ACAGATTATC TATTTTTTGT TGCCAAAAAG GATGGAACTC ATATATTTTC AAAGACATTT GAAGACCATT TGAGGGCTCA AAAACAAATA GAAGAAGGGA AAAAATAA
|
Protein sequence | MRLWQKSFKY YLVVLLFLVL MVSLIYVFFK PQKEKVIEAM VEIPQNTSTK DVAMILKKNG IIENPYFFMF YVKLNNYKIA AGKYKLSSDM TYRELCKVLE KGFVPKVAIK FTIPEGFTVQ QIAKKLQSLG LVDENKFLET ANSYDFNFKY KYSSKEVKYK LEGFLFPDTY EVYPGASEKD IIKMMLNRFL EVYENIKIKK TTNLDDIQTV ILASIVEKEA KKDSERGIIA GVFSNRLQRG IKLESCATVE YVLPVHKEVL SLQDVRIESP YNTYLKKGLP PSAICSPGRK SLLAALAPEK TDYLFFVAKK DGTHIFSKTF EDHLRAQKQI EEGKK
|
| |