Gene Information |
Locus tag | Moth_1637 |
Symbol | |
ID | 3831266 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1672942 |
End bp | 1674000 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637829562 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_430482 |
Protein GI | 83590473 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
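
The coordinate and length fields above are related by simple arithmetic. The following minimal sketch checks that relationship, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the reported 1059 bp) and that the CDS ends in a single stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record; variable names are illustrative only.
start_bp = 1672942
end_bp = 1674000

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon, which encodes no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1059 352
```

Both results agree with the Gene Length and Protein Length fields.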
| |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.521882 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGGAAG AACTCGAAAC AAGTCATAAT TACATCATAG CCTTTGACCG GCCCTGGCGG CACCGGGTCA CACTATTACT TGCTTTAACA GCCCTGCTGG TGGGCCTGGG CTGGTACTTT ACGACCCTTC TGGCTCCCAG GCACCCCGGG GGAGCGGCAA TCGAGGTGTC TATTCCGCCC GGAGCGAGTT CGGCCACCAT TGCCGCCACC CTGGCGGAGA AGGGCCTTAT CCGCAGCCCC CTGGCCTTCC GGCTGGTGGC CATGGCCCAG GGAGTGGACA GGCAGTTAAA GTCGGGCAGT TATTTAATTT CTCCCGGCCT GCCCCTGCCG GCTATAACCC GCCTGCTGGC CAGCGGCAAC ACCCTGGAGA TCGAGTTTAC CGTTCCGGAA GGGTATACCG TCCGCCAGAT TGCGTCCCTG TTGCAACAAA AGGGACTGGT TAAGGAAGAG GATTTCTTAA AAGCCGCCGC CGGGGATTAT CCTTTTTCTT TTCTCCAGGG CCTGCCTTCG GGGCCGGAAC ATGTGCAGGG TTTTCTCTTC CCCGATACCT ACCAGGTAGC GCCGGGTACG CCAGCCCGGG AAATTATCAT GATGATGCTG AACCGCTTTA ACCAGGTCTA CCAGGAGATT GCCCCGCAAA AGGATAAAGA CGTGGAATTC AATATCCGTC AGATTGTAAC CCTGGCTTCC ATAGTCGAAA GGGAAGCCAA ACTCGACAAT GAACGCCCCC TTATCGCCGG GGTATTTATC AACCGCCTCC GGCGGGGTAT GAGGCTGGAA TCCTGTGCCA CGGTGGAATA TCTCCTGCCG GCGCCCAAAC CGGTCCTTTC CTACCAGGAC CTGCAAATTG ATTCTCCTTA CAACACCTAC CGGGTAAAGG GCCTGCCTCC AGGCCCCATT GCCAATCCGG GGCGGGCCTC ACTCCTGGCA GTCCTAAAGC CGAATCAAAC GGATTATCTC TATTTTGTTG CCAAACCGGA CGGCTCCCAC TACTTCAGCA GCACCCTGGC CGAGCATAAC CAGGCTACAG CCCGATACGA GTCAGGTAAC GGAAATTAG
|
Protein sequence | MGEELETSHN YIIAFDRPWR HRVTLLLALT ALLVGLGWYF TTLLAPRHPG GAAIEVSIPP GASSATIAAT LAEKGLIRSP LAFRLVAMAQ GVDRQLKSGS YLISPGLPLP AITRLLASGN TLEIEFTVPE GYTVRQIASL LQQKGLVKEE DFLKAAAGDY PFSFLQGLPS GPEHVQGFLF PDTYQVAPGT PAREIIMMML NRFNQVYQEI APQKDKDVEF NIRQIVTLAS IVEREAKLDN ERPLIAGVFI NRLRRGMRLE SCATVEYLLP APKPVLSYQD LQIDSPYNTY RVKGLPPGPI ANPGRASLLA VLKPNQTDYL YFVAKPDGSH YFSSTLAEHN QATARYESGN GN
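
The protein sequence can be reproduced from the gene sequence by codon translation. Below is a minimal sketch using only the Python standard library. It relies on two assumptions: that the displayed gene sequence is already in coding orientation even though the gene lies on the minus strand (it begins with ATG and ends with TAG), and that the amino-acid assignments of translation table 11 match the standard code, with the tables differing only in which start codons are permitted. The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in the record.
print(translate("ATGGGGGAAG AACTCGAAAC AAGTCATAAT"))  # MGEELETSHN
print(translate("GGAAATTAG"))  # GN
```

The two outputs match the first ten and last two residues of the protein sequence above. Passing the full gene sequence to the same function should give the complete 352-residue protein, provided the assumptions hold.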