Gene Information |
Locus tag | Mlg_1417 |
Symbol | |
ID | 4270415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1622901 |
End bp | 1623917 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638126173 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_742256 |
Protein GI | 114320573 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.626218 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.75601 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTGGG CCCGTCTCAC GCTGCTGACG ACGCTGCTCT TGGTGATGGC CGCCCTGGCG GTCGGTGCCT GGGCCTGGCA GGCCTGGGAC CGGCTCACTG CGCCTATCAC AGCGGACGGG GAGTCGGTGG TCATCGAGAT CCCCCGGGGC GCCTCCTTCC GCCAAGTGGT TGAGCGGCTG GAGCGGGAGA CGGCCTTCGA GGATGGCCTG GCCCTGCGGC TGTACGCCCG CTATACCGGC GACGACGCCC GGGTCCAGGC GGGCGAGTAC GCCCTGGAGC CGGGCATCAG CGTGCTGGAT GCCCTGGAGC GGTTCGCCCG AGGCGAGGTC GTCCAGCACC GCATCACCGT GGTCGAGGGC CTCACCTTCC GCCAGATGCG GCGTCTCATC GAAGCCCACC CGGCCCTGGA GCAGACCCTT AAGGGGCTGG ACGATGAGGG GGTGATGGCC GAGCTGGGCA AGCCGGATCG TCACCCGGAG GGCTGGTTCT ACCCCAGTAC CTACACCTTC CCCCGCGGGA CCACCGACCG TGACCTGCTG GCCCGCGCCA TGCGCCGCAT GGAGCGCCGC CTGGAGGAGG AGTGGGCGGC GCGGGCCGAC GGACTGCCCC TGGAGACGCC CTACGAGGCG CTGATTCTGG CCTCCATCAT CGAGCGCGAG ACCGGGCGGG ACGGGGAGCG GGCGAAGGTG GCCGGCGTCT TCACCCGGCG GCTGGAAAAG GGCATGCGCC TGCAGACCGA CCCGACGGTC ATCTACGGCA TGGGTGAGGC CTATGACGGG CGCATACGCA GCGCCGATCT GCGCCGGGAC ACGCCTTATA ACACCTACAC CCGCCACGGC CTGCCCCCGA CGCCCATTGC CATGCCCGGC AGCGCCTCGA TCCGCGCGGC CGTGAACCCG GCGGACCACG ACTACCTCTA CTTCGTCTCG CGCGGCGACG GCAGCCACCA ATTCTCCCGC ACCCTGGCGG AACACAACCG TGCCGTGCGC CGCTACATTC TGGGGGAGGG CGAATGA
|
Protein sequence | MNWARLTLLT TLLLVMAALA VGAWAWQAWD RLTAPITADG ESVVIEIPRG ASFRQVVERL ERETAFEDGL ALRLYARYTG DDARVQAGEY ALEPGISVLD ALERFARGEV VQHRITVVEG LTFRQMRRLI EAHPALEQTL KGLDDEGVMA ELGKPDRHPE GWFYPSTYTF PRGTTDRDLL ARAMRRMERR LEEEWAARAD GLPLETPYEA LILASIIERE TGRDGERAKV AGVFTRRLEK GMRLQTDPTV IYGMGEAYDG RIRSADLRRD TPYNTYTRHG LPPTPIAMPG SASIRAAVNP ADHDYLYFVS RGDGSHQFSR TLAEHNRAVR RYILGEGE
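The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; neither assumption is stated in the record. The function names `span_length` and `protein_length` are hypothetical and chosen only for illustration.

```python
# Values copied from the record above.
start_bp = 1622901
end_bp = 1623917
gene_length_bp = 1017
protein_length_aa = 338


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

# Compare the computed values with the ones reported in the record.
print("gene length matches:", computed_gene_length == gene_length_bp)
print("protein length matches:", computed_protein_length == protein_length_aa)
```

Under these assumptions, both computed values agree with the reported Gene Length and Protein Length fields.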