Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1226 |
Symbol | |
ID | 4269757 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1429237 |
End bp | 1430337 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638125976 |
Product | chorismate synthase |
Protein accession | YP_742065 |
Protein GI | 114320382 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.0161922 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGGAA ATACCATTGG CAAACTGTTT ACCGTCACCA CCTTTGGTGA AAGTCACGGC CCTGCACTGG GCGCCATCGT GGATGGCTGT CCGCCGGGGC TGGCGCTGAG CGAGGCCGAT CTGCAGCGGG ACCTGGACCG GCGGCGGCCC GGCACCTCCA AGTTCACCAC CCAGCGCAAA GAGCCTGATC AGGTGCGGAT CCTGTCCGGG GTGTTCGAGG GGCGGACCAC CGGCACCCCG ATCGGATTGC TGATCGAGAA CACCGACCAG CGCTCAAAGG ACTACGCCGA GATCGCCCGG CGCTTCCGGC CGGGCCATGC CGATTACACC TATCTGCAGA AATACGGCAT CCGCGACTAC CGTGGCGGCG GCCGGTCGTC GGCGCGCGAG ACCGCCATGC GGGTGGCCGC CGGCGGTATC GCGCGCAAGT ACCTGCGCGA GCGGCTGGGG GTCACCGTGC AGGGGTGCCT CACCCAGCTC GGCCCGATCG AGCTGGGTAT CAAGGACTGG ATGGCGGTGG ATGACAACCC CTTCTTCTGT GCCGATCCGG AGCGTGTTCC GGACCTTGAG TCGTTCATGC AGGACCTGCG AAAGGCGGGC AATTCCATCG GCGCGGCCGT GACCGTGGTC GCCCGAGGTT GCCCGCCCGG GCTGGGTGAG CCGGTGTTCG ACCGCCTGGA TGCGGATCTG GCCCACGCCT TGATGAGCAT CAATGCGGTC AAGGGCGTGG AGCTGGGGGC GGGGTTTGCC AGTGTGATGC AGCACGGCAG CGAACACCGG GATGAGCTGA CCCCTGAGGG CTTCGCCAGT AACAACGCCG GCGGCGTGCT CGGCGGCCTG TCCACCGGGC AGGACGTGGT GACACGGATC GCCCTCAAGC CCACCTCCAG CATCGTGGTG CCGGGTCGCA CCATCGATAC CGAGGGTGAG CCGGTGCCAG TGGTCACCAA GGGGCGGCAC GACCCCTGCG TGGGCATCCG CGCGGTGCCC ATCGCCGAGG CCATGGTGGC TTTGACGCTC ATGGACCACT GGCTGCGCCA CCGCGCCCAG TGCGCTGACG TGCAGCCGGA GACGCCGCCC ATCCCCGCCG CCAATCGCTG A
|
Protein sequence | MSGNTIGKLF TVTTFGESHG PALGAIVDGC PPGLALSEAD LQRDLDRRRP GTSKFTTQRK EPDQVRILSG VFEGRTTGTP IGLLIENTDQ RSKDYAEIAR RFRPGHADYT YLQKYGIRDY RGGGRSSARE TAMRVAAGGI ARKYLRERLG VTVQGCLTQL GPIELGIKDW MAVDDNPFFC ADPERVPDLE SFMQDLRKAG NSIGAAVTVV ARGCPPGLGE PVFDRLDADL AHALMSINAV KGVELGAGFA SVMQHGSEHR DELTPEGFAS NNAGGVLGGL STGQDVVTRI ALKPTSSIVV PGRTIDTEGE PVPVVTKGRH DPCVGIRAVP IAEAMVALTL MDHWLRHRAQ CADVQPETPP IPAANR
|
| |