Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2250 |
Symbol | |
ID | 4269191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2549950 |
End bp | 2551458 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638127007 |
Product | anthranilate synthase component I |
Protein accession | YP_743082 |
Protein GI | 114321399 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.327103 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACCGG AACAACTGCG TGAACTGGCC GAGGCCGGCT ACAACCGCGT CCCGCTGGTG CGGGAGATCC TCGCCGATCT GGACACCCCG CTGTCCACCT ACCTGAAGCT GGCCCAGGGG CCCTATTCCT ATTTGTTCGA GTCGGTCCAG GGCGGTGAGA AGTGGGGCCG TTATTCCATC ATCGGCCTGC CTTGCCGCAC GGTGCTGCGG GTATGGGGCC AGGAGGTGGA GGTCAGCGAG GATGACCGGG TGGTGGAACG CATCCCGGTG GACGATCCGC TGGCCTGGAT CGAGGCCTAC CAGGCCCGCT TCAAGGCGGC CGATCCGCCC GGCTTTCCGC AGATGCCCCG CTTTCTCGGC GGGCTGGTGG GCTACTTCGG CTACGAGACC GTGGGGTATA TCGAGCCGCG CCTGGCCGGC CGGGCCAACC GCGACGAGCT GGAGGTGCCC GACATCCTGT TGATGGTCAG CGAGGAGGTG CTGGTCTTCG ACAACCTGGC CGGCCGCCTC TACCTGGTGG TGCAGGTGGA CCCGGCCGCC GAGGGGGCGC TGACACGCGG GCAGGCGCGC CTGGAGGAGC TGGCCGAGCG GCTGCGCCGC GCCGACTCGG TCTACCAGCG GCGCCGTCCT CCACGGCGGG TGCTGGAGTC GGACTTTCGC TCGAACTTCA CCCAGCCGGA GTTTGAGGCG GTGGTGGAGC GTATCCGCGA GTACATCCTG GCCGGCGACT GCATGCAGGT GGTGCCCTCC CAGCGCATGT CGGTGCCCTA CCGGGCGGAG CCGCTGGACC TCTACCGGGC GCTGCGCTGC ACCAATCCCT CCCCCTATAT GTATTACCTG GACCTGGGCG ATTTCCACGT GGCCGGCTCG TCGCCGGAGA TCCTGGTGCG GCTGGAGGAC GACCAGGTGA CGGTGCGGCC CATCGCCGGT ACCCGACGCC GTGGCCACGA CGAGGCCGAG GACCTGGCCC TGGAGGCCGA TCTGGTCAGT GACCCCAAGG AGCTGGCTGA GCACCTGATG CTCATCGACC TGGGGCGCAA CGACGTGGGC CGGGTGGCGG ACACCGGCAG CGTGCGGGTC ACCGAGCGGA TGGTGGTGGA GCGCTACTCC CACGTCATGC ATATCGTCTC CAACGTCACC GGGCGGCTGC GCCCCGGGCA GGGGCCCATG GAGGTGCTGC GCGCTACCTT CCCGGCGGGT ACGGTGAGCG GGGCGCCGAA GATACGGGCC ATGGAGATCA TCGCCGAGGT GGAGCCGGTC AAGCGGGGGG TGTACTCGGG GGCGGTGGGG TACCTGTCCT GGTCCGGCAA CCTGGATACC GCCATCGCCA TCCGCACCGC GGTGATCAAG GACGGCCGGG TGTACGTGCA GGCGGGCGCT GGGGTGGTGG CCGACTCGGT GCCGCGGCTG GAGTGGAAGG AGACGCTGAA CAAGGGCCGC GCCCTGTTCC GCGCCGTGGA GATGGCCGAG CACGGCCTGG ATAACCCACC GGCGGTGGAG GCGCACTGA
|
Protein sequence | MQPEQLRELA EAGYNRVPLV REILADLDTP LSTYLKLAQG PYSYLFESVQ GGEKWGRYSI IGLPCRTVLR VWGQEVEVSE DDRVVERIPV DDPLAWIEAY QARFKAADPP GFPQMPRFLG GLVGYFGYET VGYIEPRLAG RANRDELEVP DILLMVSEEV LVFDNLAGRL YLVVQVDPAA EGALTRGQAR LEELAERLRR ADSVYQRRRP PRRVLESDFR SNFTQPEFEA VVERIREYIL AGDCMQVVPS QRMSVPYRAE PLDLYRALRC TNPSPYMYYL DLGDFHVAGS SPEILVRLED DQVTVRPIAG TRRRGHDEAE DLALEADLVS DPKELAEHLM LIDLGRNDVG RVADTGSVRV TERMVVERYS HVMHIVSNVT GRLRPGQGPM EVLRATFPAG TVSGAPKIRA MEIIAEVEPV KRGVYSGAVG YLSWSGNLDT AIAIRTAVIK DGRVYVQAGA GVVADSVPRL EWKETLNKGR ALFRAVEMAE HGLDNPPAVE AH
|
| |