Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_4668 |
Symbol | |
ID | 5834429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 5217958 |
End bp | 5219478 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641370463 |
Product | anthranilate synthase component I |
Protein accession | YP_001642107 |
Protein GI | 163854064 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0323578 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAGC CTGCGCACGA GGCCGTAGCG CGCGCCTATG AGACCGGACG GGCAAGCCTG CTCGGCCTCA CCCTGGTGGC CGATCTGGAG ACCCCGGTCG CGGCCTTTCT GAAGCTGAAA GCGGTCCATG CCGGCCCCGG CTTCCTGCTC GAATCGGTCG AGGGCGGCGC GGTGCGCGGG CGCTACTCGA TGATCGGGCT CGACCCCGAC CTGATCTGGC GCTGCCGCGA CGGGCGCACC GAGATCGCCC GCGACGCGGG CCTCGCGGAC TTCGTGGCCG ACGAGCGGGC ACCGCTCGCC TCGCTGCGTG CGCTGATCGC CGAGAGCGCG GTCGGGGATG ATGCCGAAAG CGCGGCGCTG CCGCCGATGG CCGCCGGCCT GTTCGGCTAT CTCGGCTACG ACATGGTCCG CGCCATGGAG CGGCTGCCGG AGCCGAACCC CGATCCGCTC GGCGTGCCCG ACGCGATCCT GATGCGTCCG CGGGTGATGG TGGTGTTCGA CGCGGTGGCC GACGCGCTCA CCGTCGTCAC CCCCGTGCGC CCGACCGACG GCGTGAGCGC CCGCGCCGCG CTGGAAGCTG CGCAGGCCCG GCTCGACCGG GTGGCGGAAG CCCTCGAAGG CCCATTGCCG GTCGAGGCGC GCCTCGACGT CTCGAGCCTT CCCCTGCCCT CCCCGGTCTC GAACACCGAG CCGGACGCGT TCCTCGGCAT GGCGGCCAAG GCCAAGGAGT ACATCGTCGC GGGCGACATC TTCCAAGTGG TGCTGTCGCA GCGCTTCGAG GCGCCCTTCA CCCTGCCGGC CCTTGCGCTC TACCGCTCCC TGCGCCGCAC CAACCCGGCG CCGTTCCTGT GCTACCTCGA TTTCGAGGCG TTTCAGATCG TCTGCTCAAG CCCCGAGATC CTCGTTCGGG TGCGCGAGGG CAAGGTGACG ATCCGCCCGA TCGCCGGCAC GCGGCGGCGC GGCACCACAC CGGCCGAGGA CCGGGCGCTC GCCGACGAAC TCCTGGCCGA CCCGAAGGAG CGCTCCGAGC ACCTGATGCT GCTCGACCTC GGCCGCAACG ATGTCGGCCG CGTCTCGAAG ATCGGCAGCG TCACCGTCAC CGACTCGTTC TTCCTCGAAT ACTATTCCCA GGTCATGCAC ATCGTCTCGA ACGTCGAGGG CGACCTCGAC CCGAGCCACG ACGCGCTCTC GGCGCTGGCC GCCGGCTTTC CCGCCGGCAC CGTCTCGGGC GCGCCGAAGG TGCGGGCGAT GGAGATCATC AACGAGTTGG AGCGCGAGAA GCGCGGGCCC TATGGCGGCT GCATCGGCTA TTTCGGTGCG CGCGGCGAGA TGGATACCTG CATCGTCCTG CGCACGGCCA TCGTGAAGGA CGGCCGCATG CATGTGCAGG CGGGTGCCGG CATCGTCTAC GATTCCGATC CGCACTCCGA GCAGCAGGAA TGCGTGAACA AGGCCAAGGC CCTGTTCCGC GCGGCAGAGG ATGCGGTGCA GTTCGCGAGC CGGGCCAAGC GCGGGCAGTA G
|
Protein sequence | MTEPAHEAVA RAYETGRASL LGLTLVADLE TPVAAFLKLK AVHAGPGFLL ESVEGGAVRG RYSMIGLDPD LIWRCRDGRT EIARDAGLAD FVADERAPLA SLRALIAESA VGDDAESAAL PPMAAGLFGY LGYDMVRAME RLPEPNPDPL GVPDAILMRP RVMVVFDAVA DALTVVTPVR PTDGVSARAA LEAAQARLDR VAEALEGPLP VEARLDVSSL PLPSPVSNTE PDAFLGMAAK AKEYIVAGDI FQVVLSQRFE APFTLPALAL YRSLRRTNPA PFLCYLDFEA FQIVCSSPEI LVRVREGKVT IRPIAGTRRR GTTPAEDRAL ADELLADPKE RSEHLMLLDL GRNDVGRVSK IGSVTVTDSF FLEYYSQVMH IVSNVEGDLD PSHDALSALA AGFPAGTVSG APKVRAMEII NELEREKRGP YGGCIGYFGA RGEMDTCIVL RTAIVKDGRM HVQAGAGIVY DSDPHSEQQE CVNKAKALFR AAEDAVQFAS RAKRGQ
|
| |