Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mchl_5131 |
Symbol | |
ID | 7116169 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | + |
Start bp | 5495978 |
End bp | 5497498 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643527824 |
Product | anthranilate synthase component I |
Protein accession | YP_002423823 |
Protein GI | 218533007 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.555304 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.114456 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAGC CTGCGCACGA GGCCGTAGCG CGCGCCTATG AGGCCGGACG GGCAAGTCTG CTCGGCCTCA CCCTGGTGGC CGATCTGGAG ACGCCGGTTG CGGCCTTCCT CAAGCTGAAG GCGGTTCATG CCGGCCCCGG CTTCCTGCTC GAATCGGTCG AGGGCGGCGC GGTGCGCGGG CGCTACTCGA TGATCGGGCT CGACCCCGAC CTGATCTGGC GCTGCCGCGA CGGGCGCACC GAGATCGCCC GCGACGCAGG CCTCTCAGAC TTCGTGGCCG ACGAGCGGGC GCCGCTCGCC TCGCTGCGCG CCCTCATCGC TGAAAGCGCG GTCGGGGATG ATGCCGAAAG CGCCGCGCTG CCGCCGATGG CCGCCGGCCT GTTCGGCTAT CTCGGCTACG ACATGGTCCG CGCCATGGAG CGGCTGCCTG AGCCGAACCC CGATCCGCTC GGCGTGCCCG ACGCGATCCT GATGCGCCCG CGGGTGATGG TGGTGTTCGA CGCGGTGGCC GACGCGCTCA CCGTCGTCAC CCCCGTGCGC CCGACCGACG GCGTCAGCGC CCGCGCCGCG CTGGAAGCCG CGCAGGCCCG GCTCGACCGG GTGGCGGAGG CCCTCGAAGG CCCGCTGCCG GTCGAGGCGC GCCTCGACGT CTCGAGCCTT CCCCTGCCCT CCCCGGTCTC GAACACCGAG CCGGAGGCCT TTCTCGGCAT GGTGGCCAAG GCCAAGGAGT ACATCGTCGC GGGCGACATC TTCCAGGTGG TGCTGTCGCA GCGCTTCGAG GCGCCCTTCA CCCTGCCGGC CCTTGCGCTC TACCGCTCCT TGCGCCGCAC CAACCCGGCG CCGTTCCTGT GCTACCTCGA TTTCGAGGCG TTTCAGATCG TCTGCTCAAG CCCCGAGATC CTCGTGCGGG TGCGCGAGGG CAAGGTGACG ATCCGCCCGA TCGCCGGCAC GCGCCGCCGC GGCGCCACAC CCGCCGAGGA CCGGGCGCTC GCCGACGAAC TCCTGGCCGA CCCGAAGGAG CGCTCCGAGC ACCTGATGCT GCTCGATCTC GGCCGTAACG ATGTCGGCCG CGTCTCGAAG ATCGGCAGCG TCACCGTCAC CGACTCGTTC TTCCTCGAAT ACTACTCCCA GGTCATGCAC ATCGTCTCGA ACGTCGAGGG CGACCTCGAC CCGAGCCACG ACGCGCTCTC GGCGCTGGCC GCCGGCTTTC CCGCGGGCAC CGTCTCGGGC GCACCGAAGG TGCGGGCGAT GGAGATCATC GACGAGCTGG AGCACGAGAA GCGGGGCCCC TATGGCGGCT GCATCGGCTA TTTCGGTGCG CGCGGCGAGA TGGACACCTG CATCGTCCTG CGCACGGCTA TCGTGAAGGA CGGCCGCATG CATGTGCAGG CGGGTGCCGG CATCGTCTAC GATTCCGATC CGCACTCCGA GCAGCAGGAA TGCGTGAACA AGGCCAAGGC CCTGTTCCGC GCGGCGGAGG ATGCGGTGCA GTTCGCGAGC CGGGCCAAGC GCGGGCAGTA G
|
Protein sequence | MTEPAHEAVA RAYEAGRASL LGLTLVADLE TPVAAFLKLK AVHAGPGFLL ESVEGGAVRG RYSMIGLDPD LIWRCRDGRT EIARDAGLSD FVADERAPLA SLRALIAESA VGDDAESAAL PPMAAGLFGY LGYDMVRAME RLPEPNPDPL GVPDAILMRP RVMVVFDAVA DALTVVTPVR PTDGVSARAA LEAAQARLDR VAEALEGPLP VEARLDVSSL PLPSPVSNTE PEAFLGMVAK AKEYIVAGDI FQVVLSQRFE APFTLPALAL YRSLRRTNPA PFLCYLDFEA FQIVCSSPEI LVRVREGKVT IRPIAGTRRR GATPAEDRAL ADELLADPKE RSEHLMLLDL GRNDVGRVSK IGSVTVTDSF FLEYYSQVMH IVSNVEGDLD PSHDALSALA AGFPAGTVSG APKVRAMEII DELEHEKRGP YGGCIGYFGA RGEMDTCIVL RTAIVKDGRM HVQAGAGIVY DSDPHSEQQE CVNKAKALFR AAEDAVQFAS RAKRGQ
|
| |