Gene Mchl_5131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_5131 
Symbol 
ID7116169 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp5495978 
End bp5497498 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content70% 
IMG OID643527824 
Productanthranilate synthase component I 
Protein accessionYP_002423823 
Protein GI218533007 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.555304 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.114456 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGC CTGCGCACGA GGCCGTAGCG CGCGCCTATG AGGCCGGACG GGCAAGTCTG 
CTCGGCCTCA CCCTGGTGGC CGATCTGGAG ACGCCGGTTG CGGCCTTCCT CAAGCTGAAG
GCGGTTCATG CCGGCCCCGG CTTCCTGCTC GAATCGGTCG AGGGCGGCGC GGTGCGCGGG
CGCTACTCGA TGATCGGGCT CGACCCCGAC CTGATCTGGC GCTGCCGCGA CGGGCGCACC
GAGATCGCCC GCGACGCAGG CCTCTCAGAC TTCGTGGCCG ACGAGCGGGC GCCGCTCGCC
TCGCTGCGCG CCCTCATCGC TGAAAGCGCG GTCGGGGATG ATGCCGAAAG CGCCGCGCTG
CCGCCGATGG CCGCCGGCCT GTTCGGCTAT CTCGGCTACG ACATGGTCCG CGCCATGGAG
CGGCTGCCTG AGCCGAACCC CGATCCGCTC GGCGTGCCCG ACGCGATCCT GATGCGCCCG
CGGGTGATGG TGGTGTTCGA CGCGGTGGCC GACGCGCTCA CCGTCGTCAC CCCCGTGCGC
CCGACCGACG GCGTCAGCGC CCGCGCCGCG CTGGAAGCCG CGCAGGCCCG GCTCGACCGG
GTGGCGGAGG CCCTCGAAGG CCCGCTGCCG GTCGAGGCGC GCCTCGACGT CTCGAGCCTT
CCCCTGCCCT CCCCGGTCTC GAACACCGAG CCGGAGGCCT TTCTCGGCAT GGTGGCCAAG
GCCAAGGAGT ACATCGTCGC GGGCGACATC TTCCAGGTGG TGCTGTCGCA GCGCTTCGAG
GCGCCCTTCA CCCTGCCGGC CCTTGCGCTC TACCGCTCCT TGCGCCGCAC CAACCCGGCG
CCGTTCCTGT GCTACCTCGA TTTCGAGGCG TTTCAGATCG TCTGCTCAAG CCCCGAGATC
CTCGTGCGGG TGCGCGAGGG CAAGGTGACG ATCCGCCCGA TCGCCGGCAC GCGCCGCCGC
GGCGCCACAC CCGCCGAGGA CCGGGCGCTC GCCGACGAAC TCCTGGCCGA CCCGAAGGAG
CGCTCCGAGC ACCTGATGCT GCTCGATCTC GGCCGTAACG ATGTCGGCCG CGTCTCGAAG
ATCGGCAGCG TCACCGTCAC CGACTCGTTC TTCCTCGAAT ACTACTCCCA GGTCATGCAC
ATCGTCTCGA ACGTCGAGGG CGACCTCGAC CCGAGCCACG ACGCGCTCTC GGCGCTGGCC
GCCGGCTTTC CCGCGGGCAC CGTCTCGGGC GCACCGAAGG TGCGGGCGAT GGAGATCATC
GACGAGCTGG AGCACGAGAA GCGGGGCCCC TATGGCGGCT GCATCGGCTA TTTCGGTGCG
CGCGGCGAGA TGGACACCTG CATCGTCCTG CGCACGGCTA TCGTGAAGGA CGGCCGCATG
CATGTGCAGG CGGGTGCCGG CATCGTCTAC GATTCCGATC CGCACTCCGA GCAGCAGGAA
TGCGTGAACA AGGCCAAGGC CCTGTTCCGC GCGGCGGAGG ATGCGGTGCA GTTCGCGAGC
CGGGCCAAGC GCGGGCAGTA G
 
Protein sequence
MTEPAHEAVA RAYEAGRASL LGLTLVADLE TPVAAFLKLK AVHAGPGFLL ESVEGGAVRG 
RYSMIGLDPD LIWRCRDGRT EIARDAGLSD FVADERAPLA SLRALIAESA VGDDAESAAL
PPMAAGLFGY LGYDMVRAME RLPEPNPDPL GVPDAILMRP RVMVVFDAVA DALTVVTPVR
PTDGVSARAA LEAAQARLDR VAEALEGPLP VEARLDVSSL PLPSPVSNTE PEAFLGMVAK
AKEYIVAGDI FQVVLSQRFE APFTLPALAL YRSLRRTNPA PFLCYLDFEA FQIVCSSPEI
LVRVREGKVT IRPIAGTRRR GATPAEDRAL ADELLADPKE RSEHLMLLDL GRNDVGRVSK
IGSVTVTDSF FLEYYSQVMH IVSNVEGDLD PSHDALSALA AGFPAGTVSG APKVRAMEII
DELEHEKRGP YGGCIGYFGA RGEMDTCIVL RTAIVKDGRM HVQAGAGIVY DSDPHSEQQE
CVNKAKALFR AAEDAVQFAS RAKRGQ