Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA1834 |
Symbol | |
ID | 3103133 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 1962237 |
End bp | 1963307 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637170993 |
Product | chloromuconate cycloisomerase, putative |
Protein accession | YP_114271 |
Protein GI | 53803900 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATCG CCGACATCCA GGTGCGGACC GAACATTTTC CGCTGACGCG TCCCTACCGC ATCGCATTCC GCTCGATCGA GGAAATCGAC AACCTCATCG TCGAAATCAG GACCGCCGAC GGACTGCTCG GACTGGGCGC CGCCTCGCCC GAACGGCACG TCACCGGCGA AACCCTGGAG GCCTGCCACG CCGCTTTGGA TCATGATCGT CTCGGGTGGC TGATGGGCCG GGACATCCGG ACCCTGCCGC GGCTGTGCCG GGAACTCGCC GAACGGCTGC CTGCCGCGCC GGCCGCCCGC GCCGCTCTCG ACATGGCGCT GCACGATCTG GTGGCCCAGT GTCTCGGCCT GCCCCTGGTC GAAATTTTGG GACGCGCCCA CGACAGCTTG CCGACCTCGG TCACGATCGG CATCAAGCCG GTCGAAGAAA CGCTGGCCGA GGCGCGCGAA CATCTGGCGC TCGGCTTCCG GGTTCTCAAG GTCAAGCTTT GCGGCGACGA GGAGCAAGAC TTCGAACGCC TGCGCCGGCT GCACGAAACG CTGGCCGGGC GGGCCGTCGT ACGGGTCGAT CCCAATCAGA GCTACGATCG CGACGGCCTG CTCCGTCTGG ACCGGCTGGT GCAGGAACTC GGCATCGAGT TCATCGAACA GCCGTTCCCG GCAGGGCGAA CCGACTGGTT GCGGGCGCTC CCGAAAGCGA TACGGCGCCG GATCGCCGCC GACGAATCCC TGCTGGGCCC CGCCGATGCC TTCGCTTTGG CTGCACCGCC GGCCGCCTGC GGCATCTTCA ACATCAAGCT CATGAAGTGC GGAGGGCTGG CCCCGGCGCG GCGTATCGCG ACGATCGCCG AAACCGCCGG GATCGATCTG ATGTGGGGCT GCATGGACGA AAGCCGCATC AGCATCGCCG CCGCCCTGCA CGCCGCCCTC GCCTGCCCGG CCACCCGCTA CCTGGACCTG GACGGCAGCT TCGACCTGGC CCGCGACGTC GCCGAAGGCG GCTTCATCCT CGAGGATGGC CGGCTCCGGG TGACCGAACG GCCCGGCCTC GGACTCGTAT ACCCGGATTA G
|
Protein sequence | MKIADIQVRT EHFPLTRPYR IAFRSIEEID NLIVEIRTAD GLLGLGAASP ERHVTGETLE ACHAALDHDR LGWLMGRDIR TLPRLCRELA ERLPAAPAAR AALDMALHDL VAQCLGLPLV EILGRAHDSL PTSVTIGIKP VEETLAEARE HLALGFRVLK VKLCGDEEQD FERLRRLHET LAGRAVVRVD PNQSYDRDGL LRLDRLVQEL GIEFIEQPFP AGRTDWLRAL PKAIRRRIAA DESLLGPADA FALAAPPAAC GIFNIKLMKC GGLAPARRIA TIAETAGIDL MWGCMDESRI SIAAALHAAL ACPATRYLDL DGSFDLARDV AEGGFILEDG RLRVTERPGL GLVYPD
|
| |