Gene MCA1834 details


Gene Information

Locus tag: MCA1834
Symbol:
ID: 3103133
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 1962237
End bp: 1963307
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 68%
IMG OID: 637170993
Product: chloromuconate cycloisomerase, putative
Protein accession: YP_114271
Protein GI: 53803900
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
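
The coordinate and length fields in this record agree with each other, which a short Python check can illustrate. This is only a minimal sketch: it assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Start bp and End bp fields of this record.
start_bp = 1962237
end_bp = 1963307

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1071 356
```

Both results match the Gene Length (1071 bp) and Protein Length (356 aa) fields.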


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGATCG CCGACATCCA GGTGCGGACC GAACATTTTC CGCTGACGCG TCCCTACCGC 
ATCGCATTCC GCTCGATCGA GGAAATCGAC AACCTCATCG TCGAAATCAG GACCGCCGAC
GGACTGCTCG GACTGGGCGC CGCCTCGCCC GAACGGCACG TCACCGGCGA AACCCTGGAG
GCCTGCCACG CCGCTTTGGA TCATGATCGT CTCGGGTGGC TGATGGGCCG GGACATCCGG
ACCCTGCCGC GGCTGTGCCG GGAACTCGCC GAACGGCTGC CTGCCGCGCC GGCCGCCCGC
GCCGCTCTCG ACATGGCGCT GCACGATCTG GTGGCCCAGT GTCTCGGCCT GCCCCTGGTC
GAAATTTTGG GACGCGCCCA CGACAGCTTG CCGACCTCGG TCACGATCGG CATCAAGCCG
GTCGAAGAAA CGCTGGCCGA GGCGCGCGAA CATCTGGCGC TCGGCTTCCG GGTTCTCAAG
GTCAAGCTTT GCGGCGACGA GGAGCAAGAC TTCGAACGCC TGCGCCGGCT GCACGAAACG
CTGGCCGGGC GGGCCGTCGT ACGGGTCGAT CCCAATCAGA GCTACGATCG CGACGGCCTG
CTCCGTCTGG ACCGGCTGGT GCAGGAACTC GGCATCGAGT TCATCGAACA GCCGTTCCCG
GCAGGGCGAA CCGACTGGTT GCGGGCGCTC CCGAAAGCGA TACGGCGCCG GATCGCCGCC
GACGAATCCC TGCTGGGCCC CGCCGATGCC TTCGCTTTGG CTGCACCGCC GGCCGCCTGC
GGCATCTTCA ACATCAAGCT CATGAAGTGC GGAGGGCTGG CCCCGGCGCG GCGTATCGCG
ACGATCGCCG AAACCGCCGG GATCGATCTG ATGTGGGGCT GCATGGACGA AAGCCGCATC
AGCATCGCCG CCGCCCTGCA CGCCGCCCTC GCCTGCCCGG CCACCCGCTA CCTGGACCTG
GACGGCAGCT TCGACCTGGC CCGCGACGTC GCCGAAGGCG GCTTCATCCT CGAGGATGGC
CGGCTCCGGG TGACCGAACG GCCCGGCCTC GGACTCGTAT ACCCGGATTA G
 
Protein sequence
MKIADIQVRT EHFPLTRPYR IAFRSIEEID NLIVEIRTAD GLLGLGAASP ERHVTGETLE 
ACHAALDHDR LGWLMGRDIR TLPRLCRELA ERLPAAPAAR AALDMALHDL VAQCLGLPLV
EILGRAHDSL PTSVTIGIKP VEETLAEARE HLALGFRVLK VKLCGDEEQD FERLRRLHET
LAGRAVVRVD PNQSYDRDGL LRLDRLVQEL GIEFIEQPFP AGRTDWLRAL PKAIRRRIAA
DESLLGPADA FALAAPPAAC GIFNIKLMKC GGLAPARRIA TIAETAGIDL MWGCMDESRI
SIAAALHAAL ACPATRYLDL DGSFDLARDV AEGGFILEDG RLRVTERPGL GLVYPD
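
The relationship between the gene sequence and the protein sequence can be sketched in Python as follows. This is an illustrative sketch, not the tool that produced this record. The helper names `translate` and `gc_fraction` are hypothetical. The codon table is written in the NCBI layout (bases ordered T, C, A, G); for translation table 11 the amino-acid assignments are the same as in the standard code. Alternative start codons, which table 11 permits, are not handled here; this gene starts with ATG, so that does not matter for this example.

```python
# Codon table in NCBI layout: first, second, third base each cycle T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(dna):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon: translation ends here
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGAAGATCG CCGACATCCA GGTGCGGACC"
print(translate(first_blocks))  # prints: MKIADIQVRT
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would be one way to compare against the listed protein sequence and the GC content field.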