Gene Mlg_2602 details

Gene Information

Locus tag: Mlg_2602
Symbol:
ID: 4269235
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2948461
End bp: 2949639
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 70%
IMG OID: 638127361
Product: methylitaconate delta2-delta3-isomerase
Protein accession: YP_743432
Protein GI: 114321749
COG category: [S] Function unknown
COG ID: [COG2828] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR02334] probable AcnD-accessory protein PrpF
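
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2948461
end_bp = 2949639
gene_length_bp = 1179
protein_length_aa = 392

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is excluded from the protein.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length == gene_length_bp)        # True
print(computed_protein_length == protein_length_aa)  # True
```

Under those assumptions the reported lengths agree with the coordinates.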


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000005222
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0000122233
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCACACA CCCCACAAAT CCGCATCCCC GCCACCTACA TGCGCGGCGG TACCAGCAAG 
GGCGTCTTTT TCCGCCTGGA TGATCTGCCC GAGGCGGCCC AGCAGCCGGG CGAGGCCCGG
GACAAACTGC TGCTCCGGGT CATCGGCAGC CCCGACCCCT ACGGCAAGCA CACCGACGGC
ATGGGCGGGG CCACCTCCAG CACCAGCAAG ACGGTGATCC TCTCCAGGAG CGACAGCCCC
GACCACGACG TGGACTACCT CTTCGGCCAG GTGGCCATCG ACAAGCCCTT CGTGGACTGG
AGCGGCAACT GCGGCAACCT CTCCGCCGCC GTCGGCCCCT TCGCGGTCAG CAACGGCCTG
GTGGACCCGG CCCGCATCCC CGAGAACGGC GAGGTGGCCG TGCGCATCTG GCAGGCCAAC
ATCGGGAAGA CCATCGTCGC CCGGGTGCCC ATCACCAACG GCCAGGTGCA GGAGACCGGC
GACTTCGAAC TGGACGGGGT CACCTTCCCC GCCGCCGAGG TGCCGGTGGC CTTCATGGAC
CCAGCGGACG GGGGCGGCGC CATCTTCCCC ACCGGCAACC GGGTGGACGA CCTGAAGGTG
CCCGGGGTGG GGACGCTGAA GGCCACCCTC ATCAACGCCG GCATCCCCAC GGTCTTCGTT
AACGCGGACG AGATCGGCTA CACCGGCACC GAGCTGCAGG AGGCCATCAA CGGCGACCCC
AAGGCGCTGG AGATGTTCGA GACCATCCGC GCCCACGGGG CGGTGAAGAT GGGGCTCATC
ACTGACCCGG CCGAGGCGGC CGACCGCCAG CACACGCCGA AGGTGGCCTT CGTGGCCCCG
CCGGCCAGCT ACACCGCCTC CAGCGGCAAG GCCGTGCAAG CGGAGGACAT CGACCTGCTG
GTGCGCGCCC TCTCCATGGG CAAGCTCCAC CACGCCATGA TGGGCACCGC CGCCGTGGCC
ATCGGCACCG CCGCCGCCAT CCCCGGCACC CTGGTTAACG AGGCGGCCGG TGGCGGCGAT
CGCAACTCGG TCCACTTCGG CCACCCCTCC GGCACCCTGC GGGTCGGTGC GGAGGCCACC
CAGGAAAACG GCGAGTGGAC GGTGAAGCAG GCGATCATGA GCCGTAGCGC GCGGGTGCTC
ATGGAAGGCT GGGTCCGGGT ACCCGGCGAT AGCTTCTAA
 
Protein sequence
MAHTPQIRIP ATYMRGGTSK GVFFRLDDLP EAAQQPGEAR DKLLLRVIGS PDPYGKHTDG 
MGGATSSTSK TVILSRSDSP DHDVDYLFGQ VAIDKPFVDW SGNCGNLSAA VGPFAVSNGL
VDPARIPENG EVAVRIWQAN IGKTIVARVP ITNGQVQETG DFELDGVTFP AAEVPVAFMD
PADGGGAIFP TGNRVDDLKV PGVGTLKATL INAGIPTVFV NADEIGYTGT ELQEAINGDP
KALEMFETIR AHGAVKMGLI TDPAEAADRQ HTPKVAFVAP PASYTASSGK AVQAEDIDLL
VRALSMGKLH HAMMGTAAVA IGTAAAIPGT LVNEAAGGGD RNSVHFGHPS GTLRVGAEAT
QENGEWTVKQ AIMSRSARVL MEGWVRVPGD SF
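
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first row of the gene sequence and comparing it with the first 20 residues of the protein. This is a minimal sketch, not the tool used to produce the record. It relies on the fact that translation table 11 assigns codons to amino acids in the same way as the standard genetic code (the two differ only in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering (standard assignments,
# which translation table 11 shares).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the blocks-of-ten spacing and line breaks, and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence and first two blocks of the protein sequence.
first_row_dna = "ATGGCACACA CCCCACAAAT CCGCATCCCC GCCACCTACA TGCGCGGCGG TACCAGCAAG"
first_20_residues = clean("MAHTPQIRIP ATYMRGGTSK")

print(translate(first_row_dna) == first_20_residues)  # True
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.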