Gene Mlg_2359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2359 
Symbol 
ID4268457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2676055 
End bp2677287 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content71% 
IMG OID638127117 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_743189 
Protein GI114321506 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.128163 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.731078 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCGG AGCGCCAGGA GACCCTCACC CGCATCGGCC CCGGCACGAC CATGGGCGAG 
CTGCTGCGCC GCTACTGGTG GCCGCTGGCC GCCAGCGTGG AGGTGCGTCC CGGCCGGGCC
CTGGCCCGCC GGCTGCTGGG CGAGGACCTG GTGCTCTTCC GCACGCCGGC CGGCGGGCTG
GGGCTGATCG ACGAGCACTG CACCCACCGC GGCGCCTCCC TGCGGTGCGG CCACGTGGAT
GAGGAGGGCA TCGCCTGCCC CTACCACGGC TGGAAGTTCG ACCACCACGG CCATTGCCTG
GCCATGCCCG CCGAGCCGCG ACAGAAGCCG GCGCTGCTCC GGCGCGCGGC CACCCGGGGT
TTCCCGGTCC AGGAGCTGGG CGGCCTGGTG TTTGCCTACA TCGGGCCCGA TCCCGCGCCG
TTGCTGCCCC GCTACGACCT GCTGCTGCAG GACGATGCCC TGCGCGACAT CGGTGTCGCC
GAGTTGCCCT GCAATTGGCT GCAGATCATG GAGAACAGCG TCGACCCGGC GCACGTGGAG
TGGCTCCACG GCCACCACCT GGCCGGGGTG CGCAGCGAGC GCGGCGAGCC CGCCCCCACC
CAGTACCTTA AGCACCACCA GCGCATCGGG TTCGATGTCT TTGAGTACGG CATCATCAAG
CGGCGGATCG TGGCGGGGGG CAGCGAGGAG GACGAAGACT GGCGCACCGG CCATCCGCTG
ATCTTCCCCT GCGCCTTAAG GGTGGGCACC GGCAACCAGC ACCGCTTCCA GTTCCGCGTG
CCGATGGATG ACACCCACAC CCGCCACTAC TGGTATGCCT GCTACCTCCC GCCGGAGGGC
CGCAGCGCGC CGCCCCAGCG CGAGATCCCC CTTTACCCGG TGCCCTGGCG GGATGAGCAC
GGCGACTACA TCGTCGACTT CGTCGATGGC GGCGACATCA TGGTCTGGGT GAGTCAGGGC
GCCATCGCCG ACCGCACCCG CGAGCGGCTG GTGGCCTCGG ACAAGGGCAT CGTGCTCTAC
CGGCGCCTGC TGCTGGAGCA GGCGCAGCGG GTGGCCGACG GGCTGGATCC CATGGGCGTG
ATCCGCGATG AGGCGGAAAA CCGGGTCATC CGCTTCGCCC AAGAGCGGAA CAAGCTCGGC
GACGGCCGCC GGCTGCTGCG CGAGGCCATC GAGATGAGCC ACGTGCGCTA CAGCCCGCTC
AAAGAGCAGA TCATCGCGCT GCTCCAGCCC TGA
 
Protein sequence
MTPERQETLT RIGPGTTMGE LLRRYWWPLA ASVEVRPGRA LARRLLGEDL VLFRTPAGGL 
GLIDEHCTHR GASLRCGHVD EEGIACPYHG WKFDHHGHCL AMPAEPRQKP ALLRRAATRG
FPVQELGGLV FAYIGPDPAP LLPRYDLLLQ DDALRDIGVA ELPCNWLQIM ENSVDPAHVE
WLHGHHLAGV RSERGEPAPT QYLKHHQRIG FDVFEYGIIK RRIVAGGSEE DEDWRTGHPL
IFPCALRVGT GNQHRFQFRV PMDDTHTRHY WYACYLPPEG RSAPPQREIP LYPVPWRDEH
GDYIVDFVDG GDIMVWVSQG AIADRTRERL VASDKGIVLY RRLLLEQAQR VADGLDPMGV
IRDEAENRVI RFAQERNKLG DGRRLLREAI EMSHVRYSPL KEQIIALLQP