Gene Mlg_2454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2454 
Symbol 
ID4270195 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2789078 
End bp2790343 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content63% 
IMG OID638127212 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_743284 
Protein GI114321601 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000486629 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.341461 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCCG ATCCCAAGGT CATCAACGAC AAGCTGGCCA AGCGCATGTA CCAGCCCTAC 
CTGGAAGCCG AGTGGGGTTT CATCAATCAC TGGTACCCGG CCCTGTTCAC CCACGAACTG
GAGGAGGGTG ATACCAAGGG CATCCAGATC TGTGGTGTGC CCATCGTTCT GCGTCGTTCC
AAGGGCAAGG TCTACGCCCT GAAGGACCAG TGCATCCACC GTGGCGTGAA GCTTTCGGCC
AAACCCATGT GCCTTACCGA TGACACCATC ACCTGCTGGT ACCACGGCTT CACCTTCGAC
CTGGCCTCCG GCAAGCTGGT CTCCATCGTG GCCGCGCCCG ATGACGAGAT CATCGGCACC
ACCGGCGTTC AGACGTTTGC CGTCGAGGAG CACAGCGGCA TGATCTTCGT GTTCGTCTGC
GACGAGGACT GGGACGAGGA CGTGCCCCCG CTGGCCGCGG ACCTGCCGCT GCGTTATCCG
GAGAACAACG AGCGTTTCCC GCACCCCTAC TGGCCCGATA CCCCCAGCGT GCTGGACGAG
CACTCCGTTG CCCTCGGTAT CCACCGCAAG GGGTACGCCA ACTGGCGACT GGCGGCCGAG
AACGGCTTTG ATCCGGGCCA CCTGCTGATC CACAAGGACA ACGCCATCGT GCACGCCCGT
GACTGGGCGC TGCCGTTGGG GGTGAAACCG GTCACCGACC AGGCCATCGC GCTGATCGAG
GACGACAACG GCCCCAAGGG CTTCCTGAAC CGGTACTACA CGGACCACTA CGAACCGATC
CTGGAGAACG AAAAGCTGGG TGTGAAGGCG CAGGGCACCG TGCCCCGCTA CTTCCGCACC
TCCATGTACC TGCCCGGCGT GCTCATGGTG GAGAACTGGC CGGAGGACCA TGTGGTGCAG
TACGAGTGGT ACGTGCCCAT TACCGACGAC ACCTATGAGT ACTGGGAGGT GCTGGTCAAG
CACTGCAAGG ACGAGCAGGA GCGCAAGGAC TTCGAGTACC GCTTTGAGAA CCTCTACAAG
CCCATGTGCC TGCACGGCTT CAACGACTGC GACCTGTTCG CCCGCGACGC CATGCAAAAC
TTCTACGCCG ATGGCACCGG CTGGAACGAG GAGCAGCTGG CCGACATGGA CGCCTCGGTG
GTGACCTGGC GCAAGATCGC CTCCCGTCAC AACCGCGGTC TGGCCCGCAA GCCCAAGGGC
GTGCCGGGCG TGCTCAAGGA CCAGAGCTAC CGGTTTGCCG AGGCCTCTGA AGGCGCCTTC
GAGTAA
 
Protein sequence
MAADPKVIND KLAKRMYQPY LEAEWGFINH WYPALFTHEL EEGDTKGIQI CGVPIVLRRS 
KGKVYALKDQ CIHRGVKLSA KPMCLTDDTI TCWYHGFTFD LASGKLVSIV AAPDDEIIGT
TGVQTFAVEE HSGMIFVFVC DEDWDEDVPP LAADLPLRYP ENNERFPHPY WPDTPSVLDE
HSVALGIHRK GYANWRLAAE NGFDPGHLLI HKDNAIVHAR DWALPLGVKP VTDQAIALIE
DDNGPKGFLN RYYTDHYEPI LENEKLGVKA QGTVPRYFRT SMYLPGVLMV ENWPEDHVVQ
YEWYVPITDD TYEYWEVLVK HCKDEQERKD FEYRFENLYK PMCLHGFNDC DLFARDAMQN
FYADGTGWNE EQLADMDASV VTWRKIASRH NRGLARKPKG VPGVLKDQSY RFAEASEGAF
E