Gene Mlg_0300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0300 
Symbol 
ID4270760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp341884 
End bp342969 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content71% 
IMG OID638125026 
Productcytochrome oxidase assembly 
Protein accessionYP_741145 
Protein GI114319462 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1612] Uncharacterized protein required for cytochrome oxidase assembly 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGCTC GCGCGTTTCA GCGCCTGGCG CTGCTGCTCA CCGTCTGGAC CTTCATGGTG 
GTGGTGCTCG GGGCCACCGT GCGGCTGATG GACGCCGGCC TGGGTTGCCC GGACTGGCCC
GGCTGCTACG GTCGGCTGGT GGTGCCGCAG AGTGAGCCGG CGATCAGTCA GGCCAATCTG
GCCTTTCCCG AACGGCCGGT GGAGGTGGCC AAGGGGTGGT GGGAGATGGC CCATCGCTAC
GTGGCCGGCC TGCTGGGCTT GATGATCCTG GCGCTGGCGG TGCTCGCTTG GCGCCGTCGC
CAGGACCCGT GGCAGCCCGT GGCCCTGCCT GTGGCGGTGG TGTTTGTGGT GCTCTTCCAG
TCGGTCCTCG GGGCCTGGAC CGTCACCTGG CAGTTGAAGC CCATCGTGGT GGTGGCCCAC
CTGTTGGGGG GGCTCGCGGT GCTCGCGTTG CTCTGGTGGA CCTACCTGCG CAGCCGCCGG
GTCGGCCTGG CCGCCGGCGC GCCGGCGGCC GGACCTGCGA TGCGGGCGGG TGCACTGGTG
GTGCTGGCCG CCGTGGTGGG GCAGGTTGCC CTGGGCGGCT GGGTGAGCGC TAATTATGCC
GCGCTGGCCT GTACCGATTT CCCCACCTGT AACGGTTCTT GGTGGCCGCG GGCCGATTTC
GCCGAGGCCT TCGTGCTCTG GCGGGGCCTG GGCGTCAATT ACGAGTTCGG CATCCTGGAC
AACCCGGCGC GGGTGGCCAT TCAGCTGAGC CACCGGATCG GCGCGGTGGT GGTGGTGGGC
CTGGTGGTGG CGTTTGCCAT TGGCCTGTTG CGCGCCAGTT CCCACCGTGC GCTGCGCCGT
GGGGCCTGGC TGCTGTTGGC ACTGACCCTG GCCCAGTTTG CGCTGGGGGC GGCCAATGTG
CTGCTGTCCC TGCCCCTAGG GCTGGCGGCG GCGCATACGG CGGGCGCTGC GTTGCTACTG
TTGGCGACGG TGCACGTGAC CCATCTGCTT TTCGCCCCCA CCGCCGAGGC TGCGGCGGTG
GGGGGCCGCT CCCGGTCGTC GCCGGCCCCT GCCGGCGGTG CGGTCCTGGA GCAGCCATCC
GGTTAG
 
Protein sequence
MGARAFQRLA LLLTVWTFMV VVLGATVRLM DAGLGCPDWP GCYGRLVVPQ SEPAISQANL 
AFPERPVEVA KGWWEMAHRY VAGLLGLMIL ALAVLAWRRR QDPWQPVALP VAVVFVVLFQ
SVLGAWTVTW QLKPIVVVAH LLGGLAVLAL LWWTYLRSRR VGLAAGAPAA GPAMRAGALV
VLAAVVGQVA LGGWVSANYA ALACTDFPTC NGSWWPRADF AEAFVLWRGL GVNYEFGILD
NPARVAIQLS HRIGAVVVVG LVVAFAIGLL RASSHRALRR GAWLLLALTL AQFALGAANV
LLSLPLGLAA AHTAGAALLL LATVHVTHLL FAPTAEAAAV GGRSRSSPAP AGGAVLEQPS
G