Gene Mlg_0294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0294 
Symbol 
ID4269324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp337041 
End bp338621 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content63% 
IMG OID638125020 
Productcytochrome-c oxidase 
Protein accessionYP_741139 
Protein GI114319456 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID[TIGR02891] cytochrome c oxidase, subunit I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00647712 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTATCA CGGTCTCCGA ACTGGACTAC GACAACCACA AGCCCCAGGG GTTTCTTGAC 
CGCTGGGTCC TGACCACCAG TCACAAGGAC ATCGGCACCC TCTACCTGGT CTTCTCTCTG
ACCATGTTCT TTGTCGGTGG GGCCATGGCC ATGGTCATCC GCGCCGAGCT GTTTCAGCCC
GGCCTGAGCC TGGTGGACCC GCACTTTTTC AACCAGATGA CCACCATGCA CGCCCTGGTG
ATGATCTTCG GGGCGGTCAT GCCGGCCTTC GTGGGCTTGG CCAACTGGTT GATCCCGATG
CAGATCGGGG CGCCGGACAT GGCGCTGCCG CGGATGAACA ACTGGAGCTT CTGGCTGCTG
CCGTTCGCCT TCCTGATGTT GCTGGCAACC CTGATCATGC CCGGCGGCGG CCCCGCTTCG
GGGTGGACGC TCTACCCGCC CTTATCGCTG CAGACGGGCA TGGCCCTGCC CTTCCTGATC
TTCGCCATCC ACGTGGCCGG GCTGTCCTCC ATCATGGGCG CCATAAACAT CATCGTCACC
ATCCTCAACC TGCGGGCGCC GGGGATGACG CTGATGAAGA TGCCCCTGTT CGTCTGGACT
TGGCTGATCA CCGCCTTCCT GCTGATCGCG GTGATGCCGG TGCTGGCCGG TGCGGTGACC
ATGCTGCTCA CCGACGCCTA CTTCGGCACC AGCTTCTTCT CGGCCGCCGG CGGCGGTGAC
CCGGTAATGT ACCAGCACAT CTTCTGGTTC TTCGGGCACC CCGAGGTCTA CATCATGATC
CTGCCGGCCT TCGGCATCGT TTCGGCGATC ATCCCGGCCT TCGCCCGCAA GCCGCTGTTC
GGCTATGCCT CCATGGTCTA CGCGACCGCC GCCATCGCCT TCCTGTCGTT CATCGTCTGG
GCGCACCACA TGTTCACGGT GGGAATGCCC ACGGCCGGTG TGCTGTTCTT CATGTACGCC
ACCATGCTGA TCGCGGTGCC CACCGGGGTG AAGGTGTTCA ACTGGGTGGC CACCATGTGG
CGGGGCGCCA TGACCTTCGA GACCCCCATG CTGTTCAGCA TCGCCTTCGT GGTGCTGTTC
ACCATCGGCG GGCTCTCCGG GATCATGCTG GCCATCGCCC CGGCGGACTT CCAGTACCAC
GATACCTACT TTGTGGTGGC GCACTTCCAC TACGTGCTGG TCACCGGCTC GGTGTTTGCC
ATCTTCGCCG CCGTCTACTA CTGGATTCCC AAGTGGACGG GGGTGATGTA CGACGAGTTC
CTGGGCAAGG TGCACTTCTG GCTGTCGGTG GTCTCGGTGA ACGTGCTCTT CTTCCCGCAG
CACTTCCTGG GGCTGGCCGG CATGCCGCGG CGCACCGCCG ATTACGCCCT GCAGTTCGCC
GAGTGGAACA TGATCTCCTC CATCGGGGGC TTCGCCTTCG GCGTCAGCCA ACTGCTGTTC
CTGTATATCG TCATCAAGTG CATGCGCGGC CAGGGTGAGC CGGCCCCGGC CCGCTCCTGG
GAAGGGGCCG AGGGGCTGGA GTGGGAGCTG CCCTCGCCGG CCCCGCACCA CACGTTCTCC
AAGCCGCCGG TGATCAAATG A
 
Protein sequence
MSITVSELDY DNHKPQGFLD RWVLTTSHKD IGTLYLVFSL TMFFVGGAMA MVIRAELFQP 
GLSLVDPHFF NQMTTMHALV MIFGAVMPAF VGLANWLIPM QIGAPDMALP RMNNWSFWLL
PFAFLMLLAT LIMPGGGPAS GWTLYPPLSL QTGMALPFLI FAIHVAGLSS IMGAINIIVT
ILNLRAPGMT LMKMPLFVWT WLITAFLLIA VMPVLAGAVT MLLTDAYFGT SFFSAAGGGD
PVMYQHIFWF FGHPEVYIMI LPAFGIVSAI IPAFARKPLF GYASMVYATA AIAFLSFIVW
AHHMFTVGMP TAGVLFFMYA TMLIAVPTGV KVFNWVATMW RGAMTFETPM LFSIAFVVLF
TIGGLSGIML AIAPADFQYH DTYFVVAHFH YVLVTGSVFA IFAAVYYWIP KWTGVMYDEF
LGKVHFWLSV VSVNVLFFPQ HFLGLAGMPR RTADYALQFA EWNMISSIGG FAFGVSQLLF
LYIVIKCMRG QGEPAPARSW EGAEGLEWEL PSPAPHHTFS KPPVIK