Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0022 |
Symbol | |
ID | 4268879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 25408 |
End bp | 27252 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638124749 |
Product | cytochrome c biogenesis protein, transmembrane region |
Protein accession | YP_740871 |
Protein GI | 114319188 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4232] Thiol:disulfide interchange protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCAC TGGCCCGACC CCTGCCAGCG GCCACGCCGT TGCGCTGGCC GCTCTACTGG CTGCTGGTAT TGCTGCTCGC CCTGCCTCTG TGGGGTGCGG ATGCGCGCGC CAACCCGCTG GATCAACTGG GCGGGGGCCA GCAGCAAATC CTCCCCGCTG AGGAGGCCTT TCCCCTTTTT CTGGAGCGGA TCGGCGAGCA CCGCCTGCTG CTGACGGTGG ATGTGGCCGA GGGCTACTAC CTCTACCGTG AGAAGTTGGC GTTCGAGGTG CGTAACGGCG GCGTCGCGCT CACCGACGTG GCCATACCCG AAGGCCGGGA GAAAGACGAC CCCTTCTTCG GGCAAACCAC CATCCTCCGC GGCGTGGCGC AGGTGCAACT GCAGTTCGAC GGCCCGGTGC CCGACGATCT GCGTTTGGAG GTGGCCTACC GCGGTTGCGC CGATATCGGT GTCTGTTATC CACCGCTGAC CGCCGATCTG GGGCTCGATG GGCAGGGCGC TATTGCCAGT GGTGTCGGCG GCGGTGGCGG CGGGACGGCC GGCGGCGCCG AGGGCCGGCT CTCCAGCCTG CTTCAGGACG GCAGCGCCTG GATGATCCTC GGTGGCTTCT TCGTCGCCGG GCTGCTGCTG GCCTTCACCG CCTGTCTCTA CCCCATGATC CCCATCCTCT CCGGGCTGAT TGCCGGTGAC AGCCAGCGCG GCAGCGGGCG GGCGCTGCTG CTCTCTTTCG TTTATGTCCA GAGTGTGGCC ATTACCTATG CCCTTGCCGG GGCCGCGGCG GGCTTGACCG GCCGGGTGGT TCAGGCCGAG CTGCAGAATC TCTGGGTGTT GGGCGGGTTC TCATTGGTCC TGGTGTTGAT GGCTATGGCC ATGTTCGGCC TGTTCACCGT GCAGATGCCG GGTGTGCTCC AGAACCGACT GGACGCCCTG TCCCGCCGCC AGAAGGGCGG GCGGCTGCTG GGGGTGGCGG TCATGGGCAT GCTCTCGGCG CTGATTGTCG GGGCCTGCTC CGGCCCGGCA CTCATCGCCG CGCTCAGTTT CATCGGTACC ACCGGCGAGG TCGGTCTCGG GGCGCTGGCG CTCTACGTGA TGGCGCTGGG CATGGGGCTG CCGCTGCTGC TCATTGGGAC CGCCGCCGGC AAGTGGCTGC CGCGCTCCGG GGCCTGGATG GACGGCGTCC GCCAACTGTT CGGTTTCGTG CTGCTGGGCG TCGCCATCTG GATGATTGAG CGGTTGCTGA GCGATGCCTT TGCCCTCGCC CTCTGGGGCC TGCTGCTGAT CGCCTTTGCC GTCTGGGTCG GCTGGCGTCT GCGCGGTGGC CGGCTGCTGT GGCGCTGGAG CGGCCGTGCA GTGGGGCTGG CGGCGCTGCT TTGGGGCGCC GCCGCCCTGG TGGGGGCCGG GACCGGCGCC CATCAGCTCA GCCAGCCCCT GGCGGGGCTG AGCGGGGAGT CCCGGACGGA ACTTGCCTGG GTCGATGCGC ACAGCCTGGA CGAGTTGGAT GCCCTGCGCG AGCAGGCGCA GGCCGAGGGC CGGCCGGTGA TGCTCGATGT CACTGCCGAT TGGTGTATCT ACTGCCAACA GCTCAAGGAC CGGACCTTCC CTGACCCCGG CGTGCAGGCG GCGCTGAGCG ATGCAAAGCT GGTGCGGGTG GATGTCACCG CCATGGACGA TGCCGAGCGC GCGCTACTGG AGGCGCTGGG TGTTTTCCTG CCACCGGCCA TCATCTTCTA CCGCGCCGAC GGCAGCGAGG CTCGCGGCCA GCGGGTCTCG GGTTTTCTCG GGGCCGAGGA GTTTCGGCAG CGGGTGGATG CCGCCCTGAA CGGTGAGCCG GGGCCGCGGG GATGA
|
Protein sequence | MTALARPLPA ATPLRWPLYW LLVLLLALPL WGADARANPL DQLGGGQQQI LPAEEAFPLF LERIGEHRLL LTVDVAEGYY LYREKLAFEV RNGGVALTDV AIPEGREKDD PFFGQTTILR GVAQVQLQFD GPVPDDLRLE VAYRGCADIG VCYPPLTADL GLDGQGAIAS GVGGGGGGTA GGAEGRLSSL LQDGSAWMIL GGFFVAGLLL AFTACLYPMI PILSGLIAGD SQRGSGRALL LSFVYVQSVA ITYALAGAAA GLTGRVVQAE LQNLWVLGGF SLVLVLMAMA MFGLFTVQMP GVLQNRLDAL SRRQKGGRLL GVAVMGMLSA LIVGACSGPA LIAALSFIGT TGEVGLGALA LYVMALGMGL PLLLIGTAAG KWLPRSGAWM DGVRQLFGFV LLGVAIWMIE RLLSDAFALA LWGLLLIAFA VWVGWRLRGG RLLWRWSGRA VGLAALLWGA AALVGAGTGA HQLSQPLAGL SGESRTELAW VDAHSLDELD ALREQAQAEG RPVMLDVTAD WCIYCQQLKD RTFPDPGVQA ALSDAKLVRV DVTAMDDAER ALLEALGVFL PPAIIFYRAD GSEARGQRVS GFLGAEEFRQ RVDAALNGEP GPRG
|
| |