Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2155 |
Symbol | |
ID | 4270150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2450338 |
End bp | 2452149 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638126911 |
Product | hypothetical protein |
Protein accession | YP_742987 |
Protein GI | 114321304 |
COG category | [S] Function unknown |
COG ID | [COG4655] Predicted membrane protein |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.230042 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.0266885 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGAGAC TGCAAGGACA GGGAATCGGT CCGGGGCGGC AGCGGGGTGC GATCGGCCTG GCGGCGGTCC TCCTGCTGGT GGTGGTGGTC GTGTTCCTGG CCCTGGCTCT GGATGCCGGC CGGCTCTATA TGGAGCAGCG GAACCTGCAG CGGATCGCCG ATGCGACGGC CCTGGAAACG GCCTGGAAGC ATACCGGCTG CACGGCGGAT CCTGCATCGG CGCTACAGAC GGCGCAGGCG GTGGCGGAGC GCAACGGCTA CCAGGGTGAT GACCTCGTCA TAGGAGCGCA GGGGCTGTTG CTGGGCCGTC TGGTGGAGGA CGGCGTCCTC CGGGTGTTCG AGACCATGCC GGATGTAGGC CATCACGGGG TGGTGCCGGA GGCGGCGCGG GTGCACGTGG AGCACGAGGT GCCGCAGAGC TTGGTCCTCG GGGGGCTGTT TGGCCAGCAG GCCACCCTGA GCGCCGAGGC GGTGGCCCGG CGCATGCCGC TGGTGGGGAT CTCCGCCGGC TCCTGGGCCG CGCGGGTGGA CACCGAGAAC TCCCCCCTGC TCAATGCCCT GCTTAACGGT CTGCTGGGGA CCAACCTGCA GTTGGACGCG GTGGCCTTCG CCGGCCTGGT GGATACCTCA GTGACGCTGC TCCAGTTGGC TCAGGACCTG GCGGTACTGG GGGTGGATCT CAGTGTCGCG ACGGTCGATG AGCTGCTGTC CGCAAATGTG CGTTTGCTGG ATGTGCTGGA GGCAGCCGTC CGGGCGGTGG AACGGGAGGG TGTCCTCGAC GTGAACGCCT CGGTCCTGCG CAACCAGCTC CTGAACATCG GGGTGGAAAA CCTGGAGCTG CAGCTTGCCG ACATCCTCCA GGTCCAGGCC CCCTCCATGG ATCCGGACGC AGCCCTGGAC GCCCAAGTCA ATGTGCTGGA CCTGATCATG ACCACGGCGA TGACCGCAAC CCGGGACCAC GCCGTGGAGC TTGATGTTCT CCTGCCACTG AGTGATCTGA ATCTGCTGAA CCTGGTAGAT GTTGACGCCC GGGTTAAGGC CACGATTGTC GAGCCGCCGC AGATTGTCAT CGGCCCGCCC GGCCGGGGCC CCGACGGGGA GTGGCGGACC ATTGTGGATA CCGCGCAGGT GCGCCTCCAG GCCGCCGCGG ACCTATCATT GAATGTGGGT ATCGCAGCGG TCGACGTCGA CCTGGGGGTG GCGCTGCAGG CGGCGCAAGG GAGCGCCTGG GTTGAGGGGG TTGGCTGTCC TCCGGACACC CCCGGGGCCA CGGAGGTGGC GGTCGGCACT CTGCCCGGGG TTGCTAATCT GGAGCTGGGG GAGTTTGATG ACATCGCTGT CTCCGACCCG TCGGTGTTAC CGGTGGCGGT CGAGGTGAGG GCCTTGGGGA TTCACATTGC CACCCTGGCG CTCGCTGCGA ACGCGCCGAT TCAGCCTGCT GCCGGCGAAA CGCTTCATTT CCTGGTTGAG GACCGGGCGG CGTTGCCTAC AGAGGTGCAA TCCGTCGCCA GCGGCTTGGG CGGAGCACTG GCGAACGGGT TACAGACCTT GGGCGAGAGT ATCGATGTGG AGATCACCCT GGTCGAGGAT TTGGGAGTGC TCGCAACGTT GCTGGGACTG ACCACGGCCG TGGTGGAAGC GCTGGTTAAT GAAGTGGTTG CGATCTTGCT GAGCCTGGTA TTACCCCTCG TGCTGCAGCT ATTAGGGAGC GTCATTCTGG AGCCGCTGCT GAGCATGCTT GGCGTTGGCG TGGGAGGGTT AGACGTCCAA GTGGTGGAGC TGCTTGAGGG CGGCGTCGAT CTGGTCCGAT GA
|
Protein sequence | MVRLQGQGIG PGRQRGAIGL AAVLLLVVVV VFLALALDAG RLYMEQRNLQ RIADATALET AWKHTGCTAD PASALQTAQA VAERNGYQGD DLVIGAQGLL LGRLVEDGVL RVFETMPDVG HHGVVPEAAR VHVEHEVPQS LVLGGLFGQQ ATLSAEAVAR RMPLVGISAG SWAARVDTEN SPLLNALLNG LLGTNLQLDA VAFAGLVDTS VTLLQLAQDL AVLGVDLSVA TVDELLSANV RLLDVLEAAV RAVEREGVLD VNASVLRNQL LNIGVENLEL QLADILQVQA PSMDPDAALD AQVNVLDLIM TTAMTATRDH AVELDVLLPL SDLNLLNLVD VDARVKATIV EPPQIVIGPP GRGPDGEWRT IVDTAQVRLQ AAADLSLNVG IAAVDVDLGV ALQAAQGSAW VEGVGCPPDT PGATEVAVGT LPGVANLELG EFDDIAVSDP SVLPVAVEVR ALGIHIATLA LAANAPIQPA AGETLHFLVE DRAALPTEVQ SVASGLGGAL ANGLQTLGES IDVEITLVED LGVLATLLGL TTAVVEALVN EVVAILLSLV LPLVLQLLGS VILEPLLSML GVGVGGLDVQ VVELLEGGVD LVR
|
| |