Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2400 |
Symbol | |
ID | 4269987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2726255 |
End bp | 2728150 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638127158 |
Product | hypothetical protein |
Protein accession | YP_743230 |
Protein GI | 114321547 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.000233655 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACTAA TACGTTCGTA TTGGGCCGCA TCATTTCATC CCCATTCACG TCCTTCGTTA CCGTTTTTTT CATTGCTGAC TTTCGTCATT CTGGGGCTGG CCCTGGCCGG GTGCAGCAGC TCCAGCTCAT CCGGTGACCC ATCGGATGAG GAGGATAACG GCTTTTCCGT CAGTGTGAAC GTCCGCGGGT TGGACTTGGC CATTGATCCC GGGACACCGC TGATACTGAA CGACCAGGCC GAGGAGGCGC TGACCATTGA GGCGGATGGC AGCTATGCAT TCTCCGCCGA GCTGGCGTCC GGCGACAGCT ATGGGGTCGA GGTGACGCAG CAGCCCAACG AACCGCGGCA ATTCTGCTTC ATCGAGCGTC AGCAGGCCGT CTCCGGCGAG ATCACCGGCG ATGTGACCCT CAACGTGGAC TGCGGCCTGG CCTACCACGG GCTGTTCCCC GGCTGGGGGG TGGGCGCGAC TCCGGACGAC GGCTCGCTGC GGCCCCTGCA GATCGCCCCC GCCACGCTGG TGGCCCTGCG GGTCGACGGC CAGGACGAGA TTGCCGTTGT CGATCCCTGC CAGGTGGAGA CCCGTGGTGA GGTGACCCAC GCGGACGGTA CCTACGACCT GTCAGTGACC GCGGCCACCG CCTGTGACCC CACCCTGTTC CTGAACACCA CCATGACGGA GGAGGCGTTC ACGGCGCAGA TCCCGGAGGG CATCGACGAG TTGCCCGACG AGCTCGAGCT GAGCCGTTTC GCCGGCCCCG ACCGCGGCCT GCCCACCGCG GCGCTGGGGC ATGCGGTGGC GGATGCCGAG GCGCAGGATC TGCTCCAGCA CAACCGCTGG CTGCTCGCCG GCCTCACCGC CCGCCAGGCG GACGCCATGG CCGCCGAGGA GCTGCACGGC ACGTGGGGCC TGGTGGCGCT CACCATGGAG CTGAACGACG AGGCTGAGCC CCAGTGGGTC CGCCACAGCA GCCTCAGTGT GCCGGCGATC ATCGGCAACG ACGGCAACGG AGATCTGCTC CTGAACGCGG ACGGCTGGCA GACGCTGACC GCCGCCGTGC AGGCCATTCA GGACGACCCG CCGCCTCCGC CGATCACCCA GGGCTTCACC CAGGGGGCAC CCGGCGATGG CTTCAGCGAC GTTCCGTTGA CCCTGAGCGC AGATGGCCGC TTGCAGGCCG GGGACAGCCA CGGTTTCGTC TCCGCCGATG GCGACTTCTT CGTGCTGATC CATTCCGAAC CCACCCCGGA ACAGGTCCGG GCGGACGAGG AGAACGGGCT GGGTGAGGGC CAGGGCGCCC ACCAGGTGCT GCTGGGCGTG CGGCGCGACG AGAACCTGGT CTCGCTGGAC GGGCGGACTT ACGCCCTGTT CGGCCCCTCC TGGTTCCTGG GCACGGAGGA GGACGGTGAC TCGCTGCAGG GCGTGGCCGA GTTCGAACTG GCGCCCTTCC TGGAGGGCAC GGCACTGAGC TTCTCCGAGG GCGAGGTGAC GCTGACCCTG GAGGAGGAGC AATGGATCGC GCCCTTCGGC GGCGGTGCCC TGGACATCGA CAGCGACTCC GACTCACTCG CGGGACTGCC CTATACCATT GGTGACAATG AAGGCGACAA GCCGCAACTG ATCGAGATCG ACCTGGGCGA TGGCTTCGGG CCCGATCGTG ACAACTACCT CACCGGCTAC GCCCACGGCA AGCTGCTGGT ACTCGCCCTG GGCGTCCGCG ACGGCACCCA GGAGGAGGAG GCGGCGAGTC TCAGTGCCGA GATCAACCTG CTCGGCCTCA TCACCCTGAG TGCCGACGCC ACGGTGAACG GCGACGATGA CGCCATTCCG GCGGAGGACG TGAGTGTGGG CACGCTGATC GGCATCTGCG TGCAGGGCTG CAACAACGAG TTCTGA
|
Protein sequence | MKLIRSYWAA SFHPHSRPSL PFFSLLTFVI LGLALAGCSS SSSSGDPSDE EDNGFSVSVN VRGLDLAIDP GTPLILNDQA EEALTIEADG SYAFSAELAS GDSYGVEVTQ QPNEPRQFCF IERQQAVSGE ITGDVTLNVD CGLAYHGLFP GWGVGATPDD GSLRPLQIAP ATLVALRVDG QDEIAVVDPC QVETRGEVTH ADGTYDLSVT AATACDPTLF LNTTMTEEAF TAQIPEGIDE LPDELELSRF AGPDRGLPTA ALGHAVADAE AQDLLQHNRW LLAGLTARQA DAMAAEELHG TWGLVALTME LNDEAEPQWV RHSSLSVPAI IGNDGNGDLL LNADGWQTLT AAVQAIQDDP PPPPITQGFT QGAPGDGFSD VPLTLSADGR LQAGDSHGFV SADGDFFVLI HSEPTPEQVR ADEENGLGEG QGAHQVLLGV RRDENLVSLD GRTYALFGPS WFLGTEEDGD SLQGVAEFEL APFLEGTALS FSEGEVTLTL EEEQWIAPFG GGALDIDSDS DSLAGLPYTI GDNEGDKPQL IEIDLGDGFG PDRDNYLTGY AHGKLLVLAL GVRDGTQEEE AASLSAEINL LGLITLSADA TVNGDDDAIP AEDVSVGTLI GICVQGCNNE F
|
| |