Gene Information |
Locus tag | Mlg_0266 |
Symbol | |
ID | 4270484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 301817 |
End bp | 305119 |
Gene Length | 3303 bp |
Protein Length | 1100 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638124991 |
Product | hypothetical protein |
Protein accession | YP_741111 |
Protein GI | 114319428 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
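The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values but are not stated in the record. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 301817
END_BP = 305119


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, stop_codons: int = 1) -> int:
    """Protein length in aa, assuming the CDS includes `stop_codons` stop codons."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - stop_codons


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3303 1100
```

Both results agree with the Gene Length (3303 bp) and Protein Length (1100 aa) fields.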
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0987529 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACT CCATCGCCCC CCAGGTCTGG CCCGCGGGAC CGCGCGGCCG CGACGACGAC CTCTTCCACG AAGACCCGCG CCCCCTGTGC GCAACGTTCC TGTGGGCGGA CATCGTCTAT CTCACCGGCT CCGGCGAGTT CTGGCTGCTC AACGCCACAG CCGCCGCCGC GATGCACCAC GCGGCCGACA AGCTGGCCGA CATCGCCGCC GTGGACGACC GCGACGAGCG CAACCGGCGG CTGTCCGAGG AGGCGGGCGT GCTGGACAGC TTCCTGCCCG CGCATCCGGT CAGTTTCCTG GGGGAAGCGG ACCGCCAACG GTTCGCGCAG ACCCTCCAGC AGCTCGCCGC CCTTCAGGAC GAGGCCCCGG ACACGCTGCT GCAACGGGTG GTGGACGGCG TGCCTTTCCA GGGCACCACC AGCGTGAGCA CCTCGCAATC CTGGTCCGGC GACCATTCAA TGCCTTCGGC CGTACCGACC CAATGCCCGG AACCGGCACG CATCCACGAG AGCAACGACC ATCTCGACGC ACTTCAGGCC CTTTATCAAC GCGGCCTGGA CAAGGCCGAG AAGGCCGGCT ATGTCGTCGA TAGCGCCCTG GTCCACGGCG ACAGCGAGGC GCGCATCCGC GAGGCGCTGC AGCGCTATCA CCGGCGGCGG GAGCTTGCTT TCCAGGGTGC TCGGTCCCAC CTGGAACAGG GTCAGGGCCT GCCACCGACG CGGCCGCTCC ATAAGATCCT GGAGCAATAC CGCCGGCATG TGGCGCTCTG CGACCAGGAC CCGGTACCAG AGGCCGTCGA ACGCTGCGAG ATCGCCTCCG TGATCGAGCA CTACATCCCG CAACTGGAGC AGGACTACCG CCACTACATC GACAGTCTCA TCGAGCTGGC CGGGCTGGGC GTGGCCACCC CCGAGCTGGC GCTGGCTGAG GACCCGGACG CCGGCTTCGC CGACGGCGTG GACTACGTGG CCCGCTACTT CGCGACCCTG GACGAACTCG ACGCCTTGCG CGAGGACGTG GACACGCGGC TCAGGGAGTG GGAACAGGGC ACGGGCCGGG CCACCCCGCT GCCCATCTTC CTGTTCACCG ACGAGCAGGC GCGGTTCGAC CGGCTGCGCG AGCGCATGGA CCGCCTCTAC CGCACGGCCC GGCGCCGGGT GGACCGCACG CGCCCGAGGC GCGTCCTGCA CTGGGACCTG GGCCCCGACG ACATCCGCGA CCCCGAACCC TACCGCCCGC CCCCGATCCA CCGGCTGGTG CGCGCCGACT TCCCCCTGCG CGAGTTCAGC GGCCCCGGTC GGCAGCGCAC CCTCGATCAC CTGAGCCTGC ACCAGCTGGG CGAGACCCGC CCCCACTACG CCCGCCAGCG CGATGCCGCC ATCGCGCACG ACAGCCGCAC GGTCACCGAG CCCCGCTCCC TGCCCGACAC GGCACTGACC GGGTGGCTCA CCCGACGCGG CTGCCGGCGG CTCGACTGGA ACCCCGACTG GCACAGCGAG CCGCTCGGGC TGTTCGAGCC GGAACGCTTC TTCCACGACC TCGACCACCA GGGCCTGGTC ATCGACCGCC TTGCCGACGA CAGCGCCCGG GAGGAGTGGG GCCGGCGCCT GCGCCGGATC CTGTTCGCCG ACCCGCTCAA CCACCCCATG CGCCTGTTCG ATGCCAGTGG CCCGGCGCAG CTCCTGCGCC TGCTGGCCGG CGCATACGCC GAGCCCGACC GGCGCGACCG GGCGCTGTCC GGGGAGGCAC CCCTGTGGCT GCGCCGGCCC GGACCCGTGG CGCAGGCCGA ACCGGAGACA 
GCCGCTTCCG GAAACGGCGC CCGCATCGGC CTGACCGCCC GCTACCAGGA CACCGCCGGC ATCGACACCG GGGGCAACGC CGGCGGCCGG ACGGTCAGTG TGGCCCCCAG CCTCGCTCTC GACGAGCTGG GCATCACCGC CACCGTGGCC CGCGGGCAGC ACGGCTCGCC GGTCGTGTTG CGATTGCGCG GCCAGCACGC CTTCGACTTC GGCCGTGGCG AGATCGCCCT CGCGCCCATC CAGCTGCCCG ATCCCGCCAA GGCGGAGCCG GTCATCGTCC CGTTCGGGCT GGAGGCGGAT CACCCGGACG CCCGCTCCAT CGGCCGCTAC TGCCTGCACA TCGAGCCCGT GCTCCACGGC CATGCCGCCG TGTCCGTGGC GCTGGGCGGC GGCGTCGGGC TCGACACCGC CGGGGGCCGT CTCGCCGTCA ACGGCCTCGC CCCCGTGGAG CGCGACGGCG TCGATGCCCG ACTGGAGGCC TTCGCCGGCA CCGGACTGGC CGGCCACAAC CACTGCCGTC TGCTCTGGCA GCCGCCGGCG AACCTGCTCG CGCGCCTGCC GCGCTACCAG GCCATGGCCG AGATCGACCG CGCGGGCTAC GCCCGGGACG AGGCCCGCCA GTGGAAAACC CTGACCGCCG CCGAGATCAA CCCCGAGGTG CGCGTCGGCG TCGGCGGCGA GGCCGCCTTC CGGCTCGGCC TGCACAACGG CCGCTTCGTG CTGCACGCCT CCCTGCGCCT GGTGCTCGGC GTCGGCGGCG GGGGCAGCGT GCGCCTGGCG CTTGACCCCC GCCACCTCGA CCTCTGGCTC GCCATGATGC ACCAGGCGCT GGTGGAGGTC GGCTACGAGC GCGTCGACTG GATCGACGAA GACGCCTTCG AGGAGATGAG CCGCCTGGCC TATCTCGCCG CCATCACCCT GGTCGAACCC GCCCTGCTCC TGCTGCGCGG CACCCACCGA CTGCGCCAGA TGATCGAGTG GTTTACCCGG GACCGGGACA TGGCCAGCCG GATCGCCTAC ACCATCGTCA ACGACCCGCA ACGGGACGCC ATTGCCGCCT GGGTGCGCCA GCTACCGCCC GAGGCCCTGG GGCCGCTGTT ATACACCCTG ACCAGTCGGC CGCAGGCGTT CGAGGTGGAG ATTCAGCGGG ATGGCCAGAA GCAAGTACAG CGGTTCGGGC GTGAACAGGC TCTTGTGTTC CACCAACGGG CGATCCTCAA CTGCCTGCAA TGGATCGTCT CCGGCGTCAT GGCCGGCGTC TACGGCCCCC GACGCGACTT CTCCGCAGAG CACCCGCACC CGGCGCAGAA GCTGTTTGAA AAGGCCGTGG TGCGCATGGC CCGAGACGGA CAGCCTACCG ACGAATCGAG GGCCGATGCG TATGCCGAGA ACCGAGGGCG GCTGGACAAT TTCATGTCAG CAGGCAGCGG ACAGCTGGAG CAACAGGACC GTCAATGGAA ATACAGACAG AATGCCGGCT GGCTTTCCCG TCACATTCAG TAG
|
Protein sequence | MTDSIAPQVW PAGPRGRDDD LFHEDPRPLC ATFLWADIVY LTGSGEFWLL NATAAAAMHH AADKLADIAA VDDRDERNRR LSEEAGVLDS FLPAHPVSFL GEADRQRFAQ TLQQLAALQD EAPDTLLQRV VDGVPFQGTT SVSTSQSWSG DHSMPSAVPT QCPEPARIHE SNDHLDALQA LYQRGLDKAE KAGYVVDSAL VHGDSEARIR EALQRYHRRR ELAFQGARSH LEQGQGLPPT RPLHKILEQY RRHVALCDQD PVPEAVERCE IASVIEHYIP QLEQDYRHYI DSLIELAGLG VATPELALAE DPDAGFADGV DYVARYFATL DELDALREDV DTRLREWEQG TGRATPLPIF LFTDEQARFD RLRERMDRLY RTARRRVDRT RPRRVLHWDL GPDDIRDPEP YRPPPIHRLV RADFPLREFS GPGRQRTLDH LSLHQLGETR PHYARQRDAA IAHDSRTVTE PRSLPDTALT GWLTRRGCRR LDWNPDWHSE PLGLFEPERF FHDLDHQGLV IDRLADDSAR EEWGRRLRRI LFADPLNHPM RLFDASGPAQ LLRLLAGAYA EPDRRDRALS GEAPLWLRRP GPVAQAEPET AASGNGARIG LTARYQDTAG IDTGGNAGGR TVSVAPSLAL DELGITATVA RGQHGSPVVL RLRGQHAFDF GRGEIALAPI QLPDPAKAEP VIVPFGLEAD HPDARSIGRY CLHIEPVLHG HAAVSVALGG GVGLDTAGGR LAVNGLAPVE RDGVDARLEA FAGTGLAGHN HCRLLWQPPA NLLARLPRYQ AMAEIDRAGY ARDEARQWKT LTAAEINPEV RVGVGGEAAF RLGLHNGRFV LHASLRLVLG VGGGGSVRLA LDPRHLDLWL AMMHQALVEV GYERVDWIDE DAFEEMSRLA YLAAITLVEP ALLLLRGTHR LRQMIEWFTR DRDMASRIAY TIVNDPQRDA IAAWVRQLPP EALGPLLYTL TSRPQAFEVE IQRDGQKQVQ RFGREQALVF HQRAILNCLQ WIVSGVMAGV YGPRRDFSAE HPHPAQKLFE KAVVRMARDG QPTDESRADA YAENRGRLDN FMSAGSGQLE QQDRQWKYRQ NAGWLSRHIQ
|
| |
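The Gene sequence field begins with ATG and ends with TAG, so it appears to be given in coding orientation even though the gene lies on the minus strand. Under that assumption, a minimal sketch of how the protein sequence and GC content could be recomputed from it is shown below. It uses only the standard library. The codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code (the tables differ in permitted start codons), and because this gene starts with ATG no special handling of alternative starts is included. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table built in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the Gene sequence field.
first_blocks = "ATGACCGACT CCATCGCCCC CCAGGTCTGG"
print(translate(first_blocks))  # MTDSIAPQVW
```

The output matches the first block of the Protein sequence field. Passing the full Gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be checked in the same way.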