Gene Mlg_0266 details

Gene Information

Locus tag: Mlg_0266
Symbol:
ID: 4270484
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 301817
End bp: 305119
Gene Length: 3303 bp
Protein Length: 1100 aa
Translation table: 11
GC content: 71%
IMG OID: 638124991
Product: hypothetical protein
Protein accession: YP_741111
Protein GI: 114319428
COG category:
COG ID:
TIGRFAM ID:
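
The length fields above follow from the coordinates. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length_bp(301817, 305119)
length_aa = protein_length_aa(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

With the coordinates listed in this record, the computed values agree with the stated Gene Length and Protein Length.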


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0987529
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACT CCATCGCCCC CCAGGTCTGG CCCGCGGGAC CGCGCGGCCG CGACGACGAC 
CTCTTCCACG AAGACCCGCG CCCCCTGTGC GCAACGTTCC TGTGGGCGGA CATCGTCTAT
CTCACCGGCT CCGGCGAGTT CTGGCTGCTC AACGCCACAG CCGCCGCCGC GATGCACCAC
GCGGCCGACA AGCTGGCCGA CATCGCCGCC GTGGACGACC GCGACGAGCG CAACCGGCGG
CTGTCCGAGG AGGCGGGCGT GCTGGACAGC TTCCTGCCCG CGCATCCGGT CAGTTTCCTG
GGGGAAGCGG ACCGCCAACG GTTCGCGCAG ACCCTCCAGC AGCTCGCCGC CCTTCAGGAC
GAGGCCCCGG ACACGCTGCT GCAACGGGTG GTGGACGGCG TGCCTTTCCA GGGCACCACC
AGCGTGAGCA CCTCGCAATC CTGGTCCGGC GACCATTCAA TGCCTTCGGC CGTACCGACC
CAATGCCCGG AACCGGCACG CATCCACGAG AGCAACGACC ATCTCGACGC ACTTCAGGCC
CTTTATCAAC GCGGCCTGGA CAAGGCCGAG AAGGCCGGCT ATGTCGTCGA TAGCGCCCTG
GTCCACGGCG ACAGCGAGGC GCGCATCCGC GAGGCGCTGC AGCGCTATCA CCGGCGGCGG
GAGCTTGCTT TCCAGGGTGC TCGGTCCCAC CTGGAACAGG GTCAGGGCCT GCCACCGACG
CGGCCGCTCC ATAAGATCCT GGAGCAATAC CGCCGGCATG TGGCGCTCTG CGACCAGGAC
CCGGTACCAG AGGCCGTCGA ACGCTGCGAG ATCGCCTCCG TGATCGAGCA CTACATCCCG
CAACTGGAGC AGGACTACCG CCACTACATC GACAGTCTCA TCGAGCTGGC CGGGCTGGGC
GTGGCCACCC CCGAGCTGGC GCTGGCTGAG GACCCGGACG CCGGCTTCGC CGACGGCGTG
GACTACGTGG CCCGCTACTT CGCGACCCTG GACGAACTCG ACGCCTTGCG CGAGGACGTG
GACACGCGGC TCAGGGAGTG GGAACAGGGC ACGGGCCGGG CCACCCCGCT GCCCATCTTC
CTGTTCACCG ACGAGCAGGC GCGGTTCGAC CGGCTGCGCG AGCGCATGGA CCGCCTCTAC
CGCACGGCCC GGCGCCGGGT GGACCGCACG CGCCCGAGGC GCGTCCTGCA CTGGGACCTG
GGCCCCGACG ACATCCGCGA CCCCGAACCC TACCGCCCGC CCCCGATCCA CCGGCTGGTG
CGCGCCGACT TCCCCCTGCG CGAGTTCAGC GGCCCCGGTC GGCAGCGCAC CCTCGATCAC
CTGAGCCTGC ACCAGCTGGG CGAGACCCGC CCCCACTACG CCCGCCAGCG CGATGCCGCC
ATCGCGCACG ACAGCCGCAC GGTCACCGAG CCCCGCTCCC TGCCCGACAC GGCACTGACC
GGGTGGCTCA CCCGACGCGG CTGCCGGCGG CTCGACTGGA ACCCCGACTG GCACAGCGAG
CCGCTCGGGC TGTTCGAGCC GGAACGCTTC TTCCACGACC TCGACCACCA GGGCCTGGTC
ATCGACCGCC TTGCCGACGA CAGCGCCCGG GAGGAGTGGG GCCGGCGCCT GCGCCGGATC
CTGTTCGCCG ACCCGCTCAA CCACCCCATG CGCCTGTTCG ATGCCAGTGG CCCGGCGCAG
CTCCTGCGCC TGCTGGCCGG CGCATACGCC GAGCCCGACC GGCGCGACCG GGCGCTGTCC
GGGGAGGCAC CCCTGTGGCT GCGCCGGCCC GGACCCGTGG CGCAGGCCGA ACCGGAGACA
GCCGCTTCCG GAAACGGCGC CCGCATCGGC CTGACCGCCC GCTACCAGGA CACCGCCGGC
ATCGACACCG GGGGCAACGC CGGCGGCCGG ACGGTCAGTG TGGCCCCCAG CCTCGCTCTC
GACGAGCTGG GCATCACCGC CACCGTGGCC CGCGGGCAGC ACGGCTCGCC GGTCGTGTTG
CGATTGCGCG GCCAGCACGC CTTCGACTTC GGCCGTGGCG AGATCGCCCT CGCGCCCATC
CAGCTGCCCG ATCCCGCCAA GGCGGAGCCG GTCATCGTCC CGTTCGGGCT GGAGGCGGAT
CACCCGGACG CCCGCTCCAT CGGCCGCTAC TGCCTGCACA TCGAGCCCGT GCTCCACGGC
CATGCCGCCG TGTCCGTGGC GCTGGGCGGC GGCGTCGGGC TCGACACCGC CGGGGGCCGT
CTCGCCGTCA ACGGCCTCGC CCCCGTGGAG CGCGACGGCG TCGATGCCCG ACTGGAGGCC
TTCGCCGGCA CCGGACTGGC CGGCCACAAC CACTGCCGTC TGCTCTGGCA GCCGCCGGCG
AACCTGCTCG CGCGCCTGCC GCGCTACCAG GCCATGGCCG AGATCGACCG CGCGGGCTAC
GCCCGGGACG AGGCCCGCCA GTGGAAAACC CTGACCGCCG CCGAGATCAA CCCCGAGGTG
CGCGTCGGCG TCGGCGGCGA GGCCGCCTTC CGGCTCGGCC TGCACAACGG CCGCTTCGTG
CTGCACGCCT CCCTGCGCCT GGTGCTCGGC GTCGGCGGCG GGGGCAGCGT GCGCCTGGCG
CTTGACCCCC GCCACCTCGA CCTCTGGCTC GCCATGATGC ACCAGGCGCT GGTGGAGGTC
GGCTACGAGC GCGTCGACTG GATCGACGAA GACGCCTTCG AGGAGATGAG CCGCCTGGCC
TATCTCGCCG CCATCACCCT GGTCGAACCC GCCCTGCTCC TGCTGCGCGG CACCCACCGA
CTGCGCCAGA TGATCGAGTG GTTTACCCGG GACCGGGACA TGGCCAGCCG GATCGCCTAC
ACCATCGTCA ACGACCCGCA ACGGGACGCC ATTGCCGCCT GGGTGCGCCA GCTACCGCCC
GAGGCCCTGG GGCCGCTGTT ATACACCCTG ACCAGTCGGC CGCAGGCGTT CGAGGTGGAG
ATTCAGCGGG ATGGCCAGAA GCAAGTACAG CGGTTCGGGC GTGAACAGGC TCTTGTGTTC
CACCAACGGG CGATCCTCAA CTGCCTGCAA TGGATCGTCT CCGGCGTCAT GGCCGGCGTC
TACGGCCCCC GACGCGACTT CTCCGCAGAG CACCCGCACC CGGCGCAGAA GCTGTTTGAA
AAGGCCGTGG TGCGCATGGC CCGAGACGGA CAGCCTACCG ACGAATCGAG GGCCGATGCG
TATGCCGAGA ACCGAGGGCG GCTGGACAAT TTCATGTCAG CAGGCAGCGG ACAGCTGGAG
CAACAGGACC GTCAATGGAA ATACAGACAG AATGCCGGCT GGCTTTCCCG TCACATTCAG
TAG
 
Protein sequence
MTDSIAPQVW PAGPRGRDDD LFHEDPRPLC ATFLWADIVY LTGSGEFWLL NATAAAAMHH 
AADKLADIAA VDDRDERNRR LSEEAGVLDS FLPAHPVSFL GEADRQRFAQ TLQQLAALQD
EAPDTLLQRV VDGVPFQGTT SVSTSQSWSG DHSMPSAVPT QCPEPARIHE SNDHLDALQA
LYQRGLDKAE KAGYVVDSAL VHGDSEARIR EALQRYHRRR ELAFQGARSH LEQGQGLPPT
RPLHKILEQY RRHVALCDQD PVPEAVERCE IASVIEHYIP QLEQDYRHYI DSLIELAGLG
VATPELALAE DPDAGFADGV DYVARYFATL DELDALREDV DTRLREWEQG TGRATPLPIF
LFTDEQARFD RLRERMDRLY RTARRRVDRT RPRRVLHWDL GPDDIRDPEP YRPPPIHRLV
RADFPLREFS GPGRQRTLDH LSLHQLGETR PHYARQRDAA IAHDSRTVTE PRSLPDTALT
GWLTRRGCRR LDWNPDWHSE PLGLFEPERF FHDLDHQGLV IDRLADDSAR EEWGRRLRRI
LFADPLNHPM RLFDASGPAQ LLRLLAGAYA EPDRRDRALS GEAPLWLRRP GPVAQAEPET
AASGNGARIG LTARYQDTAG IDTGGNAGGR TVSVAPSLAL DELGITATVA RGQHGSPVVL
RLRGQHAFDF GRGEIALAPI QLPDPAKAEP VIVPFGLEAD HPDARSIGRY CLHIEPVLHG
HAAVSVALGG GVGLDTAGGR LAVNGLAPVE RDGVDARLEA FAGTGLAGHN HCRLLWQPPA
NLLARLPRYQ AMAEIDRAGY ARDEARQWKT LTAAEINPEV RVGVGGEAAF RLGLHNGRFV
LHASLRLVLG VGGGGSVRLA LDPRHLDLWL AMMHQALVEV GYERVDWIDE DAFEEMSRLA
YLAAITLVEP ALLLLRGTHR LRQMIEWFTR DRDMASRIAY TIVNDPQRDA IAAWVRQLPP
EALGPLLYTL TSRPQAFEVE IQRDGQKQVQ RFGREQALVF HQRAILNCLQ WIVSGVMAGV
YGPRRDFSAE HPHPAQKLFE KAVVRMARDG QPTDESRADA YAENRGRLDN FMSAGSGQLE
QQDRQWKYRQ NAGWLSRHIQ
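
The gene and protein sequences above can be cross-checked by translation. The following is a minimal sketch, not the tool used to produce this record: it builds the codon-to-residue mapping shared by the standard code and translation table 11 (the two differ only in permitted start codons), strips the spacing used in the 10-base blocks, and translates in frame. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; residue assignments are the same for
# translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1; '*' marks a stop codon. Trailing partial codons are ignored."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
first_block = "ATGACCGACT CCATCGCCCC CCAGGTCTGG"
print(translate(first_block))
```

Applied to the first three 10-base blocks of the gene sequence, this yields the first ten residues of the protein sequence; passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.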