Gene Mlg_0018 details

Gene Information

Locus tag: Mlg_0018
Symbol:
ID: 4269549
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 21528
End bp: 22703
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 71%
IMG OID: 638124745
Product: hypothetical protein
Protein accession: YP_740867
Protein GI: 114319184
COG category:
COG ID:
TIGRFAM ID:
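
The coordinates and lengths in this record are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TGA).

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature from 1-based, inclusive start and end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return length_bp // 3 - 1


length = gene_length_bp(21528, 22703)
print(length)                     # 1176
print(protein_length_aa(length))  # 391
```

Both results match the Gene Length and Protein Length fields above.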


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0100534
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCATCAT TGCGAGCTAT GAATGCCCTG CACCCCACAC TGTTTCTGGT GCTTGCCCTG 
ACGGCAGTCA TTGCGCTCCC CGGCTGCGGT GCCATGCAAC GCCATTACGA GGCGGGGCTG
ATCCTGGCCG ATATCCGCGC CGGCGAGGCC GACAGCCGCT GGAAACGCAC CCGACCGGCC
CCGGAACGAG AGACCGTCGA GTACACCGGC CCCACCGGGG TGCCACGGGT GGCCGACCTC
TATACCCCGG GCGACGAGGT CCGCTCCAAC CTGGTGCTGG TCCACGGCTT CACCGAGGCG
GGCCGGCGGG ACCCGCGCCT GGTGCAGTTC GCCAAGACGC TGAGCCGGGC CGGTTTTCGC
GTCCTCGCCC CAGAGGTGGA GACCCTCACC CGCATGGACG TCTCGCCGGA GAACATCCGC
GATGTGGTGG ATGCCGCCCA CTGGCTGGAC GCGCGGGACG ACGGCGAGGG GGTGGGCGTG
GCCGCGATGA GCTTCTCCGT CGCCACCGCC GTGCTGGCGG CGCTGGAGGA GGACGGCCGG
CCACACATCG GCTGGATCGT CGGGGTGGGC GGCTACTACG ATCTGGTGGA GACCCTGACC
TACGTCACCA CCGGCTATTT CACCGAGGAC GGCGAGCGGC GCTACCAGAT CCCCCGGGTG
GAGGGCCGCT GGGTGGTCCT GCTGACCCAG CTGGACCGGG TGCCGGATGC CGACGACCGC
CGCCTGCTCG ACCGTATCGC CCGCGAGCGC CTGGCGGACC CGGAGGCCGA GACCGGGCCG
CTGGCCGAGC GGCTGTCACC GCCCGGGCGC GCGGTCTATG CCTTGCTCAC CAACCGGGAT
CCGGATCGGG TACCGGACCT GCTGGCGGCG TTGCCCGACG GCGTCCGCGA CGAGATTAAA
GCGCTGAATC TTGCCAACCG GGACCTGTCC CGGCTTCAGG CCTACCTGCT GCTGGTCCAC
GGCCGCGACG ACGATGTCAT CCCCTGGACC CAGAGCCAGG CCCTCAAGCA GGCCGCCCCC
AGGGGGCAGG CCGAGTTGCG GCTGGTGACC GGCCTCACCC ATGTGGATGT GGACCCCGGG
GTGGTGGGCG CCTGGCGGTT GCTGCGGGCG GTCAACCGGC TGCTGTTGCT GCGCGACGAC
CCGCCACCGA CCCCCTCGTC GGCGAATGAG CCATGA
 
Protein sequence
MPSLRAMNAL HPTLFLVLAL TAVIALPGCG AMQRHYEAGL ILADIRAGEA DSRWKRTRPA 
PERETVEYTG PTGVPRVADL YTPGDEVRSN LVLVHGFTEA GRRDPRLVQF AKTLSRAGFR
VLAPEVETLT RMDVSPENIR DVVDAAHWLD ARDDGEGVGV AAMSFSVATA VLAALEEDGR
PHIGWIVGVG GYYDLVETLT YVTTGYFTED GERRYQIPRV EGRWVVLLTQ LDRVPDADDR
RLLDRIARER LADPEAETGP LAERLSPPGR AVYALLTNRD PDRVPDLLAA LPDGVRDEIK
ALNLANRDLS RLQAYLLLVH GRDDDVIPWT QSQALKQAAP RGQAELRLVT GLTHVDVDPG
VVGAWRLLRA VNRLLLLRDD PPPTPSSANE P
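
The sequences above are printed in blocks of ten characters, so they need the whitespace removed before any analysis. The sketch below shows one way to do that, then compute a GC fraction and translate the coding sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one; NCBI translation table 11 uses the same codon-to-amino-acid assignments and differs in which codons may act as start codons, which this sketch does not handle (it is sufficient here because the gene starts with ATG).

```python
# Standard codon table, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence above.
first_row = "ATGCCATCAT TGCGAGCTAT GAATGCCCTG CACCCCACAC TGTTTCTGGT GCTTGCCCTG"
print(translate(first_row))  # MPSLRAMNALHPTLFLVLAL
```

The printed residues match the first two blocks of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the GC content field of the record.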