Gene Mlg_2274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2274 
Symbol 
ID4268237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2577009 
End bp2578811 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content68% 
IMG OID638127031 
Productextracellular solute-binding protein 
Protein accessionYP_743106 
Protein GI114321423 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCCAGC GTATCGTCCG GACCTTCGTC ATGGCCCTGG GCCTGATCGG CCTGCCCGCG 
GCCGCCTTGG CCGGTGGTCA CGCCATCGCG CTGCACGGCG AGCCCAAGTA CGGTCCCGAC
TTCGAGCACT TCGACTACGT CAACCCCGAC GCCCCCAAGG GCGGGGCGGT GCGCCTGTCC
GCCCTGGGCA CCTTCGACAG CCTGCACCCC TATATCCTCC GCGGGGTGCC GGCCCAGGGG
CTGAGCCAGG TCTTCGACAG CCTGACCGAG AACAGTGCCG ACGAGCCCTT TACCGAGTAC
GGGCTGATTG CCGAGACCAT TGAGGTGGAC CCGGAGGGCT ACTGGGTGCG CTTTGACCTG
CGGCCCGAGG CGCGCTTCCA CGACGGCGAG CCCATCACCG TGGACGACGT CATCTGGACC
TTCGAGACCC TGCGCGAGCA CGGTCACCCC TCGCTGCGCA GCTACTACCG CGACGTGGAG
CGGGTCGAGC GGACCGGCGA GCGCCAGGTG ACCTTCCACT TCGCTGGCAA TGAGAACGCC
GAGCTGCCGC TGATCGTCGG GCAGATGCCG GTGCTGCCGG AGCACTGGTG GGCGGACCGC
GAATTCGATC GCACCACCCT GGACAAGCCC CTGGGCAGCG GCCCCTACCG GGTGGCCGAG
GTGCGCCAGG GCCGGCATAT CGTCTACGAG CGGGTGGAGG ACTACTGGGC CGCCGACCTG
CCGGTGAACC GCGGGCGCCA CAACTTCGAC CGCATCCGTT ACGACTACTA CCGCGACGCC
GATGTGGCGC TGGAGGCCTT CCGGGCCGGG GAGTATGACT TCCGTCCCGA GAACATCGCC
CGCAACTGGG CCAATGCCTA CGATTTCGCC GCGGTGCGCG AGGGCCGGGT GCAGCGCGAG
GAGATCGCCC ACGAGATCCC CACCGGGATG CAGGGCTTCT TCATTAACAC CCGGCGTGAC
CGCTTCAGCG ATCCGCGGGT GCGTGAGGCG TTGTCGCTGG CCTTCGATTT CGAGTGGACC
AACCGCAACC TGTTCCACGA TGGCTACACC CGTACCCGGT CCTACTTCTC CAATTCGGAA
CTGGCGTCCG ACGGGCCGCC CTCGGCCGAG GAGCTGGAGA TCCTCGAGCC CTACCGCGAT
CAGTTGCCGG AGGCGCTGTT CGAGTCCGCC TTCGAGCCGC CGAGCACCGA AGGGGATCGC
GGCCTGCGCC GCAACCTGCG GCAAGCGGCG GCCCTGCTGC GGGAGGCCGG CTGGGTGGTC
GAGGACGGCC GGCTGGTGCA CGGCGAGACC GGTGAGCGCA TGCGCTTCGA GGTGCTGCTG
GATAACGCCA GCTTCGAGCG GGTAGCCCTG CCCTGGCGGC GCAACCTGGA GCGGTTGGGC
ATGGAGGTGA GTGTGCGTAC CGTGGACACT TCCCAGTACC AGAGCCGCAT GGATGAGTTC
GACTTCGACA TCACCGTGCA GTTGATCGGC CAGTCCCTGT CGCCCGGCAA TGAGCAGCGC
AACTACTGGA GCTGCGCCGC CGCCGAGACC CCGGGCAGCC GCAACTACGC CGGCATTTGC
GACGAGGTGG TGGACGCGCT GATCGAGCGC ATCATCCACG CCCCCGATCG CGACACCCTG
GTGGCCGCCA CCCGCGCCCT GGACCGGGTG CTGCTGCACG GCCACTATGT GGTGCCCCAC
TGGCACCTGC CGGCCTTCCG GTTGGCCTAC TGGGACAAGT TCGACCGCCC GGAGACCAGC
CCGAAATACG CCCTGGGCTT TGACACCTGG TGGTACGACG AAGAGCGCGC CGCCGAGCTT
TGA
 
Protein sequence
MPQRIVRTFV MALGLIGLPA AALAGGHAIA LHGEPKYGPD FEHFDYVNPD APKGGAVRLS 
ALGTFDSLHP YILRGVPAQG LSQVFDSLTE NSADEPFTEY GLIAETIEVD PEGYWVRFDL
RPEARFHDGE PITVDDVIWT FETLREHGHP SLRSYYRDVE RVERTGERQV TFHFAGNENA
ELPLIVGQMP VLPEHWWADR EFDRTTLDKP LGSGPYRVAE VRQGRHIVYE RVEDYWAADL
PVNRGRHNFD RIRYDYYRDA DVALEAFRAG EYDFRPENIA RNWANAYDFA AVREGRVQRE
EIAHEIPTGM QGFFINTRRD RFSDPRVREA LSLAFDFEWT NRNLFHDGYT RTRSYFSNSE
LASDGPPSAE ELEILEPYRD QLPEALFESA FEPPSTEGDR GLRRNLRQAA ALLREAGWVV
EDGRLVHGET GERMRFEVLL DNASFERVAL PWRRNLERLG MEVSVRTVDT SQYQSRMDEF
DFDITVQLIG QSLSPGNEQR NYWSCAAAET PGSRNYAGIC DEVVDALIER IIHAPDRDTL
VAATRALDRV LLHGHYVVPH WHLPAFRLAY WDKFDRPETS PKYALGFDTW WYDEERAAEL