Gene Mlg_1086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1086 
Symbol 
ID4270031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1264706 
End bp1266277 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content71% 
IMG OID638125838 
Productextracellular solute-binding protein 
Protein accessionYP_741928 
Protein GI114320245 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACGCA AACCGGCCAC TGCGTTATCC ACCATCGACC TGCGCCGCCG GCGATTGCTC 
CAGACGCTGG GCATCGGCGG GGCCGCTGCC GGGCTCGGCG GGCTCCCCTG GCTGACCGCA
CCCGGCGCCC TGGCCGGCGA ACCCCGCCAC GGGGGGACCC TGCGAGTGGC CCTGCCCCAG
GCCGCCACCA TCAATCCGCT GCGCATGAGC GCCGGCGGTG CCATCGCCGT GGTGCAGCAG
GTGGCCGAAT ACCTGGTGCG CGTGGACGAG CACCTGAACC TGCAGCCGGC CCTGGCCACG
GACTGGGAGA CGCCCGACGA GGGGCGGACC TGGGTGTTCC GCCTGCGCGA GGGGGTGAAG
TTCAACGACG GCCGGCCGCT GGAGGCCGAC GACGTGGTGG CCAGCTTCCG GCGCCTGGTG
GATCCGGACG TCGCCAGTGT GGCCCGCTCA CAGTTGGACT TTCTGCCGCC CGAGGGCATC
AGCGCCGACG ACCGGCACAC CGTCCGCTTC GAGTTATCGC GGCCCATCGG CGCGTTCCCC
TACTACACCC AGATCTACAA CGCCGTGATC CTTCCCCGCG ACCACGACGG CGATCTCGCC
AGTAACCCCG TCGGCACCGG GCCCTTCCTA CTGACCCACT ACCAGGCCCG CGAAGAGGCC
GTGCTGGAGC GCAACCCCGA CTATTGGGAC GCGCCACGGC CCTACCTGGA CCGGGTGCGC
ATGGCGATGT ACGACGGTGA TCAGCCCCAG GTGCTGGCCC TGCGCGGCAA CGCCGCCGAC
ATGATGCTGT TCGCCAGCTA TATGAACGCC CGCCCGCTGC TGGACAACGA GGAGATCAAC
CTGCTCAGCG CCCGCAGCAC CCAGCACCGC CAACTGGCCA TGCGCTGCGA CCAGCCCCCT
TTCGAGGACG TCCGCCTCCG CCGCGCCGTG GCCCTGGCCC TGGGCCGGGA GTCCATGGTG
AACAACCTCA TTGGCGGCTA CGGCGAACCG GGCAACGACC ACCCGGTGGC GCCCCTCTAC
CCGGACGCGG CCGATATCGA CCTGGCCCGG CGCGAGCCCG ACCTGGAACG GGCGCAGGCG
CTGCTGGCCG AGGCCGGCGT GCCCAATGGC TTCCAGATCG ACCTCCACAT CGGTCGCCTG
GCCGAACTGC CGCAGTACGG CGTCCTGGTC GAACGCATGC TGAACCCGAT CGGGATCCGC
GTGAACCTGC GGGTGGAACC ACTGAACACC TACTACGAAC ACTGGACCAA GGTGAACTTC
GGCCTCACCG ACTGGGTGGG CCGGGCGGTT CCGGAGCAGA TCCTGGCGGC GGCCTTCCGG
GGGGGCGCCG ACTGGAACGC CCCCCATTGG CGCGATGATG AGTTCGACGC CGCCCTTAGC
GAGCTGGAGG CGGCCACCGA TCCGGCGCGC CGCCGGCAGC TCACGGCCAC CCTGGCCGGG
AAGCTGCACG AGGAGGTGCC GGCGGCCATC ACTTACTTCA CCCAGGCCCT GCGCCCGGTG
CGTCGGCGGG TGCAGGGTCT CCGCGCGGAC AATGCCAACT ACCTGGACCT GACCCGGGCC
TGGCTGGCCT AG
 
Protein sequence
MRRKPATALS TIDLRRRRLL QTLGIGGAAA GLGGLPWLTA PGALAGEPRH GGTLRVALPQ 
AATINPLRMS AGGAIAVVQQ VAEYLVRVDE HLNLQPALAT DWETPDEGRT WVFRLREGVK
FNDGRPLEAD DVVASFRRLV DPDVASVARS QLDFLPPEGI SADDRHTVRF ELSRPIGAFP
YYTQIYNAVI LPRDHDGDLA SNPVGTGPFL LTHYQAREEA VLERNPDYWD APRPYLDRVR
MAMYDGDQPQ VLALRGNAAD MMLFASYMNA RPLLDNEEIN LLSARSTQHR QLAMRCDQPP
FEDVRLRRAV ALALGRESMV NNLIGGYGEP GNDHPVAPLY PDAADIDLAR REPDLERAQA
LLAEAGVPNG FQIDLHIGRL AELPQYGVLV ERMLNPIGIR VNLRVEPLNT YYEHWTKVNF
GLTDWVGRAV PEQILAAAFR GGADWNAPHW RDDEFDAALS ELEAATDPAR RRQLTATLAG
KLHEEVPAAI TYFTQALRPV RRRVQGLRAD NANYLDLTRA WLA