Gene Mlg_2073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2073 
Symbol 
ID4270459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2350167 
End bp2352287 
Gene Length2121 bp 
Protein Length706 aa 
Translation table11 
GC content69% 
IMG OID638126829 
ProductTRAP transporter, 4TM/12TM fusion protein 
Protein accessionYP_742905 
Protein GI114321222 
COG category[R] General function prediction only 
COG ID[COG4666] TRAP-type uncharacterized transport system, fused permease components 
TIGRFAM ID[TIGR02123] TRAP transporter, 4TM/12TM fusion protein 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.748226 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.29835 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCC CGCGTAACGG TGAGGACCAG GCCCGGGAGA TGGCCCGGGC CGTGGAGAGC 
GGTGCCCGCC AGCCACGCGG CCGGCTGGCC GTGGCGGCCA TCGCGCTGCT GTGCCTGGCC
TGGTCCGCTT TCCAGGTGGC CATCGCCTGG GACCCCATCG ACGCCCACAT CGCCCGGGTC
TGGCACCTGG CCTTCGCCAT CACCCTGGCC TACCTGTTGT TCCCCGCCCG GCACCGCCCG
GCGCCGCGCT GGCTCCGCCC CCTGCAGCGC TTTGGCCTGT TCCGGGACAA CCGCCAGCGC
ATCCCGTGGG TCGATATCCT GCTCGCCCTG GTGGCGGCCG CCGCCACCCT GTGGATCTGG
TGGGACTACG AGGAGATCCT CTGGCGGGCC GGGCTGCCCG AGCCCCAGGA CATCGTCCTG
GGCATCGTGC TGGTGGTGCT GTTGCTGGAG GCAGCCCGGC GAACCTTGGG CTGGGCGCTG
CCCATTCTCG CTGCCCTGTT CCTGATCTAC TGCTTCGCCG GCCCCTGGAT GCCCGGGCTG
CTCCGCCACC GCGGCGTGTC GCTGGAGATC CTGGTCAACG ACATGTACCT GAGCGACTCC
GGGATCTTCG GGGTGCCACT GGGCGTGTCG GTGGGGTTCG TCTTCCTGTT CGTGCTGCTG
GGCGCGCTGC TGGAGCGGGC GGGGGCCGGC AAGTACTTTA TCGACGTGGC CTACTCCGCC
CTGGGCAGCC TGCGCGGCGG CCCGGCCAAG GCGGCGGTGT TCGCCTCCGG GCTCACCGGC
TCGGTGTCCG GCTCCTCCAT TGCCAACACC GTCACCACCG GTACTTTTAC CATCCCGCTG
ATGAAACGGG TGGGTCTGCC GCCGCACAAG GCGGCGGCGG TGGAGGTGGC CGCCTCCACC
AACGGCCAGT TGATGCCGCC GGTGATGGGC GCGGCCGCCT TCATCATGGC GGAGATCGTC
GGCCTGCCCT ATCTCGATGT ACTGCGGGCG GCGATCATCC CGGCGCTGGT GGCCTACATC
GCCCTCTTCT ACGTAGTGCA TGTGGAGGCC TGCAAAGAAA ACGTGGCCGT GGTGCCGCGC
AGCGAACTGC CCCCCTTCTG GTCCACGCTA CTGCGCGGCC TGCACTACCT GGTGCCGCTA
CTGATGCTCA TCTGGTTCCT AGTGATCATG CGCCGCTCGC CGGTGGCCTC GGCACTGCTG
GCCATCCAGG CCACCGTGGT GATCATGCTG GTGCAGCGGC CCATCCTGGC CTACCTGACC
CATCGGCGGA TGGGCGCCGG GGCGCCCCTC TACCGGGTGT TGGGGCAGGC CCTGCTGGGC
GGTGTGGGCG ATGTGCTGCG GGGGATGATC GGCGGCGCCC GGAATATGGT GGCCGTGGGG
GTGGCCACCG CCACCGCCGG GATCATTGTC GGGGTGGTCA GCACCACCGG GCTGGTGGGC
CGCTTCGTCA ACGTCATCGA GGTGCTCTCC ATGGGGAGCC TCTACCTGAT GCTGGGGCTC
ACCGCCCTCA CCTGCATCAT CCTCGGCATG GGGCTGCCCA CCACCGCCAA CTACATTGTC
ATGGCCACGC TGACCGCCCC GGTGATCGTC CAGTTGGGCG GTGACATGGG GCTGCTGATC
CCGGTGATCG CCGCCCACCT GTTTGTGTTC TATTTCGGCA TCCTGGCCGA CGACACCCCA
CCGGTGGGGT TGGCGGCCTA TGCCGGCGCG GCGATCGCCC AGAGCCCGCC CCTGAAGACC
GGGGTCCAGA GTTTCAGTTA CGACCTGCGC ACCGCGATCC TGCCCTTCGT CTTCGTTTTC
AATACCGAAC TGCTGATGAT CAGCGGTTTG GACGAGGCGG GGCGGATCAT CTGGCAGACC
GACCCGGTGG TGATCGGCTG GACGTTCCTC ACCGCCCTGC TGGGGCTGTT CGCGCTGGTC
TCCGCCATCG CCGGCTACGC CGGCAGCCGC TGCCACAGGC TGGAGCGGCT GCTGTTGCTG
GCCCTGGCGC TGCTGCTGCT GCGCCCGGAC TGGCTGGCCG AGCCCACCGC GGCACCGGCC
GCACTGGTCC AGATCGCCTG CCTGGCGGCG TATGGCATGC TGTATCTCTG GCAACGCCGA
CGCGCACCCC GGCCCGCCTG A
 
Protein sequence
MSTPRNGEDQ AREMARAVES GARQPRGRLA VAAIALLCLA WSAFQVAIAW DPIDAHIARV 
WHLAFAITLA YLLFPARHRP APRWLRPLQR FGLFRDNRQR IPWVDILLAL VAAAATLWIW
WDYEEILWRA GLPEPQDIVL GIVLVVLLLE AARRTLGWAL PILAALFLIY CFAGPWMPGL
LRHRGVSLEI LVNDMYLSDS GIFGVPLGVS VGFVFLFVLL GALLERAGAG KYFIDVAYSA
LGSLRGGPAK AAVFASGLTG SVSGSSIANT VTTGTFTIPL MKRVGLPPHK AAAVEVAAST
NGQLMPPVMG AAAFIMAEIV GLPYLDVLRA AIIPALVAYI ALFYVVHVEA CKENVAVVPR
SELPPFWSTL LRGLHYLVPL LMLIWFLVIM RRSPVASALL AIQATVVIML VQRPILAYLT
HRRMGAGAPL YRVLGQALLG GVGDVLRGMI GGARNMVAVG VATATAGIIV GVVSTTGLVG
RFVNVIEVLS MGSLYLMLGL TALTCIILGM GLPTTANYIV MATLTAPVIV QLGGDMGLLI
PVIAAHLFVF YFGILADDTP PVGLAAYAGA AIAQSPPLKT GVQSFSYDLR TAILPFVFVF
NTELLMISGL DEAGRIIWQT DPVVIGWTFL TALLGLFALV SAIAGYAGSR CHRLERLLLL
ALALLLLRPD WLAEPTAAPA ALVQIACLAA YGMLYLWQRR RAPRPA