Gene Mlg_1203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1203 
Symbol 
ID4270691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1401958 
End bp1402890 
Gene Length933 bp 
Protein Length310 aa 
Translation table11 
GC content68% 
IMG OID638125952 
Producthypothetical protein 
Protein accessionYP_742042 
Protein GI114320359 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID[TIGR00996] virulence factor Mce family protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.511791 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.098201 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAACCC GCGTGAGCTA CGCCCTGGTT GGGCTGTTCG TCATCCTGCT CACCCTGGCC 
ATCATCGGCG CCGGGCTCTA TCTCGGTGGC GACATCCGGA CCCAGCCTCA TACCGACTAT
GCCGTCTACA TGGATGAGTC GGTGGCCGGG CTCAATGTCA GCGCCCCCGT TCGTTACCGC
GGCGTGGACG TGGGCCGGGT CCAGGCCATC ACCCTCAACC CCCGGCATCC GGACGAGGTC
CGCATCGTCA TCTCCGTCGA GGAGCGGGTC CCCATCGGCC GGGAGACCGT GGCCACGCTC
CGTTCCCAGG GGTTGACCGG GATCTCCTTC ATCGAGCTTA GTGGCAGCAC CACCGACCCC
GTCACGCCGC AACCGCGCGC CGGCGATGAC CTGCCCGCCC TCCGCACCGT CCCCTCCTTC
GGCAGCCGCC TGGAGCAGAC GGTGGACGAG GCTTTGGGTG TGATGCGGGT GGTGGCCGAC
GAGGTGCGCG ACCTCCTGCG CGAGGAGAAT CGCGAGCGCG TGGCCCGGCT GCTCCAGAAC
GCCAACGTGC TGGTCGCCAA CCTGGCCGAG GGCAGCGAGG ACCTGGACCA GACCATGGTT
CGGTTCAACC AACTGCTCGA CCAGGGCAAT GAGGCCGCCG CGCGGCTGCC GGAGAGCATG
GACCGGCTCG ACGACACCCT GGCGCGCTGG GCGCGGCTGG CCGACGACCT GGGCCGGACC
GGTGACACCC TGGACGCCCT GGCCAGCCGG GGCGAGACCA CCCTTATCGA TGTCAATCAG
ACCCTGATCC CCGAACTGGG CACCCTGATG TACGAGATGC GCCGGTTGTC ACAGGATCTG
GAACGGACCC TGGAGGACTT CAGCGACGAG CCGCAGATGC TGATCTACGG CCGCCAACCC
ATCGCCCCGG GCCCCGGAGA GGAGACGCGC TGA
 
Protein sequence
METRVSYALV GLFVILLTLA IIGAGLYLGG DIRTQPHTDY AVYMDESVAG LNVSAPVRYR 
GVDVGRVQAI TLNPRHPDEV RIVISVEERV PIGRETVATL RSQGLTGISF IELSGSTTDP
VTPQPRAGDD LPALRTVPSF GSRLEQTVDE ALGVMRVVAD EVRDLLREEN RERVARLLQN
ANVLVANLAE GSEDLDQTMV RFNQLLDQGN EAAARLPESM DRLDDTLARW ARLADDLGRT
GDTLDALASR GETTLIDVNQ TLIPELGTLM YEMRRLSQDL ERTLEDFSDE PQMLIYGRQP
IAPGPGEETR