Gene Mlg_0747 details

Gene Information

Locus tag: Mlg_0747
Symbol:
ID: 4270508
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 830129
End bp: 831280
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 65%
IMG OID: 638125496
Product: major facilitator transporter
Protein accession: YP_741591
Protein GI: 114319908
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
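
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the gene is a stop codon that does not appear in the protein. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 830129
end_bp = 831280

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints: 1152 383
```

Both results agree with the Gene Length and Protein Length values listed in the record.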


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCGTATCC CCCAGTTACT GATTGCTGTC TATTGTACCC TGCTGGCCTT CTCCGCCATT 
TACGCCCCGC AGCCGCTGCT GCCGGTGCTG CAGGGGGCGT TTGACGTCAG TGAGACCCGG
GCCTCGCTGC TCATTACCGT GACCCTGCTG CCCCTGGCCA TCGCCCCGGT GGCCTACGGC
TTCGTCCTCC AGCGGTTCTC GGCCAAGCGC CTGCTGATCG GCGCCACCGC CCTGCTGGCG
GTCACCGAGT ACCTGATCTT CTTCGTCACC CACTTCGAGC TGTTCCTGTT CCTGCGGCTG
CTGCAGGGCC TGCTGATCCC GGCCATCCTC ACCGCGCTCA TGACCTATCT CTCGGCCAGC
GCCGGACCGG GTCGCATCGC CCGGGTGATG GCCTTTTACG TGGCGGCGAC GGTGCTGGGC
GGGTTCCTCG GCCGGGCGCT GTCCGGTCTG ATCTCCACCG GCTTTGGCTG GCGCTGGTCG
TTCCTATTCC TGGGGCTCGC CCTGACCGTC TGCGTGCTGC TGCTGCGACG GCTGGACGCC
GACCCGCCGG TCAGTTTTCA GAAGCTGCGT GCGGGGACGG TGGTGGCGGT ATTGCGTCAG
CCCAGTTTCC TGCGGCTGTA CGGGGTGATC TTCTGCGCCT TCTACGTCTT CGCCTCATTG
CTGAACTTCC TGCCCTTCCG CCTGGTGGAG CTGGGCAGTG GCATGAACGA GACCGGGATC
GCCCTGATGT ACTCCGGCTA CCTCATGGGC GTGGTCACCT CGCTGCTCTC CCTGCGGATC
GCGGGGCGCA TCGGCGGGCC GGTCAACACC ATGCTGCTGG GGACAGTGAT CTTCGCCGGC
TCCCTGCTCT TCTTCCTGGG GCATTCGCTG TGGCTGATCT TCGCCGGCAT GTTTGTCTTC
TGCGGAGGCA TGTTTCTCAT CCATTCGCTG GCCCCCGGTT TTCTCAACCA GCGGGCTGGG
GAACAGCGGG GCGTGGTGAA TGGCCTCTAT ATCGCCTTCT ATTATGCGGG TGGCACAGTG
GGCTCCTTCA TACCCGGCTT CATTTACCAC AGCCTCGGCT GGGCGGCCTA CCTGGCATCG
CTGGCGGCGG TACTGGCCCT GGCGGGCTAT TGGCTGACAG GATTGCGCCG ACAGACGGTG
CCGGCGAACT GA
 
Protein sequence
MRIPQLLIAV YCTLLAFSAI YAPQPLLPVL QGAFDVSETR ASLLITVTLL PLAIAPVAYG 
FVLQRFSAKR LLIGATALLA VTEYLIFFVT HFELFLFLRL LQGLLIPAIL TALMTYLSAS
AGPGRIARVM AFYVAATVLG GFLGRALSGL ISTGFGWRWS FLFLGLALTV CVLLLRRLDA
DPPVSFQKLR AGTVVAVLRQ PSFLRLYGVI FCAFYVFASL LNFLPFRLVE LGSGMNETGI
ALMYSGYLMG VVTSLLSLRI AGRIGGPVNT MLLGTVIFAG SLLFFLGHSL WLIFAGMFVF
CGGMFLIHSL APGFLNQRAG EQRGVVNGLY IAFYYAGGTV GSFIPGFIYH SLGWAAYLAS
LAAVLALAGY WLTGLRRQTV PAN
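
The gene sequence begins with TTG while the protein sequence begins with M. Under translation table 11 (the table named in the record), TTG is one of the permitted alternative start codons and is read as methionine at the initiation position. The sketch below illustrates this on the first and last bases of the gene sequence. It is a hand-rolled illustration rather than the tool that produced the record, and the function name `translate_cds` is hypothetical.

```python
from itertools import product

# Standard codon layout in TCAG order; table 11 uses the same amino-acid
# assignments as the standard code and differs only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    A recognized start codon in the first position is translated as M.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for index in range(0, len(dna) - 2, 3):
        codon = dna[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon: end of the protein
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"  # initiator codon is read as methionine
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate_cds("TTGCGTATCC CCCAGTTACT GATTGCTGTC"))  # MRIPQLLIAV
print(translate_cds("CCGGCGAACT GA"))                     # PAN
```

The two outputs match the first ten and last three residues of the protein sequence in the record.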