Gene Mlg_1623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1623 
Symbol 
ID4269355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1854338 
End bp1855738 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content67% 
IMG OID638126380 
ProductNa+/solute symporter 
Protein accessionYP_742459 
Protein GI114320776 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATG GCTTCACCCA CCTCCCCATG ACCGGCCTGG TGATCGCCCT GGCGGCGGTG 
GCGCTGGTTA CCTTGCTGAT CGCCCCCCGG GTCAGTACCG CCAACGGGTT CTTCCGCGGT
TTCGCCGACG CGGGCCAGCC GCCCGGCGTG CTCACGCTCA CCCTCTCCCA GGTCACCACC
TGGATCTTTG CCCGCTCATT GCTCAATGCG GCCATCCTGG GTTATTTCTT TGGCATCGCC
GGTGTGCTCG CCTACACCGC CTACTACGGC TCCTTTGTCA CCGGCTGGCT GATCATCGAT
CGCCTTCGCT TCCGCTACGG CGCCTCCAAT CTGCAGGGTT TCATGAGGGC CCAGTATGGC
GGGGTGGGGA CCTTCTGCCT GAACGGACTC CTGGCGCTGC GCCTGGTCAG CGAGGTGTTC
GCCAACCTGC TGGTCGTCGG GATCGTCTTC GGCGCCGCCG GCAGCGCCGG CAACACCACC
GCCATGCTGT CGGTGGCCGC CCTGACCCTG GTCTATTCCC TCACCGGCGG GCTCCGGGCG
TCGCTGCGAA CCGATGTCCT GCAGATGGTG CTTTTGATCG GTCTGCTCGC CGCCCTCGTG
GTGGTCATGC TCCTGCATCC GCTCTTTGAC CCCGGGGCGC TGGCGGTCAG CAGCCCCGGG
ATCGACAACC CGGGCTGGAT CCTGCTGGCG GTCGCGGGTC TCCAGGTGCT CAGCTACCCC
ATGCATGACC CCGTCATGAT GGACCGCGGG TTCCTCGCCG ACCGCCGCAC CACACGGCGC
AGTTTTCTGC ACGCCTTCTG GCTCTCGGCG GTGTGTATCC TGGCCTTCGG GCTGCTGGGG
GTTTTCGCGG GCCTGCATCG CCTGGAGGGC GAAACCGTGC TGCTGACGCT GGACCGGCTG
ATGGGTACCC CCGTGATGTT TCTCCTGGCG GTGGCGCTGG TCCTGTCGGC CGCCTCGACC
CTGGACTCGA CCTTCTCCAG CGCGGCCAAA CTCAGCATCG CCGACATGGG ACTGGCCCGG
ACCACACCCC GCAACGGCCG GATCGCAATG CTCCTGTTCT GCCTGGGCGG GCTGCTACTG
GTGCTGTTCG GCACCGACGA CCTGTTCGCC GCCGTGGCCG TCAGTGGAAC CGCGTCGCTG
GCGCTCACAC CGGTCATCGC CTTCTCCCTA CTGGCCGGCT GGCGGCTTAG CCGGGCCAGT
CTGCTGACCT CCTTTACGCT GGCCTTCACG GGCGCTGTCG TCTACTTCCT GGAGACCTCC
GGGCATATCG ACCTGCTGAC CCCCCTGACC GGGCTGGAGC ACGACTACAG TAAGCTGCTG
GCGATCACGG TCACCATCCT GGTTCTTGCC ACCGCGGCGG CCTGGCTTGG CCGCCAGGGC
CGCATGGAGG TCGTGAAATG A
 
Protein sequence
MSDGFTHLPM TGLVIALAAV ALVTLLIAPR VSTANGFFRG FADAGQPPGV LTLTLSQVTT 
WIFARSLLNA AILGYFFGIA GVLAYTAYYG SFVTGWLIID RLRFRYGASN LQGFMRAQYG
GVGTFCLNGL LALRLVSEVF ANLLVVGIVF GAAGSAGNTT AMLSVAALTL VYSLTGGLRA
SLRTDVLQMV LLIGLLAALV VVMLLHPLFD PGALAVSSPG IDNPGWILLA VAGLQVLSYP
MHDPVMMDRG FLADRRTTRR SFLHAFWLSA VCILAFGLLG VFAGLHRLEG ETVLLTLDRL
MGTPVMFLLA VALVLSAAST LDSTFSSAAK LSIADMGLAR TTPRNGRIAM LLFCLGGLLL
VLFGTDDLFA AVAVSGTASL ALTPVIAFSL LAGWRLSRAS LLTSFTLAFT GAVVYFLETS
GHIDLLTPLT GLEHDYSKLL AITVTILVLA TAAAWLGRQG RMEVVK