Gene Mlg_0193 details


Gene Information

Locus tag: Mlg_0193
Symbol:
ID: 4268635
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 223944
End bp: 225221
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 64%
IMG OID: 638124917
Product: major facilitator transporter
Protein accession: YP_741038
Protein GI: 114319355
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2223] Nitrate/nitrite transporter
TIGRFAM ID:
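
The start, end, gene length, and protein length fields in this record are tied together by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are inclusive and that the protein length excludes one terminal stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(223944, 225221)
print(length)                     # 1278
print(protein_length_aa(length))  # 425
```

Both results agree with the Gene Length and Protein Length values listed in the record.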


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.81343
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAGCG AGCGATTCAA ACAGTACTCC ATCCTGACCG CGAACACCTT TGCGTTCACG 
ATCATGTTCA TGGTCTGGAC CGTGTTCGGC GAGATCGGTG TGCCCATTCA GGAGGAGATG
GGCCTGAGCG ATACCCAGTT CGGTCTGCTG ACGGCCGTAC CCATTCTGAC GGGCTCGCTA
GTGCGTCTGC CCTTGGGGGC CTGGACGGAC AAGTACGGCG GTCGGCCCGT GTTTTTCATC
CTCATGCTGG CCGTGCTGCC GGGACTCTGG CTGGTGCAGT TCGCCACCGA GTACTGGCAG
CTGCTGGTCC TGGGCGCCAT GGTGGGTCTG GCGGGCGGCT CGTTTTCGGT CGGCATCACC
TACACGGCGA AATGGTTTCC GCGGGAGCGC CAGGGCCTGG CGATGGGTGT CTTCGGCGCG
GGCAACGCCG GCGCGGCGGT GACCAAGTAC GTGGCGCCGA CCATCATCGT GGTGTTGAGC
TGGCAGGCGG TGTCGAACAT CTATGCCGCC ATCGTGGTCG TTACCGCCAT CCTGTTCTGG
TTCTTCACCT ATACCAACCC AGAACACCGC ACTGCGGACG CGCCGAAGCT GCGCGAGCAG
ATCGCCGTAC TCAACGACCC GAAGGTCTGG AAGTACTGCC AGTACTACTC GGTGGTGTTC
GGGGGCTACG TGGGCGTTTC GCTCTATCTG ACCCGGTATT ACATGCTCGA GTACGGTGTC
GGTATCCAGC TCGCCGCCCT GATCGCCACC ATCTTCGTGC TGCCCTCCGG CGTGATCCGG
GCCTTCGGCG GCTGGCTGTC AGATCGCTTT GGCGCCCATA CGGTGACCTG GTGGTGCATG
TGGGCGAGCC TGGTGAGCTT CTTCTTCATG TCCTATCCGC CCACGGACTA CACCATCCAC
GGCCTGGACG GCCAGACCAC CACCTTACAC CTGGCCATGC CGGTGTGGCT GTTCACGGCG
CTGCTGTTTA TCGTCGGTAT CGCTTGGGGG ATCGGCAAGG CCTCGGTGTT CAAGTACCTC
TCCGATGAAT ACGACCGCAA CCTGGGGCTG GTGTCGGGCA TCGTGGGTTT GGCCGGCGGG
ATGGGCGGCT TCCTCCTGCC GCCCATGTTC GGTGCGCTGA TCGACCTGAC CGGGGTGCCC
ACCACCATCT GGATGCTCTG CTTCGGCTTC ACCCTGGTCT CGGTGGTCTG GATGTGGTGG
ACCGAGCGCC GCGAGCCGGT GCTGACCCGC GACCGGTACC ACCAGCCCAC CCTGAAGGCG
GACGATCAGA CCCGCTGA
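
The GC content field can be recomputed from a gene sequence laid out in space-separated blocks like the one above. The following is a minimal sketch, assuming GC content is the fraction of G and C among the A/C/G/T characters. The record does not say how its 64% figure was computed or rounded, and `gc_fraction` is a hypothetical helper name.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and non-ACGT characters."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return sum(b in "GC" for b in bases) / len(bases)


# First two 10-base blocks of the gene sequence: 11 of 20 bases are G or C.
print(gc_fraction("ATGGCCAGCG AGCGATTCAA"))  # 0.55
```

Passing the full gene sequence, with its spaces and line breaks left in, gives a value to compare against the GC content field.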
 
Protein sequence
MASERFKQYS ILTANTFAFT IMFMVWTVFG EIGVPIQEEM GLSDTQFGLL TAVPILTGSL 
VRLPLGAWTD KYGGRPVFFI LMLAVLPGLW LVQFATEYWQ LLVLGAMVGL AGGSFSVGIT
YTAKWFPRER QGLAMGVFGA GNAGAAVTKY VAPTIIVVLS WQAVSNIYAA IVVVTAILFW
FFTYTNPEHR TADAPKLREQ IAVLNDPKVW KYCQYYSVVF GGYVGVSLYL TRYYMLEYGV
GIQLAALIAT IFVLPSGVIR AFGGWLSDRF GAHTVTWWCM WASLVSFFFM SYPPTDYTIH
GLDGQTTTLH LAMPVWLFTA LLFIVGIAWG IGKASVFKYL SDEYDRNLGL VSGIVGLAGG
MGGFLLPPMF GALIDLTGVP TTIWMLCFGF TLVSVVWMWW TERREPVLTR DRYHQPTLKA
DDQTR
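
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes the sense strand is given 5' to 3' starting at the first codon. It uses the amino acid assignments of translation table 11, which match the standard genetic code (the two tables differ in their permitted start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence.
first_line = "ATGGCCAGCG AGCGATTCAA ACAGTACTCC ATCCTGACCG CGAACACCTT TGCGTTCACG"
print(translate(first_line))  # MASERFKQYSILTANTFAFT
```

The output matches the first two blocks of the protein sequence, MASERFKQYS ILTANTFAFT.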