Gene Mlg_1007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1007 
Symbol 
ID4268375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1146695 
End bp1148203 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content62% 
IMG OID638125758 
Productmajor facilitator transporter 
Protein accessionYP_741850 
Protein GI114320167 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2223] Nitrate/nitrite transporter 
TIGRFAM ID[TIGR00886] nitrite extrusion protein (nitrite facilitator) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0945762 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTCG TTGACGTCTT CAAATTTCGC AGTCCCGAGA TAAAGGCCCT CCACCTCACC 
TGGATTGCGT TCTTCATCAC CTTCTACGTC TGGTTCAACA TGGCCCCGCT GGCCACCAGC
ATGCTACGCA GCGTGGACTG GTTGACCCAG GATGACATCC GATTATTCGC CATCTGTAAC
GTCGCCCTCA CCATCCCCGC CCGCATCATC GTCGGCATGG CCTTGGACCG CTTCGGCCCG
CGCCGGGTGT TCTCGATCCT GATGATCCTG ATGGCCCTGC CGGCGCTGGC GTTCGCCTTC
GGCAACAACA TGACGCAGCT ACTGATCTCG CGGCTGGTGT TGAGTTCGGT AGGCGCCAGC
TTCGTGGTGG GCATTCATAT GACCGCGCTC TGGTTCCGGC CCCGGGATAT CGGCTTCGCC
GAGGGCTTCT ACGCCGGCTG GGGTAACTTC GGTTCGGCCG CGGCCGCGAT GACCCTGCCC
ACCATCGCCC TCACCCTGTT CGGCGGTGAG GACGGCTGGC GCTGGGCCAT GGCGGTGAGT
GCCCTGGTCA TGGCCGGTTA CGGGGTCTTC TACTGGTTCG CCATCACCGA CGGCCCGCAC
GCCACCTCAC ACAAGCGCAC CCGTAACGCC ATGGCCATGG AGGTCAGCAG TTGGGGCGAC
ATGATCAAGC TGATCATCTG CACCCTTCCC CTGGTGGGCG TCCTTGCTCT CCTGGTGTGG
CGCATCGAAC AGATGGGCTA CCTCAGCACT ACCGGTGCCA CCATCCTGTA CCTGGTCATC
GGCGGCATCG TGCTCTACCA GGTCGTCCAG ATCTTCCGGG TCAATGTCCC GATCCTGAAA
AAAGGCGTGC CCAAGGACGA CAAGTACCAC TTCAACAGCG TCATCGCGCT GAACAGTACC
TACTTCGCCA ACTTCGGGGC GGAACTGGCG GTGGTCTCCA TGCTGCCGAT GTTCTTCGAG
CAGACCTGGG GGCTGGGTGC CGCCGCCGCG GGCGCCATCG CCGCCTCCTT TGCGTTCGTC
AACCTCGTCG CACGCCCCAT GGGCGGCCTG GTCTCCGATC GCATGGGCAA CCGGCGCTTC
GTGATGCTGT GCTACATGTT CGGGATTGGT ATCGGCTTCG TGCTCATGGG CCTGTTGGAC
TCCAACTGGC CACTGATCGT TGCCATCGCC ATCACGATCT TCACCTCCTT CTTCGTACAG
GGCTCAGAGG GGGCGACCTT CGGGATCATC CCGTCGATCA AGCGCCGGAT CACCGGCCAG
ATCTCGGGCA TGGCGGGGGC GTACGGCAAT GTGGGTGCGG TGGTCTACCT GACCATCTTC
ACCTTCGTCA CCCCGACCCA GTTCTTCTTC ATCATCGCCA CCGGCGCGTT CCTGAGCTGG
CTGATCTGCC TGCTCCTGCT GAAGGAGCCG GAAGGCGCCT TTGCCGAGGA CTACCACGTC
TCATCGGTGG ACCGCATGAT CGAGGAAGAG GACCTGAAAC GGGAGCGCCA GAAGGCCTGG
GCGCGATAA
 
Protein sequence
MKVVDVFKFR SPEIKALHLT WIAFFITFYV WFNMAPLATS MLRSVDWLTQ DDIRLFAICN 
VALTIPARII VGMALDRFGP RRVFSILMIL MALPALAFAF GNNMTQLLIS RLVLSSVGAS
FVVGIHMTAL WFRPRDIGFA EGFYAGWGNF GSAAAAMTLP TIALTLFGGE DGWRWAMAVS
ALVMAGYGVF YWFAITDGPH ATSHKRTRNA MAMEVSSWGD MIKLIICTLP LVGVLALLVW
RIEQMGYLST TGATILYLVI GGIVLYQVVQ IFRVNVPILK KGVPKDDKYH FNSVIALNST
YFANFGAELA VVSMLPMFFE QTWGLGAAAA GAIAASFAFV NLVARPMGGL VSDRMGNRRF
VMLCYMFGIG IGFVLMGLLD SNWPLIVAIA ITIFTSFFVQ GSEGATFGII PSIKRRITGQ
ISGMAGAYGN VGAVVYLTIF TFVTPTQFFF IIATGAFLSW LICLLLLKEP EGAFAEDYHV
SSVDRMIEEE DLKRERQKAW AR