Gene Mlg_1703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1703 
Symbol 
ID4269789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1949259 
End bp1950662 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content63% 
IMG OID638126461 
Productnitrate transporter 
Protein accessionYP_742539 
Protein GI114320856 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.543775 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAAGG ACCATGAAAA GCGCCTGAAC CGGCGCAGCT TTCTGAAGAC CGGGGGGCTC 
CTCAGTGGCG CGGCACTGAT GGGCATGGTA GCGCCCGGCG TGCGCACCGG CGCCTGGGCC
GGGGGAAGTG ACGGCCTGGA AGTGACCCGG GTAGCCCTGG GGTTCATTCC GCTGACCGAT
TGCGCCCCCC TGGTCATTGC CCGTGAAAAG GGCTTCTTCC AGGCGCACGG CCTCGACGTG
CAGGTGTCGA AGGAAGCCTC CTGGGCCAAC ATCCGTGACA AGGTCTCCGT CGGTGCATTG
GATGGCGGCC ACATGCTGGC GGGCATGCCC ATCGCCTCCA CCCTCGGCGT GGGCGCCACC
CGCACCCCCA TGGTCACCGG CTTCAGCATG GACCTGAACG GCAATGCCAT TACGGTCTCG
AACGCGCTCT ACGACCGCAT GATGGAGGCC GATCCCGAGG CGATGCAGGA ACGCCCCATC
ACTGCTCGGG CCCTCAAAAA GGTCATCGAC CAGGACAAGG CGGCCGGTCG GGCACCACTG
ACCTTCGCAA TGGTCTTCCC CGTCTCCACA CACAACTACG AGCTGCGCTA CTGGATGGCC
GCCGCAGGCA TCAACCCCGA CGAAGATGTC CGGCTGGTGG TCATCCCACC ACCGCAGATG
GTGGCCAACC TGCGGGCCCG GAACATGGAT GGCTACTGTG TGGGTGAACC TTGGAACATG
CGCGCCGTAG ACATGGGGAT CGGCAAGGTG CTGGTCACCA ACTATGAGAT CTGGAACAAC
AACCCGGAAA AAGTGCTCGG CCTAACGGAA GAATGGGTTG AGAAACACCC CAATACCCAC
AGAGCCTTGC TGCAGGCATT GATCCAGACC AATAAGTGGA TGGACGAGCC CGAGAACCGC
GAGGAGGTCG TGGACATCAT CTCCCGCCGG GCCTACGTCA ACGCACCACC CCATGTGGTG
GGCATGTCTA TGAAGGGCAC ACTGCAGTTC CAGCGGGACG AGAAGCCGCG CCCGTTCCCG
GATTTCAACG TCTTCCACCG ATATGCCGCC ACCTACCCGT GGCGCTCCCA CGCCGTCTGG
TTCCTGACCC AGATGGTGCG TTGGGGTCAG CTACGCGAGC CGGAGGATCT GCTCAAAGTC
GCCGAGCGGG TCTACCGGCC CGACCTTTAT CGTGAGGCCG CCCGTGCCGT GGGCGAGCCC
TACCCCACCA TCGATTACAA AAGCGAAGGC ACGCGGGATA AGCCTTGGAC CCTCCAGGAG
GCCAGCCACC CCATTGAGAT GGGGCCGGAC CGCTTCCTCG ATGGCCGCAC TTTCGACCCC
CGGGATGTCA TGGCTTACCT GGAAGGCTTT GATGTGCACT CCCGGGCCCT GAGCCTGCCG
CAACTGGCCG AATTGAACGG CTGA
 
Protein sequence
MDKDHEKRLN RRSFLKTGGL LSGAALMGMV APGVRTGAWA GGSDGLEVTR VALGFIPLTD 
CAPLVIAREK GFFQAHGLDV QVSKEASWAN IRDKVSVGAL DGGHMLAGMP IASTLGVGAT
RTPMVTGFSM DLNGNAITVS NALYDRMMEA DPEAMQERPI TARALKKVID QDKAAGRAPL
TFAMVFPVST HNYELRYWMA AAGINPDEDV RLVVIPPPQM VANLRARNMD GYCVGEPWNM
RAVDMGIGKV LVTNYEIWNN NPEKVLGLTE EWVEKHPNTH RALLQALIQT NKWMDEPENR
EEVVDIISRR AYVNAPPHVV GMSMKGTLQF QRDEKPRPFP DFNVFHRYAA TYPWRSHAVW
FLTQMVRWGQ LREPEDLLKV AERVYRPDLY REAARAVGEP YPTIDYKSEG TRDKPWTLQE
ASHPIEMGPD RFLDGRTFDP RDVMAYLEGF DVHSRALSLP QLAELNG