Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1703 |
Symbol | |
ID | 4269789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1949259 |
End bp | 1950662 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638126461 |
Product | nitrate transporter |
Protein accession | YP_742539 |
Protein GI | 114320856 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.543775 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAAGG ACCATGAAAA GCGCCTGAAC CGGCGCAGCT TTCTGAAGAC CGGGGGGCTC CTCAGTGGCG CGGCACTGAT GGGCATGGTA GCGCCCGGCG TGCGCACCGG CGCCTGGGCC GGGGGAAGTG ACGGCCTGGA AGTGACCCGG GTAGCCCTGG GGTTCATTCC GCTGACCGAT TGCGCCCCCC TGGTCATTGC CCGTGAAAAG GGCTTCTTCC AGGCGCACGG CCTCGACGTG CAGGTGTCGA AGGAAGCCTC CTGGGCCAAC ATCCGTGACA AGGTCTCCGT CGGTGCATTG GATGGCGGCC ACATGCTGGC GGGCATGCCC ATCGCCTCCA CCCTCGGCGT GGGCGCCACC CGCACCCCCA TGGTCACCGG CTTCAGCATG GACCTGAACG GCAATGCCAT TACGGTCTCG AACGCGCTCT ACGACCGCAT GATGGAGGCC GATCCCGAGG CGATGCAGGA ACGCCCCATC ACTGCTCGGG CCCTCAAAAA GGTCATCGAC CAGGACAAGG CGGCCGGTCG GGCACCACTG ACCTTCGCAA TGGTCTTCCC CGTCTCCACA CACAACTACG AGCTGCGCTA CTGGATGGCC GCCGCAGGCA TCAACCCCGA CGAAGATGTC CGGCTGGTGG TCATCCCACC ACCGCAGATG GTGGCCAACC TGCGGGCCCG GAACATGGAT GGCTACTGTG TGGGTGAACC TTGGAACATG CGCGCCGTAG ACATGGGGAT CGGCAAGGTG CTGGTCACCA ACTATGAGAT CTGGAACAAC AACCCGGAAA AAGTGCTCGG CCTAACGGAA GAATGGGTTG AGAAACACCC CAATACCCAC AGAGCCTTGC TGCAGGCATT GATCCAGACC AATAAGTGGA TGGACGAGCC CGAGAACCGC GAGGAGGTCG TGGACATCAT CTCCCGCCGG GCCTACGTCA ACGCACCACC CCATGTGGTG GGCATGTCTA TGAAGGGCAC ACTGCAGTTC CAGCGGGACG AGAAGCCGCG CCCGTTCCCG GATTTCAACG TCTTCCACCG ATATGCCGCC ACCTACCCGT GGCGCTCCCA CGCCGTCTGG TTCCTGACCC AGATGGTGCG TTGGGGTCAG CTACGCGAGC CGGAGGATCT GCTCAAAGTC GCCGAGCGGG TCTACCGGCC CGACCTTTAT CGTGAGGCCG CCCGTGCCGT GGGCGAGCCC TACCCCACCA TCGATTACAA AAGCGAAGGC ACGCGGGATA AGCCTTGGAC CCTCCAGGAG GCCAGCCACC CCATTGAGAT GGGGCCGGAC CGCTTCCTCG ATGGCCGCAC TTTCGACCCC CGGGATGTCA TGGCTTACCT GGAAGGCTTT GATGTGCACT CCCGGGCCCT GAGCCTGCCG CAACTGGCCG AATTGAACGG CTGA
|
Protein sequence | MDKDHEKRLN RRSFLKTGGL LSGAALMGMV APGVRTGAWA GGSDGLEVTR VALGFIPLTD CAPLVIAREK GFFQAHGLDV QVSKEASWAN IRDKVSVGAL DGGHMLAGMP IASTLGVGAT RTPMVTGFSM DLNGNAITVS NALYDRMMEA DPEAMQERPI TARALKKVID QDKAAGRAPL TFAMVFPVST HNYELRYWMA AAGINPDEDV RLVVIPPPQM VANLRARNMD GYCVGEPWNM RAVDMGIGKV LVTNYEIWNN NPEKVLGLTE EWVEKHPNTH RALLQALIQT NKWMDEPENR EEVVDIISRR AYVNAPPHVV GMSMKGTLQF QRDEKPRPFP DFNVFHRYAA TYPWRSHAVW FLTQMVRWGQ LREPEDLLKV AERVYRPDLY REAARAVGEP YPTIDYKSEG TRDKPWTLQE ASHPIEMGPD RFLDGRTFDP RDVMAYLEGF DVHSRALSLP QLAELNG
|
| |