Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2574 |
Symbol | |
ID | 4270283 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2914337 |
End bp | 2915356 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638127333 |
Product | extracellular solute-binding protein |
Protein accession | YP_743404 |
Protein GI | 114321721 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0232421 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCATCA AACAGGTTCT GACCGGATTG CTGGCCGCGC CGCTGGCCAT CGCTTTGGCC ATGCCGGCCA CCACCCTGGC CGACGACGAC ACCATCACCG TCTATTCCGC GCGCCAGGAG CACCTGATCA AGCCGCTGTT CGACCGCTTC ACCGAGGAGA CCGGCATCCG GGTGCGCTAC GTGACCGACA GCGCCGGTCC GCTGCTGGCC CGCCTCCAGC AGGAGGGGCG CCGCACCCCC GCCGACATGT TGATGACAGT GGATGCCGGC AACCTCTGGC AGGCCGCCGA CCGGGGCGTG CTCCGGCCCA TCGACTCCGA GCCGCTGCGG GAGGCCATTC CGGAACACCT GCGTGACCCG GACGACCAGT GGTTCGGCCT GTCGGTGCGG GCACGGACCA TCATGTACGC GCCCGACCGC GTCGATCCCG AGGAACTGTC CACCTACGAG GCCCTCGCCG ACCCGGAGTG GGAGGGCCGC CTGTGCGTGC GCACCTCGCA GCACGTCTAC AACCAGTCGC TGGTCGCCAC CATGATCTCC CACCACGGTG AGGAGCGGAC CCGGGAGGTG CTGGAGGGCT GGGTGAACAA CTTTGCGGAC CGCCCCTTCT CCAACGACAC CTCGACCCTG CGCGCCATCG CCGCCGGCCA GTGTGATGTG AGCATCACCA ACACCTACTA CCTGGGCCGG GTGCTGAAGG ACGACCCGGA CTTCCCGGTG GCGCCCTACT GGCCCAACCA GGATGACGTG GGCGTTCACG TCAACGTCTC CGGTGCCGGT GTCACCCGCC ATGCCGGCAA CCCTGAAGGG GCGCAGCGGC TCATCGAATG GCTCGCCAGC GAGGCCGCGC AGAAGGACTT CGCCGCCCTG AACATGGAGT ATCCGGCGAA CCCGGAGATC GGCCTGGACC CGATCGTCGC CGACTGGGGC GATTTCAAGG CCGATAACAT CAATGTCTCC GAGGCCGGCC GGCTGCAGCG CCAGGCCGCC ATGCTGATGG ACCGGGTCGG CTGGCGCTGA
|
Protein sequence | MRIKQVLTGL LAAPLAIALA MPATTLADDD TITVYSARQE HLIKPLFDRF TEETGIRVRY VTDSAGPLLA RLQQEGRRTP ADMLMTVDAG NLWQAADRGV LRPIDSEPLR EAIPEHLRDP DDQWFGLSVR ARTIMYAPDR VDPEELSTYE ALADPEWEGR LCVRTSQHVY NQSLVATMIS HHGEERTREV LEGWVNNFAD RPFSNDTSTL RAIAAGQCDV SITNTYYLGR VLKDDPDFPV APYWPNQDDV GVHVNVSGAG VTRHAGNPEG AQRLIEWLAS EAAQKDFAAL NMEYPANPEI GLDPIVADWG DFKADNINVS EAGRLQRQAA MLMDRVGWR
|
| |