Gene Mlg_2574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2574 
Symbol 
ID4270283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2914337 
End bp2915356 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content68% 
IMG OID638127333 
Productextracellular solute-binding protein 
Protein accessionYP_743404 
Protein GI114321721 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0232421 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATCA AACAGGTTCT GACCGGATTG CTGGCCGCGC CGCTGGCCAT CGCTTTGGCC 
ATGCCGGCCA CCACCCTGGC CGACGACGAC ACCATCACCG TCTATTCCGC GCGCCAGGAG
CACCTGATCA AGCCGCTGTT CGACCGCTTC ACCGAGGAGA CCGGCATCCG GGTGCGCTAC
GTGACCGACA GCGCCGGTCC GCTGCTGGCC CGCCTCCAGC AGGAGGGGCG CCGCACCCCC
GCCGACATGT TGATGACAGT GGATGCCGGC AACCTCTGGC AGGCCGCCGA CCGGGGCGTG
CTCCGGCCCA TCGACTCCGA GCCGCTGCGG GAGGCCATTC CGGAACACCT GCGTGACCCG
GACGACCAGT GGTTCGGCCT GTCGGTGCGG GCACGGACCA TCATGTACGC GCCCGACCGC
GTCGATCCCG AGGAACTGTC CACCTACGAG GCCCTCGCCG ACCCGGAGTG GGAGGGCCGC
CTGTGCGTGC GCACCTCGCA GCACGTCTAC AACCAGTCGC TGGTCGCCAC CATGATCTCC
CACCACGGTG AGGAGCGGAC CCGGGAGGTG CTGGAGGGCT GGGTGAACAA CTTTGCGGAC
CGCCCCTTCT CCAACGACAC CTCGACCCTG CGCGCCATCG CCGCCGGCCA GTGTGATGTG
AGCATCACCA ACACCTACTA CCTGGGCCGG GTGCTGAAGG ACGACCCGGA CTTCCCGGTG
GCGCCCTACT GGCCCAACCA GGATGACGTG GGCGTTCACG TCAACGTCTC CGGTGCCGGT
GTCACCCGCC ATGCCGGCAA CCCTGAAGGG GCGCAGCGGC TCATCGAATG GCTCGCCAGC
GAGGCCGCGC AGAAGGACTT CGCCGCCCTG AACATGGAGT ATCCGGCGAA CCCGGAGATC
GGCCTGGACC CGATCGTCGC CGACTGGGGC GATTTCAAGG CCGATAACAT CAATGTCTCC
GAGGCCGGCC GGCTGCAGCG CCAGGCCGCC ATGCTGATGG ACCGGGTCGG CTGGCGCTGA
 
Protein sequence
MRIKQVLTGL LAAPLAIALA MPATTLADDD TITVYSARQE HLIKPLFDRF TEETGIRVRY 
VTDSAGPLLA RLQQEGRRTP ADMLMTVDAG NLWQAADRGV LRPIDSEPLR EAIPEHLRDP
DDQWFGLSVR ARTIMYAPDR VDPEELSTYE ALADPEWEGR LCVRTSQHVY NQSLVATMIS
HHGEERTREV LEGWVNNFAD RPFSNDTSTL RAIAAGQCDV SITNTYYLGR VLKDDPDFPV
APYWPNQDDV GVHVNVSGAG VTRHAGNPEG AQRLIEWLAS EAAQKDFAAL NMEYPANPEI
GLDPIVADWG DFKADNINVS EAGRLQRQAA MLMDRVGWR