Gene Mlg_2731 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2731 
Symbol 
ID4270985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp3099903 
End bp3100919 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content68% 
IMG OID638127493 
Productextracellular solute-binding protein 
Protein accessionYP_743561 
Protein GI114321878 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.476445 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.605201 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACTA CTGATCTGAT CAAGGCCGGC CGCCTGGCCG CCACCGCCGC CCTCTGCGGT 
GCCACGCTGT TCGCCGGTAG CGCGAACGCC GAGCGGCTGC ACGAGATCCG CGACCGCGGC
ACACTGACCG TTGCCCTGTA CAACGACTTC GGGCCCTACT CCTGCGTCGG CACCGGCGGC
GAGCTGATCG GGGTTGACGT GGCCCTGGCC CGCGCCCTGG GCGAGAAGCT GGACCTGAAG
GTGGAGCTGG CCGGCTTCGG CGCCCAGGAC AGCATGGACC AGGACCTGGC CCTGCTGCAG
GACGAACAGG CCGAGGACGA GTGGGACGAG GCCCTGCTCG AGCGCGCCCC CGACCTGATG
ATGCACGTAC CGGTGGACCC CGTCTTCCAG GAGCGCAACG CCGATTACGA CATCATGGGC
GCCTACTTCC ACGAGGCCAT GGCGGTGCTC TACGACCGCG AGGAAATCGG CGATCTGGGG
TACTCGGTGA ACACCCCGGA CCCGTTCGAT GGCCTGCGCG TGGGTGTGGA GATGTACACC
TACTCCTACA CCATGCTCAC CAACGGCTTT GACGGCCGCC TGCGCTCCGG CGTGGTCAAC
CACAAGAACG TGCCCGAAGC AGTGGAGGCC CTGCTTGCCG GTGAGACCTC CGCGGTTTTC
GCACCGCGCG GCGAGCTTCA GAGCGCCCTG GCCGCCTTCC CCGAACCGCG CACCAGCCTC
GCCCTGAGCG AGCTGCGCGA CCTGTTCCGC ACGGACCGGG TGCGTAGCGA CTGGGACGTG
GGCATGGCCG TCAAGGCCGG CAACCCGGAG CTCTCCCGGG CAGTCGAGGA GGCCATGGCA
CAACTGGTGG CCGACGGGAC CGTGGAGCGT ATCTTTAACG AATACGGCAT CGCCTGGGTG
GGCCCGGGGG AGGAGTACCG CCTAGCCCGC AACGGCAATG GCCCCGCGGG CGTGACCCGC
ACCGGCCTGG AACGCGCGCA GCTCTGCCGG GCAACAATGC CGGCCGGACT GTACTGA
 
Protein sequence
MNTTDLIKAG RLAATAALCG ATLFAGSANA ERLHEIRDRG TLTVALYNDF GPYSCVGTGG 
ELIGVDVALA RALGEKLDLK VELAGFGAQD SMDQDLALLQ DEQAEDEWDE ALLERAPDLM
MHVPVDPVFQ ERNADYDIMG AYFHEAMAVL YDREEIGDLG YSVNTPDPFD GLRVGVEMYT
YSYTMLTNGF DGRLRSGVVN HKNVPEAVEA LLAGETSAVF APRGELQSAL AAFPEPRTSL
ALSELRDLFR TDRVRSDWDV GMAVKAGNPE LSRAVEEAMA QLVADGTVER IFNEYGIAWV
GPGEEYRLAR NGNGPAGVTR TGLERAQLCR ATMPAGLY