Gene Mlg_0874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0874 
Symbol 
ID4269695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp990946 
End bp991932 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content66% 
IMG OID638125626 
ProductTRAP transporter solute receptor TAXI family protein 
Protein accessionYP_741718 
Protein GI114320035 
COG category[R] General function prediction only 
COG ID[COG2358] TRAP-type uncharacterized transport system, periplasmic component 
TIGRFAM ID[TIGR02122] TRAP transporter solute receptor, TAXI family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTAACA GCACCCGCAA TGTATTCGTA AGCCTGTGCG CCGGCGCGGC GCTGGCCGCC 
GGTGGCGCTG CCATCGCCGA CGACCGCGGC GACTGGCCGC GCAGCATCAC CGTGGGCACC
GCCAGCCAGG GCGGCACCTA CTTCATTTAC GGTTCCGGCT GGGCCAACAT GGTGGGCGAA
GCGCTGGACA TCAACGCCGG CGCCGAGGTC ACCGGCGGCC CGGTCCAGAA CGCCACCCTG
GTGCAGACCG GCGATCACCA ATTCGGCATG GTCACCATGG GCCCCGCGCT GGCGGCCTGG
GAGGGCGAAA GCGAACTGGC ACCGGGCCTG GAGCACAAGG ACATCCGCGC AGTGTTCCCC
ATGTACCAGA CGGCCTTCCA GGTCATCGCC CTGTCCGGGT CCGGCATTGA GAGTGTCGCG
GATCTCGACG GCAAAACCGT GGGCATCGGC CCGGCCGGCG GCACCGCTGA CATGTACTGG
CCCCAGTTCT TCGAGCAGCT CGGTCTGGAT GTGCGCACCC GTAACGGCGG CGCCTCCGAC
CAGGTGGGCC AGCTCCAGGA CGGCCTGATC GATGCCTTCG CCTTCGCCGC CGGCATCCCG
ATCTCCGCCT TCAGCCAGGC CGAGGCCCAG GCCGACGTCA ACATCTTCTC CATCGCTGAA
GCCGATCAGG AGGCCATTCT GGAGGCCTTC CCCGAGCTGG TCGGCTCCAG CGTCCCGGGC
GACGCCTACC AATCCCTGGA CGCCGATATC CCGGCCATCT CCATCTGGAA CTTTGCCATC
ACTCACAAGG ACATGCCGGA GAGTCTGGTC TATGGTGTGA CCAAGACGGT GATGGAAAAC
AACGATGAGA TGGTGCAGAT CCACGGCGCT TCCAAGGAAA CGCTGCCGGA GAACTGGGAG
GTCAACGACT GGCTTCCGTT CCACCCGGGC GCGGTGCGCT GGTTCGAAGA GAACGGATTC
GACATCCCGG ATGACCTGCG CGGCTAA
 
Protein sequence
MLNSTRNVFV SLCAGAALAA GGAAIADDRG DWPRSITVGT ASQGGTYFIY GSGWANMVGE 
ALDINAGAEV TGGPVQNATL VQTGDHQFGM VTMGPALAAW EGESELAPGL EHKDIRAVFP
MYQTAFQVIA LSGSGIESVA DLDGKTVGIG PAGGTADMYW PQFFEQLGLD VRTRNGGASD
QVGQLQDGLI DAFAFAAGIP ISAFSQAEAQ ADVNIFSIAE ADQEAILEAF PELVGSSVPG
DAYQSLDADI PAISIWNFAI THKDMPESLV YGVTKTVMEN NDEMVQIHGA SKETLPENWE
VNDWLPFHPG AVRWFEENGF DIPDDLRG