Gene Mlg_0428 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0428 
Symbol 
ID4268281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp476165 
End bp478429 
Gene Length2265 bp 
Protein Length754 aa 
Translation table11 
GC content67% 
IMG OID638125158 
Productphosphoenolpyruvate-protein phosphotransferase PtsP 
Protein accessionYP_741272 
Protein GI114319589 
COG category[T] Signal transduction mechanisms 
COG ID[COG3605] Signal transduction protein containing GAF and PtsI domains 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0167147 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGAGG TTCTGCACCG CATTTCACGC GAGGTCAACG CCGCACCCGG GCTGCGCCAG 
GCTCTGAGCA TCATCGTCAA GCGGGTCGCT GACGCCATGA AGGTGGACGT CTGCTCGGTC
TACCTGATGG ACCCGGAGAC CGAGCGCCTG GTGTTGATGG ACACCCGTGG CCTCAATCCC
GATGCGGTTG GCCGGGTACG GCTGCGCCTC TCCGAAGGGC TGGTGGGCAT GGTGGCCGAG
CGCGGCGAGC CGATCAATCT GGACAACGCT TACGACCACC CCCGCTTCCG CTATTTCCCC
GAGACCGGGG AGGAGCTGTT TCACTCCTTT CTCGGCGTGC CCATCGTCCA CTACCGCAAG
CTGCTGGGCG TACTGGTGGT GCAGCAGATG GCGGAGCGCC GTTTCGACGA GGATGACGTT
GCCTTTCTGG TGACGCTGGG TGCCCAGCTG GCCGGCGCCA TCGCCCACGC CGAGGCCAGC
GGTGGCCTGG ACGGGTTGCG TGACGGGGAC GTGGCGCAGG GGCGCATGCT CCAGGGCCTG
CCCGCTGCAC CGGGCGTCGC CATGGGGACC GCGGTGGTGC ACTCGGCCCA GGTGGACCTG
GAGTCGGTGC CGGACCGCGA CCCGGAGGAC ATCGAGGAGG AAAAGCGCCA GTTCATGGTC
GCGGTGCGGG CGGTCCGTGA GGAGCTGGGG CGCCTTTCCA AAGAGTTGGA GGGCACCCTG
TCCTCCGACG AGCACATGCT GTTCGATGTC TACTTGCGCA TGCTCGACGA CGACAGTCTG
GTCGGGGAAA CGCTCCGGGC CATCGAGGCG GGTAACTGGG CGCCCGGTGC CCTGCGCGAG
ATTATCGGCG GCCATGTGCA GGTCTTTCAG GACATGGAGG ACCCCTACCT GCGGGAACGC
GCCAGCGACA TCCGCGACCT GGGCAGCCGC ATCCTGGCGC GCCTGCGGGA GGACACCCGC
CGGAGCCGGC CCTTGCCCGA GCGGGTGATC CTGGTGGGCA AGGAGGTGAG CGCGACCCAG
TTGGCCGAGG TGCCGCAGGA TCACCTCGCG GGGGTGGTCT CCGCCTCCGG CACCCGCAAC
TCCCATGTGG CGATCCTGGC CCGGGCGCTG AGTGTGCCGG CGATTATGGG GGCCTCCGAC
CTGAGTACCG GGCGTTTGGA TGGCAAGCCG GTGATCGTCG ACGGCTATTC CGGGCGTTTC
TACGTCCAGC CCAGCGATGC GGTGCGCGAG GAGTACGAGC GTCTGGCCCG GGACGAGGCG
GAGTTTGCCG ATAGCCTGGA AAGCCTCAAG GATGAGCCGG CGGAGACGCC GGACGGGTAT
CGCATCAAGC TGCTGGCCAA TACCGGGCTG ATCTCGGACA TTAACCTGTC GCTGGCCAGC
GGTTGTGACG GCATCGGCCT GCACCGCACC GAGTTCCCCT TCATCGTGCG CGACCGCTTC
CCCGGTGAAG AGGAGCAGGC CATGCTCTAT CGCCGGGTGT TGGAGTATTT CCGCCCACGC
CCGGTGGTGT TGCGGACGCT GGATATCGGC GGTGACAAGT CCCTGCCCTA TTTCCCGGTC
CATGAGGAGA ACCCCTTCCT GGGCTGGCGG GGCATCCGGC TCACCCTCGA CCACCCGGAG
ATCTTCCTCA CCCAGCTTCG GGCGATGGTG CGGGCCAACA TCGGCAACGG CAATCTGCGG
GTCATGTTCC CCATGATCAG CCGTCTGCAC GAGGTGGATG AGGCCAAGGC GCTGCTCGCC
CGCGCGGTGG AGGAGCTGCG CGAGGAGGGC ATGGAGGTGG AGATGCCCCC GGTGGGCGTC
ATGGTGGAGG TGCCTGCCGC CGTTGCCATG GCCGACAAGC TGGCCCAGCG GGTGGCCTTC
CTGTCCGTGG GCACCAATGA CCTCACCCAG TACCTGCTGG CGGTGGATCG CAACAATGCC
CGGGTGGCGG CGCTTTACGA CGAGCTGCAC CCGGCGGTCC TCAACGCCAT TGCCCAGGTG
GTGGACTCAG GCCGGCAGTA TCGCTGTCCG GTCAGTGTCT GTGGCGGTAT GGCGGGTGAT
CCGGCCGGTG CCATCCTGCT CCTCGCTCTG GGCGTCAGCA GCCTGAGCAT GAGCGTGGCC
AGCCTGCTGC GCATCAAGTG GGTGGTGCGC AGTATCAGCC GGGAGAGGGC CACGGAGCTG
CTTACCCTGG CGCTGGATAT GGAATCACCC GATGACGTGC GGGCGATGCT GAAACGGGCC
CTGGACGAGC AGGGGCTGGG TGGCCTGATC CGCGCCGGGA AGTGA
 
Protein sequence
MLEVLHRISR EVNAAPGLRQ ALSIIVKRVA DAMKVDVCSV YLMDPETERL VLMDTRGLNP 
DAVGRVRLRL SEGLVGMVAE RGEPINLDNA YDHPRFRYFP ETGEELFHSF LGVPIVHYRK
LLGVLVVQQM AERRFDEDDV AFLVTLGAQL AGAIAHAEAS GGLDGLRDGD VAQGRMLQGL
PAAPGVAMGT AVVHSAQVDL ESVPDRDPED IEEEKRQFMV AVRAVREELG RLSKELEGTL
SSDEHMLFDV YLRMLDDDSL VGETLRAIEA GNWAPGALRE IIGGHVQVFQ DMEDPYLRER
ASDIRDLGSR ILARLREDTR RSRPLPERVI LVGKEVSATQ LAEVPQDHLA GVVSASGTRN
SHVAILARAL SVPAIMGASD LSTGRLDGKP VIVDGYSGRF YVQPSDAVRE EYERLARDEA
EFADSLESLK DEPAETPDGY RIKLLANTGL ISDINLSLAS GCDGIGLHRT EFPFIVRDRF
PGEEEQAMLY RRVLEYFRPR PVVLRTLDIG GDKSLPYFPV HEENPFLGWR GIRLTLDHPE
IFLTQLRAMV RANIGNGNLR VMFPMISRLH EVDEAKALLA RAVEELREEG MEVEMPPVGV
MVEVPAAVAM ADKLAQRVAF LSVGTNDLTQ YLLAVDRNNA RVAALYDELH PAVLNAIAQV
VDSGRQYRCP VSVCGGMAGD PAGAILLLAL GVSSLSMSVA SLLRIKWVVR SISRERATEL
LTLALDMESP DDVRAMLKRA LDEQGLGGLI RAGK