Gene Mlg_1434 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1434 
SymbollpxK 
ID4269244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1639205 
End bp1640200 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content74% 
IMG OID638126190 
Producttetraacyldisaccharide 4'-kinase 
Protein accessionYP_742273 
Protein GI114320590 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1663] Tetraacyldisaccharide-1-P 4'-kinase 
TIGRFAM ID[TIGR00682] tetraacyldisaccharide 4'-kinase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.463776 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.49685 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGAGC TGCCCGCCTT CTGGCTGCGC CGCCCGCCGG ACTGGCGTGC CCACGCGCTG 
CGGCCGCTGG CGGCCCTCTA TGGCGGGGTG ATGCGCCTGC GCCGCTATGG CTACCGCAAG
GGCTGGATCC GGCGCGGCCG GCTCCCCGTA CCCGTGGTGG TGGTGGGCAA TATCTTCGTC
GGCGGTACCG GCAAGACCCC GTTGGTGGCC TGGATCGCCG ATACCCTGGC GGCGATGGGC
CGGCGGCCCG GGATCGTCAG CCGCGGCTAC GGCGGCCGCT CCCGGGAGTG GCCGCGGCGG
GTGGCCGCCG ACAGCGACCC AGCTGAGGTG GGGGACGAAC CGCTGCTGCT GGCCCGCGGC
ACCGGCTGTC CGGTGGCGGT CGGGCCCGAT CGGGTGGCGG CCGCGCAACT GCTGCTGGCT
GCCGGCTGCG ACGTGGTGGT CAGCGACGAC GGCCTGCAGC ACTACCGCCT GCCGCGGGCG
CTGGAGCTGG TGGTCTGCGA CGGTCACCGG GGCCTGGGCA ACGGGCTCTG CCTGCCGGCC
GGTCCGCTGC GGGAACCGGC CGACCGCCTG GCCGACGTGG ACATGGTGAT CAGCAACGGA
CGCGCACCGG CATTGACGCC CTGGTGGTTC GAACTGGTGC CCGGTCCGCT CCGGCCACTG
GCCGCGGACG CAGCGCCGGA AGGGGGCCCG GAACCCGGCA CCACGGTGCA TGCGGTGGCC
GGCATCGGTC ACCCCGCGCG CTTCTTCGCC ACACTGGAAG GGCTCGGCTA CCGGGTGATC
CCGCACCCCT TCCCGGACCA CCACCCTTAT CGGGCCGGGG AGTTGCGCTT TGGGGATGAC
CGGCCGGTGA TCATGACCGA GAAGGACGCG GTCAAGTGCG CCGGCCTGGC GCCGGCGCGG
AGCTGGTTCC TGCCGGTGGA GGCGCGGCCC GAGCCCGCCA CGCGGGAGCG CCTGGAGGCC
AGCCTGGCCC GGCTGCACTC ACTGACCAAC AGGTGA
 
Protein sequence
MSELPAFWLR RPPDWRAHAL RPLAALYGGV MRLRRYGYRK GWIRRGRLPV PVVVVGNIFV 
GGTGKTPLVA WIADTLAAMG RRPGIVSRGY GGRSREWPRR VAADSDPAEV GDEPLLLARG
TGCPVAVGPD RVAAAQLLLA AGCDVVVSDD GLQHYRLPRA LELVVCDGHR GLGNGLCLPA
GPLREPADRL ADVDMVISNG RAPALTPWWF ELVPGPLRPL AADAAPEGGP EPGTTVHAVA
GIGHPARFFA TLEGLGYRVI PHPFPDHHPY RAGELRFGDD RPVIMTEKDA VKCAGLAPAR
SWFLPVEARP EPATRERLEA SLARLHSLTN R