Gene Mlg_0147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0147 
Symbol 
ID4269278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp169987 
End bp171498 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content67% 
IMG OID638124871 
Productlipopolysaccharide biosynthesis 
Protein accessionYP_740992 
Protein GI114319309 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAGA TTTATCAGGA GGTCCTTCAG CAGGCGCGCG CCACCTGGCG CAAGCGTTGG 
TGGATCATCC CAATCGCGTG GCTGATCTGT CTGACGGGGT GGGCCTATAT CCAGACCATC
CCGGACACTT ACCAGTCGTC GGCCCGCGTC TACGTCAACA CCCAGTCGGT GCTCGACCCC
CTGCTTCGGG GTATGACCGT GCGGCCCGAC ACCGAGCAGC GGTTGCGGAT GATGACCCGC
ACCCTGCTCA GCCGCGACAA CCTGGAGCGC ATCGCCGAGG CCAGCGACCT TGGCGTGCTC
ACCGGCAGCG ACAACATCGA CAGCCAAGTG GGTGTCCTGC GCTCCCGACT CTCACTGGAC
GGCGGGCAGC GCGACAACAT TTACAACATC TCCTTCCGCC ACGGCGACCC GGAGGTGGCC
CACCGCGTCG TTCAGGAGAC AGTCAATCTC TTCATGGAAC GCGGCCTGGG CGACTCCCGG
CTGGACCTGA CCAGCTCCCG GCAGTTCATC GAGCGCCAGC TGGAGAACTA CGAGCGGCAA
CTGGAGGAGA AAGAGGCTGA GATCGAGCAG TTCAAGCGCG ATAACGCGGC CTATCTGAGC
GCCGGCGGCA GCTTCTACAA CCGCCTGGAG CAGGCCAAGG AGCGTCTGGA GCAAGCCCGG
CTGGAACACC GGGAGGTACA GCGCCGGGTG AACACCTTTG CCCAACGCAT CCGCGAGGGC
GGCACGTCGG CTGACGGCCT GGGGTACGAG AACCCTGAGC TGAAACAGCG CATCAGCCGC
CTTGAGAGCG AGCTGGACAC CCTGCGCCAG CGGTATACCG ACGAGCACCC CGATGTGAAG
TCGGCCCGCC GGGTGCTGGA CGAGTTGCGC ACCCAGATGG CCGAGGAGGC GGAGCAGTTC
GCGGCCTCCG GCGCCGACGG CCTAGACGGC GCCAGCCAGT CCCAGCATCC GCTGCAGATG
GCCCTGGCCG AGGCGCAGAG TCGCGCGGCG GCGCTGGAGA CACGGGTCGA GGAGTTCGAG
GACCGGGTCG CCCGCCTGGA GGCGCAGGTG GACCGGGTGC CGGCGGTGGA ATCCGAATTC
ACGTCGTTGA CCCGCAATTA CGACGTACTG AAGAACAGCT ACCGCCAGCT GCTCAGCACC
CGGGAGCGGG CGATCATGTC CGGAGAGGTG GAGACGCAGA CCGACTCGGT GGACTTCCGC
GTGCTCGAGC CGCCGCGTCT GCCCAGCAAC CCGGCCTCAC CCAACCGGCC GGCACTGGCC
AGCATGGTGC TCATCCTGGG GCTGGGTGCC GGCGGCGGTT TTGCCTTCCT GCTGGCGCAG
CTGCGCGGCA CCGTGAACAG CAACAGTCAA CTGGCCGAAC TGACCGGGCG CCCGGTGCTG
GGGCAGGTCT CCCGCGTGCG GACCCCGATC CGCCGCCGGC GGCGCATGCT GGAGCTGTTG
GTCTTCGCCA CCGCCACCGG CAGCCTGCTG GTCGCGTTCT TCGTGGTGGT CGGCGTTTAC
TTCTCCGGTT AG
 
Protein sequence
MEKIYQEVLQ QARATWRKRW WIIPIAWLIC LTGWAYIQTI PDTYQSSARV YVNTQSVLDP 
LLRGMTVRPD TEQRLRMMTR TLLSRDNLER IAEASDLGVL TGSDNIDSQV GVLRSRLSLD
GGQRDNIYNI SFRHGDPEVA HRVVQETVNL FMERGLGDSR LDLTSSRQFI ERQLENYERQ
LEEKEAEIEQ FKRDNAAYLS AGGSFYNRLE QAKERLEQAR LEHREVQRRV NTFAQRIREG
GTSADGLGYE NPELKQRISR LESELDTLRQ RYTDEHPDVK SARRVLDELR TQMAEEAEQF
AASGADGLDG ASQSQHPLQM ALAEAQSRAA ALETRVEEFE DRVARLEAQV DRVPAVESEF
TSLTRNYDVL KNSYRQLLST RERAIMSGEV ETQTDSVDFR VLEPPRLPSN PASPNRPALA
SMVLILGLGA GGGFAFLLAQ LRGTVNSNSQ LAELTGRPVL GQVSRVRTPI RRRRRMLELL
VFATATGSLL VAFFVVVGVY FSG