Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0147 |
Symbol | |
ID | 4269278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 169987 |
End bp | 171498 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638124871 |
Product | lipopolysaccharide biosynthesis |
Protein accession | YP_740992 |
Protein GI | 114319309 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAGA TTTATCAGGA GGTCCTTCAG CAGGCGCGCG CCACCTGGCG CAAGCGTTGG TGGATCATCC CAATCGCGTG GCTGATCTGT CTGACGGGGT GGGCCTATAT CCAGACCATC CCGGACACTT ACCAGTCGTC GGCCCGCGTC TACGTCAACA CCCAGTCGGT GCTCGACCCC CTGCTTCGGG GTATGACCGT GCGGCCCGAC ACCGAGCAGC GGTTGCGGAT GATGACCCGC ACCCTGCTCA GCCGCGACAA CCTGGAGCGC ATCGCCGAGG CCAGCGACCT TGGCGTGCTC ACCGGCAGCG ACAACATCGA CAGCCAAGTG GGTGTCCTGC GCTCCCGACT CTCACTGGAC GGCGGGCAGC GCGACAACAT TTACAACATC TCCTTCCGCC ACGGCGACCC GGAGGTGGCC CACCGCGTCG TTCAGGAGAC AGTCAATCTC TTCATGGAAC GCGGCCTGGG CGACTCCCGG CTGGACCTGA CCAGCTCCCG GCAGTTCATC GAGCGCCAGC TGGAGAACTA CGAGCGGCAA CTGGAGGAGA AAGAGGCTGA GATCGAGCAG TTCAAGCGCG ATAACGCGGC CTATCTGAGC GCCGGCGGCA GCTTCTACAA CCGCCTGGAG CAGGCCAAGG AGCGTCTGGA GCAAGCCCGG CTGGAACACC GGGAGGTACA GCGCCGGGTG AACACCTTTG CCCAACGCAT CCGCGAGGGC GGCACGTCGG CTGACGGCCT GGGGTACGAG AACCCTGAGC TGAAACAGCG CATCAGCCGC CTTGAGAGCG AGCTGGACAC CCTGCGCCAG CGGTATACCG ACGAGCACCC CGATGTGAAG TCGGCCCGCC GGGTGCTGGA CGAGTTGCGC ACCCAGATGG CCGAGGAGGC GGAGCAGTTC GCGGCCTCCG GCGCCGACGG CCTAGACGGC GCCAGCCAGT CCCAGCATCC GCTGCAGATG GCCCTGGCCG AGGCGCAGAG TCGCGCGGCG GCGCTGGAGA CACGGGTCGA GGAGTTCGAG GACCGGGTCG CCCGCCTGGA GGCGCAGGTG GACCGGGTGC CGGCGGTGGA ATCCGAATTC ACGTCGTTGA CCCGCAATTA CGACGTACTG AAGAACAGCT ACCGCCAGCT GCTCAGCACC CGGGAGCGGG CGATCATGTC CGGAGAGGTG GAGACGCAGA CCGACTCGGT GGACTTCCGC GTGCTCGAGC CGCCGCGTCT GCCCAGCAAC CCGGCCTCAC CCAACCGGCC GGCACTGGCC AGCATGGTGC TCATCCTGGG GCTGGGTGCC GGCGGCGGTT TTGCCTTCCT GCTGGCGCAG CTGCGCGGCA CCGTGAACAG CAACAGTCAA CTGGCCGAAC TGACCGGGCG CCCGGTGCTG GGGCAGGTCT CCCGCGTGCG GACCCCGATC CGCCGCCGGC GGCGCATGCT GGAGCTGTTG GTCTTCGCCA CCGCCACCGG CAGCCTGCTG GTCGCGTTCT TCGTGGTGGT CGGCGTTTAC TTCTCCGGTT AG
|
Protein sequence | MEKIYQEVLQ QARATWRKRW WIIPIAWLIC LTGWAYIQTI PDTYQSSARV YVNTQSVLDP LLRGMTVRPD TEQRLRMMTR TLLSRDNLER IAEASDLGVL TGSDNIDSQV GVLRSRLSLD GGQRDNIYNI SFRHGDPEVA HRVVQETVNL FMERGLGDSR LDLTSSRQFI ERQLENYERQ LEEKEAEIEQ FKRDNAAYLS AGGSFYNRLE QAKERLEQAR LEHREVQRRV NTFAQRIREG GTSADGLGYE NPELKQRISR LESELDTLRQ RYTDEHPDVK SARRVLDELR TQMAEEAEQF AASGADGLDG ASQSQHPLQM ALAEAQSRAA ALETRVEEFE DRVARLEAQV DRVPAVESEF TSLTRNYDVL KNSYRQLLST RERAIMSGEV ETQTDSVDFR VLEPPRLPSN PASPNRPALA SMVLILGLGA GGGFAFLLAQ LRGTVNSNSQ LAELTGRPVL GQVSRVRTPI RRRRRMLELL VFATATGSLL VAFFVVVGVY FSG
|
| |