Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2370 |
Symbol | |
ID | 4270709 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2689132 |
End bp | 2690253 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 638127128 |
Product | capsular polysaccharide biosynthesis protein-like protein |
Protein accession | YP_743200 |
Protein GI | 114321517 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4421] Capsular polysaccharide biosynthesis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0553875 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.0113185 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCCCT GGGTCAGGGC CGTGGATCGG CTCCTGGGCA CCGATTTCAA TAGCCACCCG GCCTGCGCCT GGCAGCCCAG CCCGGTGCCG TTGCACGCCA TCGAGCAGGC CGGCGACGGG GCCGTGGACC AGGCCGTAAA CTGGCGGCAG ACGCCCCCCA TCGCGCCGGT TGGGGTGGGG CGGGTGCGCC GGCCGTTGAT CCTGCTGCCG AGTCCGGCCC CGCAACTGGC GCATGACCCG CTGCGGCGCC GCGCCGTCAT CCTCACCAGC GGCGGGCGGC CCGTGCGCTA TCCGAAGTCT GCCCCGGCCC TGCGCCATCT GCTGCGCGGC GGTTGGCGCT ACAGTGCCCG GCTGGGCCGC GCCGCCCGCC CGGCTGTCCG GCTGGGCACC GTCGCCGTCC TCGGCAACCA CGATCCCGGC TGCAACAACT ACTACCACTG GTGGGCGGAC ACCCTCGCCG ACCTCTGGTT TCTGCGCGAG TCCGGCGTGG ACCTGGGCCG GGTCGACAGC TTCCTGATGG CCTATGGCGG CTACCCCTGG CAACAACAGT CCCTGGCCCT GTGCGGCATT GACCAGGAGC GGGTGGTGGC CTTTGCCGAC CACCCCGCGC TGACCGCGGA GCAGGCGCTT GTACCGGTGC GGAGCAGGGG GAGTTGGGTG TCGCCGGTCT GGCTGGCGAG GGCGCTGCGG GAGCTGACCG GGTGGCGGCC GCCGGCCGTC ACCACCCCGG GCCGTCGCAT CTACCTGTCG CGGCGCGATG CCCCTCGCCG GCAGGCGGCC AACGAGGCGG CGGTGGAGCG GCTGCTGGTG GATGAGTCGG GTTTCGAGAG TCACCAGTGC AGCGGCCTGA GCGTGCCCCG CCAGCAGGCC TTGTTCGCCG ACGCCGAGGT CATCGTGGCG CCCCACGGTG CGGCGCTCAC CAACCTCGTC TGGTGCCGCC CGGGTACCCG GGTGGTGGAA CTGGTCCCCG AGGGCCACCG CAACCCCTGC TTCCGTGACC TGGCCGCCCA GTCCGGCCTG GACTACCGCG CCATCCTCTG TCCGGCAACG GGTGCCGGGG GCGGCCTGAC TGCCGACATC CAGGTGCCGC TGGCGCGCCT GCGAGAGGCA CTGGCCGGGT GA
|
Protein sequence | MSPWVRAVDR LLGTDFNSHP ACAWQPSPVP LHAIEQAGDG AVDQAVNWRQ TPPIAPVGVG RVRRPLILLP SPAPQLAHDP LRRRAVILTS GGRPVRYPKS APALRHLLRG GWRYSARLGR AARPAVRLGT VAVLGNHDPG CNNYYHWWAD TLADLWFLRE SGVDLGRVDS FLMAYGGYPW QQQSLALCGI DQERVVAFAD HPALTAEQAL VPVRSRGSWV SPVWLARALR ELTGWRPPAV TTPGRRIYLS RRDAPRRQAA NEAAVERLLV DESGFESHQC SGLSVPRQQA LFADAEVIVA PHGAALTNLV WCRPGTRVVE LVPEGHRNPC FRDLAAQSGL DYRAILCPAT GAGGGLTADI QVPLARLREA LAG
|
| |