Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2322 |
Symbol | |
ID | 4270577 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2632647 |
End bp | 2633645 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638127080 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_743152 |
Protein GI | 114321469 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | [TIGR03589] UDP-N-acetylglucosamine 4,6-dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.0743791 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCAACG ACTCATCCGT TCTCGTCACC GGCGGGACAG GCTCCTTCGG CCACACCTTC ATCCCCATGC TCCTGGAGCG TTACAACCCC AGGCGCGTGA TCATTTTCTC TCGGGACGAG ATGAAGCAGT GGGAGATGGC CAAGAAGTTC GAGGGCGATC CGCGCGTGCG GTTCTTCATC GGCGACGTGC GCGACCGCGA ACGTGTCTAC CGCGCCTTTG ACGGCGTGGA TTACGTCGTG CATGCCGCCG CCTCGAAGAT CGTGCCCACG GCCGAGTACA ACCCGTTCGA ATGCATCAAG ACGAACGTCA CGGGTGCGAT GAACGTCATC GACGCCGCCA TCGACAAGGG CGTCAAGAAG GTCGTCGCGC TCTCCACCGA TAAGGCCAGC AGCCCGATCA ACCTCTACGG CGCCACCAAG CTCACCTCCG ACAAGCTCTT CGTCGCGGGC AACCACTACG CCGGTCACTC GGAGACGCGC TTCTCCGTGG TCCGTTACGG CAATGTCATG GGGTCCCGCG GTTCGGTGAT CCCGTTCTTC ATGTCCATCC GCGACCGGGG CGTGCTACCG ATCACCGACG ATCGCATGAC CCGCTTCATG ATCTCGCTAG AGCAGGGCGC GGAATTGGTC TGGCATGCCT TCGATGACAT GGAAGGCGGC GAGATCTACG TCAAGAAGAT CCCCTCCATG AAGGTCACCG ACCTCGCCCG CGTGATCGCC CCCGCGGCCA GGCAGGAAAT CGTCGGCATC CGGCCGGGCG AGAAGCTCCA CGAGCAGATG ATCGGCGGGG AGGATGCCTA CTACACCTAC GAATACCCCG AACACTTCAA GATTCTGCCG GCCATCAACG GCTGGGACCG TGACGCCAAT CGCATTAAGG ACGGCAAGCG CGTGCCGGAA GGCTTCGTCT ATGCCAGCGA CAACAACGCC GAATGGATGA GCGAGGACGA GTTGCGGGCG TGGATCGACG CCAACGAAGG TAAGATCGGG GCCATCTGA
|
Protein sequence | MFNDSSVLVT GGTGSFGHTF IPMLLERYNP RRVIIFSRDE MKQWEMAKKF EGDPRVRFFI GDVRDRERVY RAFDGVDYVV HAAASKIVPT AEYNPFECIK TNVTGAMNVI DAAIDKGVKK VVALSTDKAS SPINLYGATK LTSDKLFVAG NHYAGHSETR FSVVRYGNVM GSRGSVIPFF MSIRDRGVLP ITDDRMTRFM ISLEQGAELV WHAFDDMEGG EIYVKKIPSM KVTDLARVIA PAARQEIVGI RPGEKLHEQM IGGEDAYYTY EYPEHFKILP AINGWDRDAN RIKDGKRVPE GFVYASDNNA EWMSEDELRA WIDANEGKIG AI
|
| |