Gene Information |
Locus tag | Mlg_1749 |
Symbol | |
ID | 4270856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2005454 |
End bp | 2006608 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638126507 |
Product | hypothetical protein |
Protein accession | YP_742585 |
Protein GI | 114320902 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
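The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length // 3 - 1


# Values taken from the Start bp / End bp rows of the record.
length = gene_length_bp(2005454, 2006608)
print(length, protein_length_aa(length))  # prints "1155 384"
```

Both results match the Gene Length (1155 bp) and Protein Length (384 aa) rows.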
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.698415 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGCTCG ATCTGCTCAG CGGCGAACAG CGGGCCCCGG CGGAGTGCCG GATCTACCTC CACGACCAGG AGGTGCCGGA GTACTACCCC TTCCTCCGCG AGTTGGAGGT GGACACCAGC CGGGAGGAGG CCTGGACCGC GACCCTGCGC CTGGCCACCG TGCGCGATGA GCTGGGGAGC TGGGGTATCC AGGACGACGC GCTGTTGCGG CCCTGGGCGG AGATCACCCT GGCCATCGTC TTCGGCGATG ACGAGGAGCG CCTGTTCAAG GGCTATATCC GCGAGGTCAA CGCCGACTTC CCGGAGAGCG CCGGTGAGGC GGAGGTCGTG GTCGAGTGCC AGGACCAGTC CCTGCGCCTG GACCGCACCC ACCAGCGCGA GCCCTGGGGC GACGAGGAGG CCCCGGTCAG CGACCGGGTG ATCATCGAGG AGATCCTCAG CCACTACGGC CTCGCCCTCA GCCCGGACAG CAAGAGCGGC CAGCAGGGGC TGGTGGAACT GCCGCAGGAC AGCTCCGATA TCCAGTTCCT GCGCAAGCGC GCCGAGGAGA ACAACTACGA GCTGATCTTC TACCCGGACG AGGTCTACTT CGGGCCCTAC CGCCTGGACG GCGCCGGGGC CCAGGCCACC ATCCAGGTCT ACGCCGGCCA GGCCACCAAC TGCCTGCGGC TCAACGTCAG CGCCGACGGC CACCTGCCGG ACCTGCTCCA GGTGGAGGTG CCGGCGGAGC GCGAGGATGA CGACTCGCGC ACGCTGCAGA TGTTCTCCAC GCTGCCCCCC ATGGGGCCGG AGCGGGCCGA CAGCCGGGGC GGCGGCCTGG AGCCCAATGT AGACACCCTG AGCGGGGAGG GGGGCGATGC CGCCGAGCAG CTCGAGGCCC GGGCCCAGGC TCGCATCAAC GAATACGACC TGCACCGCCT GCAGGCCGAC GGCGAGCTGG ACGGCAGCCT CTACGGCCAC GTGCTGCGCC CGGGCCGGCC GGTGCCGGTG GACGGCCTGG GCGAGCGGCT GAGCGGGCTT TACTACGTCG ACCGGGTGGC CCATCACTTC AGCCCCGACG GCTACTTCCA GCGCTTTCAA CTGCTGCGCA ACGCCTACGG CGACAACGTG GAGACCGCCG CCCCGGTCGC TTCCCGGCTG GCGGGGGTGC TCTGA
|
Protein sequence | MVLDLLSGEQ RAPAECRIYL HDQEVPEYYP FLRELEVDTS REEAWTATLR LATVRDELGS WGIQDDALLR PWAEITLAIV FGDDEERLFK GYIREVNADF PESAGEAEVV VECQDQSLRL DRTHQREPWG DEEAPVSDRV IIEEILSHYG LALSPDSKSG QQGLVELPQD SSDIQFLRKR AEENNYELIF YPDEVYFGPY RLDGAGAQAT IQVYAGQATN CLRLNVSADG HLPDLLQVEV PAEREDDDSR TLQMFSTLPP MGPERADSRG GGLEPNVDTL SGEGGDAAEQ LEARAQARIN EYDLHRLQAD GELDGSLYGH VLRPGRPVPV DGLGERLSGL YYVDRVAHHF SPDGYFQRFQ LLRNAYGDNV ETAAPVASRL AGVL
|
| |
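The gene and protein sequences above can be related with a short script. This is a minimal sketch, not the database's own method: it strips the spaces that separate the 10-base groups, computes the GC fraction, and translates codon by codon. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which start codons are permitted); alternative start codons are not handled here. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base groups and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the gene sequence in the record.
prefix = "ATGGTGCTCG ATCTGCTCAG CGGCGAACAG"
print(translate(prefix))    # prints "MVLDLLSGEQ"
print(gc_fraction(prefix))  # prints 0.6
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same functions would give the whole-gene GC fraction and translation for comparison with the GC content and Protein sequence rows.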