Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1727 |
Symbol | |
ID | 4268976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1976020 |
End bp | 1977741 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638126485 |
Product | peptidase M14, carboxypeptidase A |
Protein accession | YP_742563 |
Protein GI | 114320880 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGCAC CCACCGACAT CGAGCCCTTC CGGCACCAAT ACCTGGATTA CGACACCCTC ACCGGGCAGT TGCAGCACTG GGCGTCGGCG CACCCGGAGG TGGCCCGCCT GGAGAGCCTG GGCACCAGCC CGGAGGGCCG CGAGATCTGG CTGCTCACCG TGGGCAGGCG GCCGGAGCGG TCGCGTCCCG CCGTCTGGGT GAACGGCAAC ATGCACGGTT CGGAGTTGGC CGGCTCCAGC GTGGCGCTGG CCGTGGCCGA GGCGGCCCTG CAACTCCACC TGAGCGGTGC CAACACCCAC GGCGGACTGC TGCCCCATCT GGAGGAACAA CTCCGCGGCG TGCTCTTCTA CATCTGCCCG CGCGTCTCGC CCGACGGTGC GGAACAGGTG CTTCATCACG GCGGCTTCGT GCGCTCGGCA CCCCGGCGCA GCCCCCACGC CCCCGACACC CCCCGTTGGG AACCGAGCGA CTTGGACGGC GATGGCCGCT GCCGCTATCT GCGGATGGAG GATCCGGCCG GGCCGTTCGT CGCCTCGCCC CGGCATGCCG GGCTGATGCT GCCCCGTGAA CTGGACGACC CACCCCCCTA TTACCGCCTC TACCCGGAGG GGCTCATCCG TCACTGGGAT GGCCACACCG TGCCGGAGCC GGAACCGTTG CGAGACACCC CCGATTTCAA CCGCAACTTT CCCTGGAACT GGCGGCCAGA GCCGGACCAG ACCGGCGCTG GGCACTTTCC AGGCTCCGAG CCCGAGACCC ACGCGGTGCT CGATTTCGCC ACCCGCCATC CCAACATCTA CGCCTGGCTC GATCTACACA CCTTCGGCGG GGTCTTCATC CGCCCGCTGA CCGGCGCCCC GGACGCCGCC ATGGACCAGG ATGACCTGGC GCTCTACCGC CAATTGGCCG CCTGGGGCGA AATGCTCACC GGCTATCCCA CGGTCAGCGG CTTCGAGGAG TTCACCTACG AACCCGAGAC CCCGCTTTAC GGGGACCTGA CCGACTTCGC CTATCACCAA CGCGCCTGCC TGGCCCAGGT CTGTGAACTC TGGGACCTCT TCCGCCGGCT CGACCTGCCC CGCCCGAAAC GCTTCGTGGA CCTCTACACC AGCCTGCACC GTGGCGACAT GGAACGGCTG GCCCAGTGGG ATGCCGAGCA CAACCGCCAA CGGCTCTTCC GGCCCTGGTT GCCCCTCAAG CACCCGCAAA TCGGGCCGGT AGAGGTTGGC GGGCTGGACC CCAGCATCGG CATCTGGAAC CCGCCCCCCG AAGCGCTGCC CGACATCTGT GACGGCATCG CCACCTATTG GTTGCGGGCT GCCGCCCTGC TACCCCGACT GACTATCGCC GGGCTGGAAT GCCGCCCGCT GGGGGACGAC CACTGGGAGA TCATCGCGGT CGTGGCGAAT CACGGCTACC TGCCCACCTA CGGCGTAGCC GCCGGCCGCA GTCGGCCCTG GAACGACGGT GTGGAAACCG AACTCTGGCT GGAGGGCTGT ACCCTGACCG AGGGCCAGCC GGCCCGCCAA GCCCTTGGCC ATCTCGACGG CTGGGGCCGC GGATTGGGGA ACATGGCGCA CATGCCCTGG TTCCAGCGCT CACGGGGCAG CAGCCATCAG GCCCGGGCAC GCTGGGTGGT GCGGGGCCGG GGCACAGTCA CGCTGAGCGT CCGAAGCACC CGGCTCGGGA CCCTGGCACA GACCCGGCGA CTCACCCCTT GA
|
Protein sequence | MGAPTDIEPF RHQYLDYDTL TGQLQHWASA HPEVARLESL GTSPEGREIW LLTVGRRPER SRPAVWVNGN MHGSELAGSS VALAVAEAAL QLHLSGANTH GGLLPHLEEQ LRGVLFYICP RVSPDGAEQV LHHGGFVRSA PRRSPHAPDT PRWEPSDLDG DGRCRYLRME DPAGPFVASP RHAGLMLPRE LDDPPPYYRL YPEGLIRHWD GHTVPEPEPL RDTPDFNRNF PWNWRPEPDQ TGAGHFPGSE PETHAVLDFA TRHPNIYAWL DLHTFGGVFI RPLTGAPDAA MDQDDLALYR QLAAWGEMLT GYPTVSGFEE FTYEPETPLY GDLTDFAYHQ RACLAQVCEL WDLFRRLDLP RPKRFVDLYT SLHRGDMERL AQWDAEHNRQ RLFRPWLPLK HPQIGPVEVG GLDPSIGIWN PPPEALPDIC DGIATYWLRA AALLPRLTIA GLECRPLGDD HWEIIAVVAN HGYLPTYGVA AGRSRPWNDG VETELWLEGC TLTEGQPARQ ALGHLDGWGR GLGNMAHMPW FQRSRGSSHQ ARARWVVRGR GTVTLSVRST RLGTLAQTRR LTP
|
| |