Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2401 |
Symbol | |
ID | 4269988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2728254 |
End bp | 2730275 |
Gene Length | 2022 bp |
Protein Length | 673 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638127159 |
Product | general secretion pathway protein D |
Protein accession | YP_743231 |
Protein GI | 114321548 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1450] Type II secretory pathway, component PulD |
TIGRFAM ID | [TIGR02517] general secretion pathway protein D |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.539176 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.000473161 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTCAGC TGATAGCGGG GGCGCGGCGC TGGCTCGGCT GCGGGGCCGG CGTGCTGCTG GTGGGGCTGC TGCTCTGCGG GCCGGTACTG GCCGAAGAGG ACGAGGGGAT CACCCTCAAC TTCCAGGATG CGGACCTGCG CGAGGTGGTG GCACTGGTCT CCCAGGAGAC CGGGGTCAAC TTCATCGTCG ACCCGCGGGT GCGGGGCGAT GTCACCATCG TCTCGCAGAC CCCGGTGGAC GCTGACGGCC TCTATCAGGT CTTCCTCTCG GCGCTCAAGA TCCACGGCTT TGCCGCCGTG CCCACCCCCG AGGCGGTGCG CATCATCCCC CGTGCCCAGG CCCGCCAGGA CCGTATCCCC AGTGATGGTG ATACCACCCG AAGCCTGGGG GACCAGGCGG TCACCCGGGT CATCCGGGTG GACCACGTCA ACGCCGCGGA GCTGGTGCCG GTGCTGCGCC CGCTGGTGCC CCAGGACGGC CACCTGGCCG CCTACAGCCC GGCCAACAGT CTCATCATCT CCGACACCGC CGCCAACGTG GACCGGCTCG AGGTGTTGAT CGGCCGGGTG GACCGGGACA CCGAGGGCGA GATCGAGGTT ATCCAGCTCG AGCACGCCGC CGCCAGCGAA GTGGTGCGCA TGGTACGCGA GCTGGAGGAC GAGGAGCGCG AGGGGCGCCG CCTGCGGGTG GTGGCCGATG AGCGGGCCAA CAGTGTGATG ATCGCCGGCG ACCGCCAGCG CCGCCTGCTG GTGCGCGCCC TGATCGGCCA GATTGACAGT GAGGTGGCCG CCGAGGGCAC GGCACAGGTG GTCTATCTTC GCTTTGCCGA TGCCGAGGAG CTGGTGCCGG TACTGGAGGG CATCGGCACC AGCCTGTTGG AGAGTCGGGG GGGCAACGAC GCCGGTGGCC AGGGGCTGGA CATCCGCGCC CATTCCAGCA CCAACGCCCT GGTCATGAAC GGGCCGGCGG ACGTGCTGCG CTCGCTGCGC TCGGTGCTGA ACCGGCTCGA TATCCGTCGC GCCCAGGTGC TGGTGGAGGC GGTGATCGCC GAGGTCTCCC AGGACCGGGT GGAAGAGCTG GGGGTGCAAT GGGGGGTGCT GTCCGAGGAC CGCGGCGTCG GGCTGATCAA CTTCGGAGGG GCCGCCGGCG CCGGTGGCGG TATCGCCGAC GTCATCCGGG CTGCCGGGGC CATTTCCGAC GGCAGCACCG CCGACCTGCC GCAGGTCGGC GACGGCGCCG CCATCGGCGT CGGCGACCTG CGGGGCTCGA CCCAGGTGGC GGCGCTGATC CGCGCCCTGT CCGGTGACAG CGCCAGCAAC ATCCTCTCCA CCCCCTCGCT GATGACCATG GACAACGAGG AGGCCGAGAT CGTCGTCGGC CAGAATGTCC CCTTCATCAC CGGCCGGGCC ATTGAGGACT CCGGCCAGGC CTTCAGCAGC ATCCAGCGTC AGGACGTGGG CGTGCAACTG CGCATCCGGC CGCAGATCAA CGAGGGCGAC ACCCTGAAGC TGGAGATCGA ACAGGAGGTC TCCAGCGTCA GCGGCGGCAT TCAGGGGGCG GCGGACCTGG TCACCGACTT GCGCAGCGTG CGGACCAGCG TGATGGTGGA GAACGGGCAG ATGGTGGTCC TGGGCGGGCT GATCGACGAC CAGCTACGCA CCCGCAGCCA AGCGGTACCG CTGTTGGGCA ACCTGCCGGG TCTGGGGCGG TTGTTCCGCT ACGACCGCAG CCAGGTTGAG AAGCGCAATC TGATGGTCTT CCTGCGCCCG GTGATCATCC GCGATACCGC CGCGCAGGAG CGCGCGACCG CCGAGCCCTA CAACCGCATG CGCGCCATGC AGCAGCGCTT TCGCGACGAG GGCGTGCCGC TGATGCCCGA TGATGCCGCG CCGGTGCTGG CCGAATCGGA GCAGTTCATG TCGTTGCCGC CCGCCTACGA CGACCGGCCG GAAAGCACCG GCCCCACCGG CCGTTCGCGG GGCGCCCTGC CGCCCCCCTC GCGCCAGGGT TTTCTGGACT GA
|
Protein sequence | MIQLIAGARR WLGCGAGVLL VGLLLCGPVL AEEDEGITLN FQDADLREVV ALVSQETGVN FIVDPRVRGD VTIVSQTPVD ADGLYQVFLS ALKIHGFAAV PTPEAVRIIP RAQARQDRIP SDGDTTRSLG DQAVTRVIRV DHVNAAELVP VLRPLVPQDG HLAAYSPANS LIISDTAANV DRLEVLIGRV DRDTEGEIEV IQLEHAAASE VVRMVRELED EEREGRRLRV VADERANSVM IAGDRQRRLL VRALIGQIDS EVAAEGTAQV VYLRFADAEE LVPVLEGIGT SLLESRGGND AGGQGLDIRA HSSTNALVMN GPADVLRSLR SVLNRLDIRR AQVLVEAVIA EVSQDRVEEL GVQWGVLSED RGVGLINFGG AAGAGGGIAD VIRAAGAISD GSTADLPQVG DGAAIGVGDL RGSTQVAALI RALSGDSASN ILSTPSLMTM DNEEAEIVVG QNVPFITGRA IEDSGQAFSS IQRQDVGVQL RIRPQINEGD TLKLEIEQEV SSVSGGIQGA ADLVTDLRSV RTSVMVENGQ MVVLGGLIDD QLRTRSQAVP LLGNLPGLGR LFRYDRSQVE KRNLMVFLRP VIIRDTAAQE RATAEPYNRM RAMQQRFRDE GVPLMPDDAA PVLAESEQFM SLPPAYDDRP ESTGPTGRSR GALPPPSRQG FLD
|
| |