Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1571 |
Symbol | |
ID | 4270593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1796128 |
End bp | 1797891 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638126328 |
Product | type II secretion system protein E |
Protein accession | YP_742408 |
Protein GI | 114320725 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00763489 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAGGTCG CCGACGAGCA CGATCTGGAA CAGGCGCTGA TCAAGGCGCG GGGCAAGCAG GTTCGGCGCC TCGGCGGCAT CCTCCTGGAA CGGGGCCTTA TCGACGAGAC CACGCTGCGT GCGGCCTTGG ACACCCACCG GGCGCAGCCG CACCTGCAGC TTGGCCGCTG GCTGGTGGAG CATCGCCACA TCACCCGGGA GCAGCTCGAG GACGCCCTGT GCGAGCAGCT CGGGATCCCT CGGGTGGATC TGGCGGGGTT TGTGGCCAAG CCGGAGGTCG CCGGGCTGAT CCCCTACGAG ATGTGCCTGC GACTCAACGT CCTGCCCCTG GCGCGCCACC GATCGGTGCT GATGGCTGCC ACCGCCACCC CGACGGACGA GGAGTTGCTG GCCAACCTCC GTTTCCATAC CGGACTCAAC GTGGAGCCGG TCCTGGCGCC TCCCCATCAG ATCAGCAGCG CGATTAATCG CTCCTATAAA TCCCTGGCCA TCGGCGGTGA GGAGGGCATG GACACCCTGC TGACCACCGA TGAGGACCGG GACCTGCGCC GCGACCAGGA GATAGAGAGC CAGGCCAGCA GCCGCCCGGT GGTGCGGCTG GTCAATACCG TCATCCTGCA GGCGATCAGC CGCGGGGCGT CCGACATCCA CTTCATGCCA CGCGAGAACG ACCTGGCGGT GATGTTCCGT ATCGACGGCG CCATCCAGCG GGTGCGACTG GTGGACAAGG CGCAGCTGGC GGCGGTTGTC GCCCGCATCA AGATCCTGGG TCGCATGAAT ATTGCCGAAA AGCGCCTGCC CCAAGACGGC CATGCGCGGG TGCGGGTGAG CGGCAAGGCG GTGGACCTGC GCATTTCGGT GATGCCCACC TACACCGGCG AGAGCGTGGT CGTCCGCATC CTCAACAAGG CCCACGGACT CAAGCGGCTG GAGGAAATCG GGTTCTCTGA GCGCGACGAC CGGATCGTCC GCACCCTGAT CCAGCGCCCC CAGGGGATGA TCCTGGTCAC CGGACCCACC GGCTCGGGCA AGTCCACCAC CCTCTACTCC CTGCTCCAGG AGGTGCGTCG TGCCGAGCCC CACATCCTTA CCGTGGAGGA GCCGGTGGAG TACGACATGG AGGGGGTGGA ACAGATCCAG GTGAATGCCG GCATCGGCTA CACCTTTGCC CGGGCGCTGC GCAATATCCT CCGCCACGAT CCCGATGTGA TCATGGTCGG GGAGATCCGC GATCTGGAGA CGGCCGAGAT CGCCACCAAG GCGGCGCTCA CCGGCCACAT GGTCCTTTCC ACCCTGCACA CCAACGATGC ACCCAGCGCC GTGACCCGCC TGGTGGACAT GGGGGTGGAG CCCTACCTGG TGAGTTCCAC CGTGATGGGC GTGCTGGCCC AACGGCTGGT CCGTGTTATC TGTACCCATT GCCGGGTGGC GCATGAGCCA GAGGCGCTGG TGCGCCAGGT GATGGGCGTC GGCGACGAGC CATTCTGGAC CGGGACCGGC TGTGATCGCT GCGACTATAC CGGGTTTCAC GGGCGTGCCA TGGCCTATGA ACTGCTGGTG GCCGATCGCA ACATGGCCAC CCGTATCGCT CAGGGCATCA CCACCGAGGC GCTGAGGGAA CTGGCCGTGG AGGGGGGTAT GCGCAGTCTG ACCCGGCACG GCCTGCACCT GGCGCGTACC GGGGTCACCA CCCTGGAAGA GGCCTTCCGG GTCCGGCTGG AGGACCTGGA CGATGTCAAG AAGGTGGTGG ACGCGGGTTA CTGA
|
Protein sequence | MKVADEHDLE QALIKARGKQ VRRLGGILLE RGLIDETTLR AALDTHRAQP HLQLGRWLVE HRHITREQLE DALCEQLGIP RVDLAGFVAK PEVAGLIPYE MCLRLNVLPL ARHRSVLMAA TATPTDEELL ANLRFHTGLN VEPVLAPPHQ ISSAINRSYK SLAIGGEEGM DTLLTTDEDR DLRRDQEIES QASSRPVVRL VNTVILQAIS RGASDIHFMP RENDLAVMFR IDGAIQRVRL VDKAQLAAVV ARIKILGRMN IAEKRLPQDG HARVRVSGKA VDLRISVMPT YTGESVVVRI LNKAHGLKRL EEIGFSERDD RIVRTLIQRP QGMILVTGPT GSGKSTTLYS LLQEVRRAEP HILTVEEPVE YDMEGVEQIQ VNAGIGYTFA RALRNILRHD PDVIMVGEIR DLETAEIATK AALTGHMVLS TLHTNDAPSA VTRLVDMGVE PYLVSSTVMG VLAQRLVRVI CTHCRVAHEP EALVRQVMGV GDEPFWTGTG CDRCDYTGFH GRAMAYELLV ADRNMATRIA QGITTEALRE LAVEGGMRSL TRHGLHLART GVTTLEEAFR VRLEDLDDVK KVVDAGY
|
| |