Gene Information |
Locus tag | Mlg_0156 |
Symbol | |
ID | 4269287 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 182160 |
End bp | 185027 |
Gene Length | 2868 bp |
Protein Length | 955 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 638124880 |
Product | TPR repeat-containing protein |
Protein accession | YP_741001 |
Protein GI | 114319318 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | [TIGR02917] putative PEP-CTERM system TPR-repeat lipoprotein |
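The length fields in the table above follow from the coordinates. A minimal Python sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the terminal stop codon is not counted in the protein length (the record states neither convention; the variable names are illustrative):

```python
# Values copied from the Gene Information table.
start_bp = 182160
end_bp = 185027
gene_length_bp = 2868
protein_length_aa = 955

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # True
print(span_bp % 3 == 0)                          # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the three fields (coordinates, gene length, protein length) agree with one another.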
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAC GCCCGCAACG CTCCCGTATC ACCCGTGCGC TCCGCCCCAT CAATCACGCC GAGCGCATGC ATGGCGCCGG CTGGCGGCGG CGCCTGGGCG TCCTGGCCTT CTGTATCACC CTCGGCGGCG GCCTCACTGC CTGCGACAAC ATGGGCGCCA GCACGGAGGA GGACTACCTG GAACGGGCCC AGTCCCGGAT GGAGCAGGGG GACTATGCAG CTGCACGGGT CGAGTTTCGT AATGCCCTGC AGTTGAACCC CCATGCGGCG GACACCCGGC GGGACCTGGG GCTCACCTAC CTGGCGTTGG GGAATGTGGA CGAGGCCCGC CGGCAATTGC GCCGCGCCCT GGAGGAGGGG GCCGACGAGG CGGAGATCGC CCTGCCGCTG GCAGAGGCGC TGTTCGAGCT GGACCGGCTG GACGAGATCC GGGCCATGGG CGTGCCCCCG GGGCTGGACC GAGCATCCAA TGCCCGACTC CACGCGCTGC GGGCGGTTGC CTTCGCCGCC ATCGGCCGGA CCGAGCCGGC ACAGTCAGAG CTGACCGCCG TGGACGGGCA TGAATCGGCC GCCGCGCTGA GCGCGCTGGC CGAGGCCCAC CTGGCACTGG CGGCCGGCGA CCATGAACAG GCCGAGCACT GGCTGGACCA GGCGCTGGAT GCCGATCCGG AGCTGGGCCA GGCCTGGAGC CTGCTGGCCC GATACCACCG CATCATGGGC GATAACGAGA GCGCCGAGGC GGCATACGGT CGGGCCATCG AACACCGGGC GGCCCCCGGC CGCGATCACC TGCGGCGGGC CATGGTGCGC CTGGACCTGG GTGACCGCGA AGGCACCCGG GCGGACATTG AGGGCCTCCG CCAGGGGGGC TCCGCGCACC CGGCCATCCC CTACCTGGAG GGCATGCTGG CGCTCCAGAA CGAGAACTAC GGCGACGCCC GGCGCAGCTT CGAGGAAGCC TTGGCCATGG ATCGCAGCTA CAGCGCGGCG CTGCTGGCCC TGGGCCAGAC CCTCCGCCGG CTGGGCAGCG ACGAGCAGGC GGAGCACTTC CTCAACCGCC ATGTACAGGA GAGCCCGGGC TCGCTTCAGG GGATTCGCAC CCTCATGGCG CTCTACGCCG AGCAGGAGCG CTTTGACGAC GCCCTGCAGT TCCTGAACCG CGCCAGCCTG GACTATCCCG GTGATGCCGC CGACCTGCAC GAGCTGCGGG GCCGCCTGAT GATGCTCTCC GGCGACCCGG AGCGCAGCGT GCAGGCCCTG CGCAACGCCG TGGCCGCCCG CCCGGACGAG ACCGGGCTGC AGGAGCTGCT GGCCGTGGCC CTGCTGCGCA GCGGCGAGAC CGAGGCCGGG CTCGACACCC TGCGCGCCGC GGGCACCCAG GATGTGACCT CTCAGCAGCT TGACGCCACC ATGGTGCTCT TCCTGCTCCA GACCGGCCGG TACGAGGAGG CCCTGGAGCG GGCCGAGCTG CTGCAGGCAC GGCAACCCGA CGCCGCGAGT CCGCACAGCC TCGCCGGCGC GGCGCTGATG GGGCTGGGCC GGGTCGAGGA GGCGCGTGAG GCCTTCAAGA AGGGCCTCGA GTTGGAGCCC GACAACCTCT CCGTGGCCAT GAACCTGGCG AACCTGGAGT TGCAGACGGG CGACCGCGAG GCCGGCCGTC AGGTGCTCGA AGGTATTCAG GAAGCACACC CGGGACACGC CCGCTCCGCC CAGCGCCTGG CCATGCTCAG CCTGCAGGAT CAGGACACCC AGGGCGCCGC GAAGTGGCTG CAGCGGGCCA TCGATGCGCA GCCGGAGAGC 
CTGCCCCCCC ACCTGATGCT GGCCCGCATC CAGGAGGACG AGGGCCGCCG GGAGGCGGCA TTGCAGACCC TGCTGGGTGC CCGCGAACAC CACGGTGACA ACCCGGAGTT GCTTTACGCG CTGGCCGATG TGCAGATGAG CCTGAATCAG ACCGACGAGG CCGTCGTGAG CCTCCAATCG GCGGTCGAGC GCGCCCCCGA TAACACCGGC CTGCAGTTGG CCCTGGCGCG GGCGCAGGCC CGGGCGGACG ACAACGAAGG GGCCGAGGCC ACCCTGGAGG CCCTGCTGGA ACAACAGCCC AGCCACTACG AGGGGCGGAT GGCACTGACC CGGGGCCTGG TCAACCAGGG CCGGCTGGCC GATGCCCGCC CCCACCTGGA TGAGCTGCAA GAGCGCTACG GCGACCGCCC GGAGGTTCAG GCGTTGCTCG GCCAAGCGGC ACTGGCAGAC GACCGGCTGG AGCCGGCCAT TGAGCACTAC CAGCAGGCAC TGGCCGACGC CGACCCGCAG CCACGCCCCT GGGTCCACGC CCTGGCGGAG GCCCAGGTCG CCGCCGACCG CCCCGGGGAG GCCCTGGCCA CTCTGGGACG CTGGCTGGAG GCCCACCCGG AGGACCGGGG CACCTGGCAC CTGTACGCCA GCCGTCAGCT CGCCTTGGGC GACACGGCGG AGGCCTTGCA GGCCTATGAG CGCATCCTGG CGCAGGACAA TGACGACGCC CTGGCCCTCA ACAACGCCGC CTGGCTCCTG CGCGACCGCG ACACCGAGCG CGCACTGGAC TACGCCCGCC GGGCCGTGGA ACTGGTACCG GAATCGGCCC AGATCCATGA CACCCTCGGC GTGGTGCTGA TCCACGCAGG CCAGCCCGCG GAGGCCCTGG AGACCCTGAC CCGGGCCGGC GAACTGGCCC CCGAGTCGCC CACCATCCAG TACCACCTGG CCTGGGCCGA GCGCGAGGCC GGCGATACCG AGGCGGCAAC CCGGCGCCTG AACCGGCTCC TTGAGGCCGA GCCCGATTTC CCGGAACGGG ATGATGCGGA GCAACTGCTG GAGGACATCC GGCGCTGA
|
Protein sequence | MKKRPQRSRI TRALRPINHA ERMHGAGWRR RLGVLAFCIT LGGGLTACDN MGASTEEDYL ERAQSRMEQG DYAAARVEFR NALQLNPHAA DTRRDLGLTY LALGNVDEAR RQLRRALEEG ADEAEIALPL AEALFELDRL DEIRAMGVPP GLDRASNARL HALRAVAFAA IGRTEPAQSE LTAVDGHESA AALSALAEAH LALAAGDHEQ AEHWLDQALD ADPELGQAWS LLARYHRIMG DNESAEAAYG RAIEHRAAPG RDHLRRAMVR LDLGDREGTR ADIEGLRQGG SAHPAIPYLE GMLALQNENY GDARRSFEEA LAMDRSYSAA LLALGQTLRR LGSDEQAEHF LNRHVQESPG SLQGIRTLMA LYAEQERFDD ALQFLNRASL DYPGDAADLH ELRGRLMMLS GDPERSVQAL RNAVAARPDE TGLQELLAVA LLRSGETEAG LDTLRAAGTQ DVTSQQLDAT MVLFLLQTGR YEEALERAEL LQARQPDAAS PHSLAGAALM GLGRVEEARE AFKKGLELEP DNLSVAMNLA NLELQTGDRE AGRQVLEGIQ EAHPGHARSA QRLAMLSLQD QDTQGAAKWL QRAIDAQPES LPPHLMLARI QEDEGRREAA LQTLLGAREH HGDNPELLYA LADVQMSLNQ TDEAVVSLQS AVERAPDNTG LQLALARAQA RADDNEGAEA TLEALLEQQP SHYEGRMALT RGLVNQGRLA DARPHLDELQ ERYGDRPEVQ ALLGQAALAD DRLEPAIEHY QQALADADPQ PRPWVHALAE AQVAADRPGE ALATLGRWLE AHPEDRGTWH LYASRQLALG DTAEALQAYE RILAQDNDDA LALNNAAWLL RDRDTERALD YARRAVELVP ESAQIHDTLG VVLIHAGQPA EALETLTRAG ELAPESPTIQ YHLAWAEREA GDTEAATRRL NRLLEAEPDF PERDDAEQLL EDIRR
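The gene and protein sequences above can be cross-checked by translating the first and last codons. A minimal Python sketch, assuming translation table 11 assigns codons to amino acids exactly as the standard code does (it differs in which start codons it permits); the `translate` helper and the table construction are illustrative and not part of the record:

```python
from itertools import product

# NCBI codon order: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate coding-strand DNA codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 18 nt of the gene sequence listed above.
gene_start = "ATGAAGAAAC GCCCGCAACG CTCCCGTATC"
gene_end = "GAGGACATCC GGCGCTGA"

print(translate(gene_start))  # MKKRPQRSRI
print(translate(gene_end))    # EDIRR
```

Both fragments match the corresponding ends of the listed protein sequence. Because the gene lies on the minus strand, the sequence in the record is already given in coding orientation, so no reverse complement is needed here.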