Gene Mlg_0156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0156 
Symbol 
ID4269287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp182160 
End bp185027 
Gene Length2868 bp 
Protein Length955 aa 
Translation table11 
GC content72% 
IMG OID638124880 
ProductTPR repeat-containing protein 
Protein accessionYP_741001 
Protein GI114319318 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID[TIGR02917] putative PEP-CTERM system TPR-repeat lipoprotein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAC GCCCGCAACG CTCCCGTATC ACCCGTGCGC TCCGCCCCAT CAATCACGCC 
GAGCGCATGC ATGGCGCCGG CTGGCGGCGG CGCCTGGGCG TCCTGGCCTT CTGTATCACC
CTCGGCGGCG GCCTCACTGC CTGCGACAAC ATGGGCGCCA GCACGGAGGA GGACTACCTG
GAACGGGCCC AGTCCCGGAT GGAGCAGGGG GACTATGCAG CTGCACGGGT CGAGTTTCGT
AATGCCCTGC AGTTGAACCC CCATGCGGCG GACACCCGGC GGGACCTGGG GCTCACCTAC
CTGGCGTTGG GGAATGTGGA CGAGGCCCGC CGGCAATTGC GCCGCGCCCT GGAGGAGGGG
GCCGACGAGG CGGAGATCGC CCTGCCGCTG GCAGAGGCGC TGTTCGAGCT GGACCGGCTG
GACGAGATCC GGGCCATGGG CGTGCCCCCG GGGCTGGACC GAGCATCCAA TGCCCGACTC
CACGCGCTGC GGGCGGTTGC CTTCGCCGCC ATCGGCCGGA CCGAGCCGGC ACAGTCAGAG
CTGACCGCCG TGGACGGGCA TGAATCGGCC GCCGCGCTGA GCGCGCTGGC CGAGGCCCAC
CTGGCACTGG CGGCCGGCGA CCATGAACAG GCCGAGCACT GGCTGGACCA GGCGCTGGAT
GCCGATCCGG AGCTGGGCCA GGCCTGGAGC CTGCTGGCCC GATACCACCG CATCATGGGC
GATAACGAGA GCGCCGAGGC GGCATACGGT CGGGCCATCG AACACCGGGC GGCCCCCGGC
CGCGATCACC TGCGGCGGGC CATGGTGCGC CTGGACCTGG GTGACCGCGA AGGCACCCGG
GCGGACATTG AGGGCCTCCG CCAGGGGGGC TCCGCGCACC CGGCCATCCC CTACCTGGAG
GGCATGCTGG CGCTCCAGAA CGAGAACTAC GGCGACGCCC GGCGCAGCTT CGAGGAAGCC
TTGGCCATGG ATCGCAGCTA CAGCGCGGCG CTGCTGGCCC TGGGCCAGAC CCTCCGCCGG
CTGGGCAGCG ACGAGCAGGC GGAGCACTTC CTCAACCGCC ATGTACAGGA GAGCCCGGGC
TCGCTTCAGG GGATTCGCAC CCTCATGGCG CTCTACGCCG AGCAGGAGCG CTTTGACGAC
GCCCTGCAGT TCCTGAACCG CGCCAGCCTG GACTATCCCG GTGATGCCGC CGACCTGCAC
GAGCTGCGGG GCCGCCTGAT GATGCTCTCC GGCGACCCGG AGCGCAGCGT GCAGGCCCTG
CGCAACGCCG TGGCCGCCCG CCCGGACGAG ACCGGGCTGC AGGAGCTGCT GGCCGTGGCC
CTGCTGCGCA GCGGCGAGAC CGAGGCCGGG CTCGACACCC TGCGCGCCGC GGGCACCCAG
GATGTGACCT CTCAGCAGCT TGACGCCACC ATGGTGCTCT TCCTGCTCCA GACCGGCCGG
TACGAGGAGG CCCTGGAGCG GGCCGAGCTG CTGCAGGCAC GGCAACCCGA CGCCGCGAGT
CCGCACAGCC TCGCCGGCGC GGCGCTGATG GGGCTGGGCC GGGTCGAGGA GGCGCGTGAG
GCCTTCAAGA AGGGCCTCGA GTTGGAGCCC GACAACCTCT CCGTGGCCAT GAACCTGGCG
AACCTGGAGT TGCAGACGGG CGACCGCGAG GCCGGCCGTC AGGTGCTCGA AGGTATTCAG
GAAGCACACC CGGGACACGC CCGCTCCGCC CAGCGCCTGG CCATGCTCAG CCTGCAGGAT
CAGGACACCC AGGGCGCCGC GAAGTGGCTG CAGCGGGCCA TCGATGCGCA GCCGGAGAGC
CTGCCCCCCC ACCTGATGCT GGCCCGCATC CAGGAGGACG AGGGCCGCCG GGAGGCGGCA
TTGCAGACCC TGCTGGGTGC CCGCGAACAC CACGGTGACA ACCCGGAGTT GCTTTACGCG
CTGGCCGATG TGCAGATGAG CCTGAATCAG ACCGACGAGG CCGTCGTGAG CCTCCAATCG
GCGGTCGAGC GCGCCCCCGA TAACACCGGC CTGCAGTTGG CCCTGGCGCG GGCGCAGGCC
CGGGCGGACG ACAACGAAGG GGCCGAGGCC ACCCTGGAGG CCCTGCTGGA ACAACAGCCC
AGCCACTACG AGGGGCGGAT GGCACTGACC CGGGGCCTGG TCAACCAGGG CCGGCTGGCC
GATGCCCGCC CCCACCTGGA TGAGCTGCAA GAGCGCTACG GCGACCGCCC GGAGGTTCAG
GCGTTGCTCG GCCAAGCGGC ACTGGCAGAC GACCGGCTGG AGCCGGCCAT TGAGCACTAC
CAGCAGGCAC TGGCCGACGC CGACCCGCAG CCACGCCCCT GGGTCCACGC CCTGGCGGAG
GCCCAGGTCG CCGCCGACCG CCCCGGGGAG GCCCTGGCCA CTCTGGGACG CTGGCTGGAG
GCCCACCCGG AGGACCGGGG CACCTGGCAC CTGTACGCCA GCCGTCAGCT CGCCTTGGGC
GACACGGCGG AGGCCTTGCA GGCCTATGAG CGCATCCTGG CGCAGGACAA TGACGACGCC
CTGGCCCTCA ACAACGCCGC CTGGCTCCTG CGCGACCGCG ACACCGAGCG CGCACTGGAC
TACGCCCGCC GGGCCGTGGA ACTGGTACCG GAATCGGCCC AGATCCATGA CACCCTCGGC
GTGGTGCTGA TCCACGCAGG CCAGCCCGCG GAGGCCCTGG AGACCCTGAC CCGGGCCGGC
GAACTGGCCC CCGAGTCGCC CACCATCCAG TACCACCTGG CCTGGGCCGA GCGCGAGGCC
GGCGATACCG AGGCGGCAAC CCGGCGCCTG AACCGGCTCC TTGAGGCCGA GCCCGATTTC
CCGGAACGGG ATGATGCGGA GCAACTGCTG GAGGACATCC GGCGCTGA
 
Protein sequence
MKKRPQRSRI TRALRPINHA ERMHGAGWRR RLGVLAFCIT LGGGLTACDN MGASTEEDYL 
ERAQSRMEQG DYAAARVEFR NALQLNPHAA DTRRDLGLTY LALGNVDEAR RQLRRALEEG
ADEAEIALPL AEALFELDRL DEIRAMGVPP GLDRASNARL HALRAVAFAA IGRTEPAQSE
LTAVDGHESA AALSALAEAH LALAAGDHEQ AEHWLDQALD ADPELGQAWS LLARYHRIMG
DNESAEAAYG RAIEHRAAPG RDHLRRAMVR LDLGDREGTR ADIEGLRQGG SAHPAIPYLE
GMLALQNENY GDARRSFEEA LAMDRSYSAA LLALGQTLRR LGSDEQAEHF LNRHVQESPG
SLQGIRTLMA LYAEQERFDD ALQFLNRASL DYPGDAADLH ELRGRLMMLS GDPERSVQAL
RNAVAARPDE TGLQELLAVA LLRSGETEAG LDTLRAAGTQ DVTSQQLDAT MVLFLLQTGR
YEEALERAEL LQARQPDAAS PHSLAGAALM GLGRVEEARE AFKKGLELEP DNLSVAMNLA
NLELQTGDRE AGRQVLEGIQ EAHPGHARSA QRLAMLSLQD QDTQGAAKWL QRAIDAQPES
LPPHLMLARI QEDEGRREAA LQTLLGAREH HGDNPELLYA LADVQMSLNQ TDEAVVSLQS
AVERAPDNTG LQLALARAQA RADDNEGAEA TLEALLEQQP SHYEGRMALT RGLVNQGRLA
DARPHLDELQ ERYGDRPEVQ ALLGQAALAD DRLEPAIEHY QQALADADPQ PRPWVHALAE
AQVAADRPGE ALATLGRWLE AHPEDRGTWH LYASRQLALG DTAEALQAYE RILAQDNDDA
LALNNAAWLL RDRDTERALD YARRAVELVP ESAQIHDTLG VVLIHAGQPA EALETLTRAG
ELAPESPTIQ YHLAWAEREA GDTEAATRRL NRLLEAEPDF PERDDAEQLL EDIRR