Gene Mlg_2160 details

Gene Information

Locus tag: Mlg_2160
Symbol:
ID: 4270155
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2455108
End bp: 2456424
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 67%
IMG OID: 638126916
Product: type II secretion system protein E
Protein accession: YP_742992
Protein GI: 114321309
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4962] Flp pilus assembly protein, ATPase CpaF
TIGRFAM ID:
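
The start, end, gene length, and protein length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(2455108, 2456424)
print(length)                  # 1317
print(protein_length(length))  # 438
```

Both values agree with the Gene Length and Protein Length fields of this record.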


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0397447
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.0352451
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGTGG GGATGTACGG CAAGCGCATC GCGCTGCGGC AGAAAGCCGA TGAGCAGAAC 
CTCAAACTGC GGCTGCACCG GTATCTCATC GACGCCATTG AGGAGGACGG GGTCGACCTA
GCCACCTGGG TGCGGCCCGC CATCCTGGAA TACGTCCGCG AGAAGGTGGG CGGGTACGTG
GCCAGCCACC GGCTGGCCGT CAGCCGCCAG GACCTGGACC GGCTCGCGGA GGACATCGCC
GATGAGCTCT GCGGCCTCGG GCCGCTGCAG ACCCTGCTGG CCGACCCCTC GATCTCCGAG
GTGTTGGTGA ACGGTCCCGA GCACATCATC GTGGAGCGCG ACGGCAAGCT GTTCGAGACC
GACCACCGAT TCATCGACGA CGACCACGTC AAGCGGGTCA TCGACCGGAT CCTCGCCCCC
CTGGGCCGGC GGTTGGACGA GGCCAGCCCG ATGGTCGACG GCCGTCTCCC AGACGGCAGC
CGGGTCAACG CGGTGATCCC ACCGGTGGCG CTGGACGGCC CCTGCGTCTC CATCCGCAAG
TTCGCCGCGG ACCCCCTGCG GGGCAATGAC CTGATCGCCT ACAAGACGCT GGACGAGGGC
CTGTTGGCCT TCCTGCGCAA CGCGGTGGAG CAGCGCGCCA ACATCCTCAT CAGCGGCGGG
ACGAGCACCG GGAAGACCAC GCTGCTCAAT GTCATGAGCG GCTATGTGGG TGCGACGGAG
CGCATTGTCA CCATTGAGGA CACCGCCGAG TTGCAGCTTC ACCATAGCCA CGTGGTTCGG
CTGGAGACCC GGCCGCCCAA CGTCGAGGGC TATGGGGAGA TCACCGCCCG GGACCTGGTC
AAGAACGCCC TGCGCATGCG CCCGGACCGC ATCATCCTCG GTGAGGTGCG CGGCGATGAG
GTGGTCGAGG TGATGCAGGC AATGAACACC GGCCACGACG GCTCCATGTC CACGGTGCAC
GCCAACAACG CCACCGATGC GCTGCTGCGT ATGGAGATGC TGTTCGGCAT GGCGGGGCGG
CAGATGTCTG AGGTCACGGT GCGCCGCATG ATCGGCGCCG CCGTAGACCT GATCGTCCAG
TTGGTCCGGC TCAGGGACGG AACCCGCTGT ATCAGCGAGG TGCGGGAGCT GGTGACGGTG
CGGGACGGTA ACTTCGTCAC CACGGTCCTG TATGAACGGG ACGCGGAGAG CGGACAGTTT
GTGCGCAGGG ACGACCCGGT CTCCAACCCG AAGCTTCAGG CCCTCAACGC CCCCGCCCGG
CGGGAAGGCC CCCGGGTGGG GACCCGGTGG GCAGCCCCAC GCACCCGGCA TGAGTGA
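
The GC content field of this record can be recomputed from the gene sequence. The sketch below assumes the sequence is pasted in the 10-base grouped layout used here, so whitespace is stripped before counting; `gc_content` is a hypothetical helper name, and only the first sequence line is used as sample input.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence only; pass the full sequence to
# compare against the GC content field of the record.
first_line = "ATGAAAGTGG GGATGTACGG CAAGCGCATC GCGCTGCGGC AGAAAGCCGA TGAGCAGAAC"
print(f"{gc_content(first_line):.0%}")
```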
 
Protein sequence
MKVGMYGKRI ALRQKADEQN LKLRLHRYLI DAIEEDGVDL ATWVRPAILE YVREKVGGYV 
ASHRLAVSRQ DLDRLAEDIA DELCGLGPLQ TLLADPSISE VLVNGPEHII VERDGKLFET
DHRFIDDDHV KRVIDRILAP LGRRLDEASP MVDGRLPDGS RVNAVIPPVA LDGPCVSIRK
FAADPLRGND LIAYKTLDEG LLAFLRNAVE QRANILISGG TSTGKTTLLN VMSGYVGATE
RIVTIEDTAE LQLHHSHVVR LETRPPNVEG YGEITARDLV KNALRMRPDR IILGEVRGDE
VVEVMQAMNT GHDGSMSTVH ANNATDALLR MEMLFGMAGR QMSEVTVRRM IGAAVDLIVQ
LVRLRDGTRC ISEVRELVTV RDGNFVTTVL YERDAESGQF VRRDDPVSNP KLQALNAPAR
REGPRVGTRW AAPRTRHE
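
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are hypothetical, and the inputs are the first 30 and last 12 bases of the gene sequence.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; assumed
# to match the codon assignments of translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored; ambiguity codes such as N raise KeyError.
    """
    bases = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(bases) - len(bases) % 3, 3):
        residue = CODON_TABLE[bases[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGAAAGTGG GGATGTACGG CAAGCGCATC"))  # MKVGMYGKRI
print(translate("CGGCATGAGTGA"))                      # RHE
```

The two outputs match the first ten and last three residues of the protein sequence above.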