Gene Mlg_1631 details


Gene Information

Locus tag: Mlg_1631
Symbol:
ID: 4270352
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1864904
End bp: 1866481
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 69%
IMG OID: 638126388
Product: sporulation domain-containing protein
Protein accession: YP_742467
Protein GI: 114320784
COG category: [S] Function unknown
    [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3266] Uncharacterized protein conserved in bacteria
    [COG3267] Type II secretory pathway, component ExeA (predicted ATPase)
TIGRFAM ID:
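
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence includes its terminal stop codon (the gene sequence on this page ends in TGA). The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Protein length for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


length = gene_length_bp(1864904, 1866481)
protein = expected_protein_length_aa(length)
print(length, protein)  # 1578 525
```

Both values match the Gene Length and Protein Length fields of the record.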


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.258294
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGACC AAGCGTTTAA GGCCATCCTG CCTCCCGCCA GCCTCCATCG GCTGGGTCTG 
CTGGAACAAC CCTTCGGGGA CCGGCCGCAC GGACCCGCGA TCTTCGAGGA CGCCGCCTAC
CGCACCCAAG TCAATGTGGC CTTGAACCTG CTGCAGACCG GGGAGCGCGT CCTGCTCGTC
CGGGGCGAGC CTGGCCTGGG CAAGTCCACC TTCCTGCGCA AGTTGCTGGA CTCCGCTCAG
CCGGGGGTGG ACTTCCAACC CTGCGTCGCC GACCCGGACC TGCTGTTCAG CGATATCTGG
CTCGACTATC TGGAGCGCCT GGATCCGGAC ACTGACCACG GTGATCACGT CCGCCATACC
CAATTGGTGA ACCTGATCCA GGCAATGAAC CGGCGCGGCA TGCGCCCGGT GCTGCTGATC
GACGACGCCC ACGATCTGGC GGACGACACC GCGGGCCAAC TGCTCGATTT CTGGACTGAA
ATGGCCGAGG CGGGCGAGGG GTTCGGGCTG GTGGCGGCGC TGGACCCGGG CGTGGAGGGC
AGCGAGGAGG GCTACCTGGC CGGTACCCGG CTGGATCCGG CGCGGGTTTA TAACATCACC
CTCTACCCCT ACGACTTGGA TCAGACGGAA CGCTACCTGC GGCATCGGTT TCAGTTGGCC
GGCGGCGAGC CCGACCTGCT CAGCCGCAAG GATGTGGAGC GGATCTTCGA GCGTTCGGGG
GGGCGGCCCG GCTTCGTCAA TCTGGCGGCG CGGGACCTGC TCCAGGACAA GGCGACCCGG
GGAGGGCGGG GGTTCGCGCT GGCCTGGCCC GATCTGTCGG GGTTCCGGGT GCGGGCGCCT
CAGGGTCGGG CGCGGCACCT GCTGGCGGGT GGCGTGGTGG TCGTGATCGG CGGGCTGCTG
GCCATTAACC TGTTCACGGG CGGTGGCGGG GAAGATGCGG AAATCGTCGA CGATGAGCTG
ACGCTCGATC TGCCGCAGCT GAGCCAGGCC GACCCGGAGA CCGTGCGTCC GGACGAGGGA
GCAACGGACC ACCTGCCGCT GGGCATGACC CGCGACGACC CGTTGGCGCC GGAGCAGGGG
GAAGCGCCGA CGGAGAGCGA CCGCCAGCCG GAGGAAACCC CGGTGCCAGA TCCCGAGCCG
CAACTACAGG CAGAACCGCA ACCGGAGCCG GAGATCGCGG CACCGGAGCC CGAACCGAGT
GCCCCGGTCC CCGAACCGGA GCCGGAGGCC GCCGACGAAA GGACCGAGGA ACCGGCGGCT
GACGCGGATG GGGTGGAGGC CTGGCTGGCG CGCGGTGAGG ACTGGGCGCG CGGTCAGCCC
GCTGATCACT ACACCATCCA GGTCCTGGCC GCCGGAGGTG CCGAGACCCT ACTGCCCTAC
CTGAGCCGCC ACGGGCTTGA GGAGGATGCC CACCTAGTGT TGACCCGACG CCAGGGCAAT
GACTGGTATC TGGTCCTCGT GGGCAGCCAC GCCGACCGCG AGGCCGCGCG CGATGCCATA
GACGCCCTGC CGGAGGCAGT GCGTGCCTCC GGGCCGTGGG TGCGCACTAT GGGGTCGGTC
GCCGACGTCA TGCCCTGA
 
Protein sequence
MSDQAFKAIL PPASLHRLGL LEQPFGDRPH GPAIFEDAAY RTQVNVALNL LQTGERVLLV 
RGEPGLGKST FLRKLLDSAQ PGVDFQPCVA DPDLLFSDIW LDYLERLDPD TDHGDHVRHT
QLVNLIQAMN RRGMRPVLLI DDAHDLADDT AGQLLDFWTE MAEAGEGFGL VAALDPGVEG
SEEGYLAGTR LDPARVYNIT LYPYDLDQTE RYLRHRFQLA GGEPDLLSRK DVERIFERSG
GRPGFVNLAA RDLLQDKATR GGRGFALAWP DLSGFRVRAP QGRARHLLAG GVVVVIGGLL
AINLFTGGGG EDAEIVDDEL TLDLPQLSQA DPETVRPDEG ATDHLPLGMT RDDPLAPEQG
EAPTESDRQP EETPVPDPEP QLQAEPQPEP EIAAPEPEPS APVPEPEPEA ADERTEEPAA
DADGVEAWLA RGEDWARGQP ADHYTIQVLA AGGAETLLPY LSRHGLEEDA HLVLTRRQGN
DWYLVLVGSH ADREAARDAI DALPEAVRAS GPWVRTMGSV ADVMP
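
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. `clean_sequence` and `gc_content` are hypothetical helper names, and the example input is only the first printed line of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First printed line of the gene sequence, copied with its block spacing.
first_line = "ATGAGTGACC AAGCGTTTAA GGCCATCCTG CCTCCCGCCA GCCTCCATCG GCTGGGTCTG"
seq = clean_sequence(first_line)
print(len(seq))                    # 60 bases: six blocks of ten
print(f"{gc_content(seq):.1%}")    # GC fraction of this fragment only
```

Applying the same two functions to the complete gene sequence should give a value close to the 69% reported in the record, assuming that field was computed over the same 1578 bases.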