Gene Mlg_1571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1571 
Symbol 
ID4270593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1796128 
End bp1797891 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content66% 
IMG OID638126328 
Producttype II secretion system protein E 
Protein accessionYP_742408 
Protein GI114320725 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00763489 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGGTCG CCGACGAGCA CGATCTGGAA CAGGCGCTGA TCAAGGCGCG GGGCAAGCAG 
GTTCGGCGCC TCGGCGGCAT CCTCCTGGAA CGGGGCCTTA TCGACGAGAC CACGCTGCGT
GCGGCCTTGG ACACCCACCG GGCGCAGCCG CACCTGCAGC TTGGCCGCTG GCTGGTGGAG
CATCGCCACA TCACCCGGGA GCAGCTCGAG GACGCCCTGT GCGAGCAGCT CGGGATCCCT
CGGGTGGATC TGGCGGGGTT TGTGGCCAAG CCGGAGGTCG CCGGGCTGAT CCCCTACGAG
ATGTGCCTGC GACTCAACGT CCTGCCCCTG GCGCGCCACC GATCGGTGCT GATGGCTGCC
ACCGCCACCC CGACGGACGA GGAGTTGCTG GCCAACCTCC GTTTCCATAC CGGACTCAAC
GTGGAGCCGG TCCTGGCGCC TCCCCATCAG ATCAGCAGCG CGATTAATCG CTCCTATAAA
TCCCTGGCCA TCGGCGGTGA GGAGGGCATG GACACCCTGC TGACCACCGA TGAGGACCGG
GACCTGCGCC GCGACCAGGA GATAGAGAGC CAGGCCAGCA GCCGCCCGGT GGTGCGGCTG
GTCAATACCG TCATCCTGCA GGCGATCAGC CGCGGGGCGT CCGACATCCA CTTCATGCCA
CGCGAGAACG ACCTGGCGGT GATGTTCCGT ATCGACGGCG CCATCCAGCG GGTGCGACTG
GTGGACAAGG CGCAGCTGGC GGCGGTTGTC GCCCGCATCA AGATCCTGGG TCGCATGAAT
ATTGCCGAAA AGCGCCTGCC CCAAGACGGC CATGCGCGGG TGCGGGTGAG CGGCAAGGCG
GTGGACCTGC GCATTTCGGT GATGCCCACC TACACCGGCG AGAGCGTGGT CGTCCGCATC
CTCAACAAGG CCCACGGACT CAAGCGGCTG GAGGAAATCG GGTTCTCTGA GCGCGACGAC
CGGATCGTCC GCACCCTGAT CCAGCGCCCC CAGGGGATGA TCCTGGTCAC CGGACCCACC
GGCTCGGGCA AGTCCACCAC CCTCTACTCC CTGCTCCAGG AGGTGCGTCG TGCCGAGCCC
CACATCCTTA CCGTGGAGGA GCCGGTGGAG TACGACATGG AGGGGGTGGA ACAGATCCAG
GTGAATGCCG GCATCGGCTA CACCTTTGCC CGGGCGCTGC GCAATATCCT CCGCCACGAT
CCCGATGTGA TCATGGTCGG GGAGATCCGC GATCTGGAGA CGGCCGAGAT CGCCACCAAG
GCGGCGCTCA CCGGCCACAT GGTCCTTTCC ACCCTGCACA CCAACGATGC ACCCAGCGCC
GTGACCCGCC TGGTGGACAT GGGGGTGGAG CCCTACCTGG TGAGTTCCAC CGTGATGGGC
GTGCTGGCCC AACGGCTGGT CCGTGTTATC TGTACCCATT GCCGGGTGGC GCATGAGCCA
GAGGCGCTGG TGCGCCAGGT GATGGGCGTC GGCGACGAGC CATTCTGGAC CGGGACCGGC
TGTGATCGCT GCGACTATAC CGGGTTTCAC GGGCGTGCCA TGGCCTATGA ACTGCTGGTG
GCCGATCGCA ACATGGCCAC CCGTATCGCT CAGGGCATCA CCACCGAGGC GCTGAGGGAA
CTGGCCGTGG AGGGGGGTAT GCGCAGTCTG ACCCGGCACG GCCTGCACCT GGCGCGTACC
GGGGTCACCA CCCTGGAAGA GGCCTTCCGG GTCCGGCTGG AGGACCTGGA CGATGTCAAG
AAGGTGGTGG ACGCGGGTTA CTGA
 
Protein sequence
MKVADEHDLE QALIKARGKQ VRRLGGILLE RGLIDETTLR AALDTHRAQP HLQLGRWLVE 
HRHITREQLE DALCEQLGIP RVDLAGFVAK PEVAGLIPYE MCLRLNVLPL ARHRSVLMAA
TATPTDEELL ANLRFHTGLN VEPVLAPPHQ ISSAINRSYK SLAIGGEEGM DTLLTTDEDR
DLRRDQEIES QASSRPVVRL VNTVILQAIS RGASDIHFMP RENDLAVMFR IDGAIQRVRL
VDKAQLAAVV ARIKILGRMN IAEKRLPQDG HARVRVSGKA VDLRISVMPT YTGESVVVRI
LNKAHGLKRL EEIGFSERDD RIVRTLIQRP QGMILVTGPT GSGKSTTLYS LLQEVRRAEP
HILTVEEPVE YDMEGVEQIQ VNAGIGYTFA RALRNILRHD PDVIMVGEIR DLETAEIATK
AALTGHMVLS TLHTNDAPSA VTRLVDMGVE PYLVSSTVMG VLAQRLVRVI CTHCRVAHEP
EALVRQVMGV GDEPFWTGTG CDRCDYTGFH GRAMAYELLV ADRNMATRIA QGITTEALRE
LAVEGGMRSL TRHGLHLART GVTTLEEAFR VRLEDLDDVK KVVDAGY