Gene Mlg_0040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0040 
Symbol 
ID4270909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp41926 
End bp43257 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content64% 
IMG OID638124766 
Producthypothetical protein 
Protein accessionYP_740888 
Protein GI114319205 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAGAT CAACCATCGA CGATACGTTA CCCCCTATGC GCGCCGGCGA ACGGGAAAGC 
CGCGCCGGCG GCGAGCAGTA CTGTCTCAGC CCTCTGCCGC TGCCCACCGA CAGCGAACAT
GGCGTGGACG TGGCCCACAT CGACGTTCGC CGCGACGAGG TCACGATGGA CCTGCGCCAG
GCTGCCAGCC TCGATAGCGT CCTGATTACG GGGCATATCT TTAGTGTCGG CCTGACGCTA
ATTTCGGGAC TGGGGCTCCT TTTCTCCCTG AGTGCGGTGT ACAAGGGCTC ACTGCCCTTG
GAGTTGGTTG CAGGGGGGCT GGGAAGCATT TACGCGGTTA TATTGTTTCC TCTTTTTGCC
GGCATTCTCT ATCCCGATGT CGTACTGAGA CGCATCCCCC CCATCCGCCT CCATCGCCAA
CGCCGGGAGG TGGCCTTCGT CGTGGATGCC CCCGGGCGGC GCTTCTGGCT GCCCGACCCA
ACCAACATGT GGTTGATGGC TTCTTCGGGT GTCATTGCCT CGCTGACCGG AATTATGGTT
TTTGTGGACG TGTTCGAATG GATGGCGAAT CCAGATACGG TCTTCCCTCT CAAGGTACTG
CTGATCCACA TCGCCTCCTT GGCCTTCCTG CCTCTCTATC CCTATTTCCA TGACTTCTGC
CGCAAGCTCG TCGGCCAACA GCGCCAGACC GTGTTGGTGC CCTGGGAGGA CGTGATCGCG
GTCTGCGGGT TCAACCCCAG CCTCAGCGCC GGGGCCGTCA CCGGCTTCGG CTGGAACTTC
GCTCTGATGC CGCCCGATCC GGAACGACCC GGCTATACCC TGCCCGGGGC CGGCATCATC
GTCGGGGTCG GCGGCCTCCC GGGCGCGCTG GCCCAGTGGG AGTACATCCG CCGCTTCATG
GAGGAGGGGC CCGAGGCCAT CACCTCTTCC GCCCGGGAGT GGGGCCTCGA GTGGTACGAC
GCCTACGTGG CCCGGGAAAA GGCCGAGTGC GAACGCACCC ACGACAAGGC CCGCTGGCGG
CGCTTTCGGC GCGAGCAGTG GTGGAATCAT GCACGCTTCG CCCACTGGTA CACCGAGTAC
CGCATGAAGC ATGTGCTGCC CAAGGCGGTG CCCAAGGGCT GGCTGGCGGG ATGGTCCAGG
CCCCTTCCCC GGGACCAGTG GGCCCGACCT TCGCGCGAAC TCACTGAACT CGGCGAGAAG
CTGCGGCAGG CCTACCAGCG CGGCGAGAAG TTCATCGAGA TGGGCAATAT CGAGAAACGC
TTTGGGGTCG AGGTGGAGCC CTCCCCAGGT ACGGCTTATC GCACACTGCC ATTTGCGGCC
AGCGCGGCCT GA
 
Protein sequence
MPRSTIDDTL PPMRAGERES RAGGEQYCLS PLPLPTDSEH GVDVAHIDVR RDEVTMDLRQ 
AASLDSVLIT GHIFSVGLTL ISGLGLLFSL SAVYKGSLPL ELVAGGLGSI YAVILFPLFA
GILYPDVVLR RIPPIRLHRQ RREVAFVVDA PGRRFWLPDP TNMWLMASSG VIASLTGIMV
FVDVFEWMAN PDTVFPLKVL LIHIASLAFL PLYPYFHDFC RKLVGQQRQT VLVPWEDVIA
VCGFNPSLSA GAVTGFGWNF ALMPPDPERP GYTLPGAGII VGVGGLPGAL AQWEYIRRFM
EEGPEAITSS AREWGLEWYD AYVAREKAEC ERTHDKARWR RFRREQWWNH ARFAHWYTEY
RMKHVLPKAV PKGWLAGWSR PLPRDQWARP SRELTELGEK LRQAYQRGEK FIEMGNIEKR
FGVEVEPSPG TAYRTLPFAA SAA