Gene Mlg_0500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0500 
Symbol 
ID4268436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp548323 
End bp549564 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content52% 
IMG OID638125241 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_741344 
Protein GI114319661 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.565018 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGATC TCAGTTGGCC AGATGTCAGT CTCGGGAATA TATTCACCAT CAACACTAGT 
GCGGTGATCC CGAATGCGGC CCCCAATACC GAGTTCTATC ACCATAGCCT GCCCGCTTGG
GACGCCACAG GGGGACCTAC GGTGGAAAAA GGCTCTTCGA TCGAAAGCAA TAAAGTCAAT
ATCACCAAGC CTTGCGTGCT GGTTTCAAAG TTGAACCCAA GGAAACCTCG GGTATCTGTG
CTTGAATCGG TGGGAAAAGA CGAGCGTCAT TGCGCCTCAA CAGAATTCGT TTGTCTTGAA
CCAAAAGCCA AAGAGCACCT CAGGTTCTGG GGCCATCTTT TTTCGAACAA GAGGTTTGCA
GGCCACCTTG ATCGCATGGC TATTGGTTCG ACCAATAGCC ACAAACGGTT TAGCCCCGGG
GTGCTCTTGT CCTTAAGGAT TGAGCTGCCC TCAGAGCCGG AAAGGAGGCT GATTGCCCGA
ATCCTCGACA CCCTCGACAC CCAGATCCAG AAAACCGAGG CGCTTATTGC CAAGCTGGAG
AAGGTCAAGG AAGGCCTGCT CCACGACCTG CTGACCCGCG GCATCGACGA CAACGGCCAG
CTACGCCCCA GCCCCGAGCA GGCACCGGAG CTCTACAAGG AATCGCCGCT GGGGTTGATT
CCGAGGGAGT GGAACGCAGT CAGGCTTTAT GAAATGGCGG AAAATCATGA TGGACAGCGA
ATTCCGCTCA AAAAGTCGGA GCGCAAACAT GGCACATATC CATACTATGG AGCCTCCGGG
ATAATTGATT GGGTTGAGGG ATATCTATTT GAAGGAAGCT ATGTTCTTCT TGGGGAGGAT
GGTGAGAACG TTGTATCTAG GAACTTGCCG TTGGCATTCC CTGTTACTGG AAGGTTCTGG
GTGAATAATC ATGCGCACAT ATACTCTCCA AAAGATGACT GCGACACCCG GTTCTTGGTT
GAGGTGCTTG AGCAAAAGGA TTATTCGCGC TGGGTAAATG GTTCGGCCCA GCCGAAAATA
ACGCAAGCAT CATTGAGAAT GATGTGGTTC TGCAAGCCAC CAACGGCTGA GCAGAAGGCT
ATCTCAAATA GCCTTGAGGC AATCAATCAG CAGATTGACG AAGAGAAGAT CAAGATTGCC
AAAGTGAGAA CGCAAAAAGC AGGGGTCATG GACGACCTGC TAACCGGCCG CGTCCGCGTC
ACCCCCCTTC TCGACAAGGC CCAGGCCACG ACGCCAGCAT AA
 
Protein sequence
MSDLSWPDVS LGNIFTINTS AVIPNAAPNT EFYHHSLPAW DATGGPTVEK GSSIESNKVN 
ITKPCVLVSK LNPRKPRVSV LESVGKDERH CASTEFVCLE PKAKEHLRFW GHLFSNKRFA
GHLDRMAIGS TNSHKRFSPG VLLSLRIELP SEPERRLIAR ILDTLDTQIQ KTEALIAKLE
KVKEGLLHDL LTRGIDDNGQ LRPSPEQAPE LYKESPLGLI PREWNAVRLY EMAENHDGQR
IPLKKSERKH GTYPYYGASG IIDWVEGYLF EGSYVLLGED GENVVSRNLP LAFPVTGRFW
VNNHAHIYSP KDDCDTRFLV EVLEQKDYSR WVNGSAQPKI TQASLRMMWF CKPPTAEQKA
ISNSLEAINQ QIDEEKIKIA KVRTQKAGVM DDLLTGRVRV TPLLDKAQAT TPA