Gene Mlg_0842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0842 
Symbol 
ID4270779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp956389 
End bp958446 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content67% 
IMG OID638125594 
Producthypothetical protein 
Protein accessionYP_741686 
Protein GI114320003 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.456068 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTTGA ACCTGGCGCC TCACATTGCC AATGCAGGCG ACGCCCATAT GCCTGAGTGG 
CCGGGCGAAG CCGGTGACGC CGAGCGTAAC GACCTGGAGT CGCCCGCCGA GCTCTATTCG
CAGCATCGCG AAGCCATAGA GCAGGTCACT GCCCGTTCAG GCGGCCACAC GGGCTTCATT
GGCGGGGCGC TGGCGCCCAA CGGCATCGTC GACGTGATCC GCCGGCAGGC GATGGAAGAC
TTCCTGGCCT ATGCCACCCC GATGCAGGCG CAGTGGCAGA GCGAGTATCA CCGTATCGGG
GCCGACCTGG CCGCCATACT GCCCACCTGG CACACCCAGG CCCTGCTGCT CGACCGCGAA
GAGGAGCATC ACATCCTGCT GACCTGCCTG CTGGAAAAGC AGGCCGTCGA GACCCTGCTG
GCCTGCGGGC AGGAGGACTT CCTGTCGAGC TACTACGCCG GCGACGACCC GGTACCGGCA
CACCTGATGC ATTACGTCCC CACCGCGTCC TTCGTGGAGG GATTCCTTTC CCAAAACACT
GGCCTCCAGA AAGCGCTCAC CCAGGCGTCC GCACTGATGG GCGCCCAGGG CGCACTCAGC
CGCTACCAGC AGTGGCGGGG CGAGGTTGAA CATCAGACCG GGCTGCGCTT TCGCAGCGTC
GAGGGCCTCT CCGACGAAGC CCGAACCGCC ATCGCGGGCG AAGTGCAAAT CAAGGAGAAG
CTGCTGGGGC AGGCGGTGCT CGGGACGTTG CTCGATGACG TACAGGATGT CGACCTGGGG
CAGCGCATCA CTACCCTGGC TTCGCGCTTG CCCGATGGGC AACGGCTGAT GTTCGCGGAA
CGGCTGGGCC TGCTGGAGCT GGGCTGGGAC ATTCCCGATC AATCGGTACT GGGCCGCATC
CAGCAGGCGC TGGACGACAC CGATACGGCC ATGACCCGGC TGACCGCGCT GGAGCGGGAG
CTCGAGCAAC TCCGGCGGAA ACGCCAGCAG GAGATGGCGC GCGCGTCAAG GCGCGGTACC
CGCCAGGCCC ACCGCCGGGC GGCCGACCAA TTCAACGCGC GCAAGATCCG GGAGGCCCAG
GCAGACATCA ACCGGCACAA GCGCCTGCTG GGCGAGGCAT TCGACGCCCT GGCCGAGACC
CGTGCGGCCG CCGGCAGCCC GGCGCAACTG GAGCGCTGGG CCAGGGTAGC CAATGGGGCG
ATGGGGGCCG TGGCCCTATT GGGTGGCCTT TCAGGTGCTC TCGAGGTTTT CAAGCAATCT
CGGCGTATCG ACCGCGCAGA CACCGACGCC GAGCGACTGG CCTCACAGGT TGCCTTTACG
GGCGCGGCCG GAGTCGCCCT GGGTGGCCTA TCAATCGGCA TCATGAGTAT GGTCGGTCGA
ACCCTGGGCA AGCCGGCCGT CGCCTGGCGA CTGCTGCTGC TCAAATTCGC CGGCCCCGCC
GGCTGGGTGG TGGCGGTTGG CACCGCCCTG CTGATCATCG GCGAGGTGCT GGCCAATCGC
TTCTCGTTGA GCCCTGTGCA GCGCTGGTGC CAGCGCAGTC ACTGGGGGCG AGAAGATCAG
GGCTGGGATC GCGAGGCCCA CGAGCGGGAA CTGGCCCGAC TTGGCGATAC CGATCTCACG
GTGGAACGGC AGGGGCAGGC CGAGCCCCAT GGCGGCCCGG GGCCCGGGCC GGCAGGCACC
GACCTCGCCA TACGCATTGG CTTGCCCGGG CTTGACGCCC CCAATGCGGA AAACCTCGCG
CTGGGCCTCT GGGGCGTCAC CCCTCGCCTC AAGGAAATGA CCCGAGACTT TCTCGAACAT
GCCGAGCTCG AAACCCGGGG CTCGAGCTAT GCCCTGCACT ACCATTTCGA TCCCGAAACA
TTGGCCGAAT GCCACGAGTT CCGCCTCGTC ATCCGCACGA AGGGCCCCGA AGCATCCACC
ACCCGGGTCT TCCAGTTGCA TCGCCGCGGC ACATCGCTCT CCGATGAGTG GAGGGAGATC
TCCGCCCTCG GCGATCGTTT CCTCACGCGG TACCAAGTGG GCAACTGGCC GGACATGCCC
CTGACGCCCT GGCCGTGA
 
Protein sequence
MSLNLAPHIA NAGDAHMPEW PGEAGDAERN DLESPAELYS QHREAIEQVT ARSGGHTGFI 
GGALAPNGIV DVIRRQAMED FLAYATPMQA QWQSEYHRIG ADLAAILPTW HTQALLLDRE
EEHHILLTCL LEKQAVETLL ACGQEDFLSS YYAGDDPVPA HLMHYVPTAS FVEGFLSQNT
GLQKALTQAS ALMGAQGALS RYQQWRGEVE HQTGLRFRSV EGLSDEARTA IAGEVQIKEK
LLGQAVLGTL LDDVQDVDLG QRITTLASRL PDGQRLMFAE RLGLLELGWD IPDQSVLGRI
QQALDDTDTA MTRLTALERE LEQLRRKRQQ EMARASRRGT RQAHRRAADQ FNARKIREAQ
ADINRHKRLL GEAFDALAET RAAAGSPAQL ERWARVANGA MGAVALLGGL SGALEVFKQS
RRIDRADTDA ERLASQVAFT GAAGVALGGL SIGIMSMVGR TLGKPAVAWR LLLLKFAGPA
GWVVAVGTAL LIIGEVLANR FSLSPVQRWC QRSHWGREDQ GWDREAHERE LARLGDTDLT
VERQGQAEPH GGPGPGPAGT DLAIRIGLPG LDAPNAENLA LGLWGVTPRL KEMTRDFLEH
AELETRGSSY ALHYHFDPET LAECHEFRLV IRTKGPEAST TRVFQLHRRG TSLSDEWREI
SALGDRFLTR YQVGNWPDMP LTPWP