Gene Mlg_0639 details


Gene Information

Locus tag: Mlg_0639
Symbol:
ID: 4270828
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 689254
End bp: 691800
Gene Length: 2547 bp
Protein Length: 848 aa
Translation table: 11
GC content: 62%
IMG OID: 638125387
Product: hypothetical protein
Protein accession: YP_741483
Protein GI: 114319800
COG category: [S] Function unknown
COG ID: [COG4983] Uncharacterized conserved protein
TIGRFAM ID:
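
The length fields in this record follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (the helper names `gene_length` and `protein_length` are illustrative, not part of any database API):

```python
# Coordinates copied from the record above; assumed 1-based and inclusive.
start_bp = 689254
end_bp = 691800


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2547 848
```

Both values agree with the Gene Length and Protein Length fields listed above.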


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.0775939
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGGCAC AAAAACGAAA CCCCGAGACC CTAGAGCTTG GCGGCGGCGG GATCACGGGG 
CGGGCGATGA GCAAAGCTGA TTGTTCTGCC GGTGATTATA CCACTACCAT TCCCTCCAAC
GCCATGCCGT ATGAGCGGCA TATTCCCGAT GCCTGGGAGT TGCCCCTTGA TCTGGTGGAT
ATGCCGATCT GGTGCGCTTG GAAGCTGGTG CAAAAGCCTG GAAAGCCGAA GCCGGACAAA
GTGCCGGTAA GCGCCGCTGA TGGCTTGCAG GGTTCCAAGG CATGGGGCGG GAAGAAAAAC
CCAAAGCCCG AGTTCTGCAC CACCGCAGAG AAGGCAATCT ATTACGCGAA TCAGGCCAAA
GATATTACTG GCGTCGGCAT TATCCTGATG CCGGGTTTCG GGCTGATCGG TGGCGACCTG
GACGGTTGCT GTAATTCAGG TACGGGTGCG CCTACGGAAC AGGCCCGCCG AATCATCGAA
GCCGCCGACA CTTACACCGA AGTTTCCCCC GGCCTGGAGG GATACCGCTT TATTGCCCGT
GGTACGTTCG GCGGCCATAC CGGCAACAAT AGGGCGGAGG GCGTGGAGTT CTACGAGGAT
GGCCGTTTCC TCACGATCAC CGGCTATCAC GTCGAAGGCA CGCCGCACGC TATCGAGGAA
CGCGACCTGT CCGAGTTGGG GGCGGAGTAC TTCGACAAGG CTAGTTCCGA CACCAAGGCC
GGAGAGGCTG AACCCGAAAC GGGCGGAGGC CGTGGCCTGG AGGCGTTCGA GCTGCCGCCC
CATGTGCGCG GCTGGATTAC CGATGGCGTA GAGCAGGGCA CGCGGTCGGA TCGCATATTC
CAGTGCGCGA AAGACCTTGT ACGTGCTGGT GCCACCGAAG GGGAGGCCGT GGCCATCCTG
GCGAACCCTG AGCACGGCAT CAGCGACAAG CCGCTTCAGG AGCGCGGCGG AGACATTCAA
GGCGCTAGGC AGTGGGTGCA GCGTTACGCC GTCGCGCCTG CCCGCCGTGA TGTTGAAGCC
GAGCCGGACG AGTTCGACGA CGAGACGGGA AGCACTGAGA CCGCCGGATG GCCGGAGCCG
GTGGACCTGT TTGCATCTCG CCCGGTGCCG CCTTTCCCTG CCCAAGTTAT GCCGGAAGCC
TGGCAGCGCC ACGCGGCTGG CCTCAGTGCG CAAACCGGGT TTGACCCTGG TGGCTACCTG
TTCGTGATGC TGGCGCACGC GGGCTGCCTG ATGGACCACC GAACGCGGGT GGCCGTTAAT
TCGAGCTGGC GAGAGCCGCC GCTCCATTGG GCTGCCCTTG TGGATTCCAG CGGCGGAGGT
AAAACGCCAA TCCTGGGCGC AGCAGGGAAA CCTGCGATGA GCATTTACAG CGAGATCAAC
AAGCGGAGTG CCCGCGAGTT GATGGAATGG CACAAAGCGG CGGACAGCGC CAAGAAGGGC
GAGGAACCGC CCCGCCCGAT CTGGAAGCAG CGCAATACCG ACGATGGGAC CATTGAAGGG
ATGCGCGATG CCCTGGAGGG CAACCCGGAA GGCGTAACCC TCGCCATTGA TGAGCTGACC
GCGTGGCTCG GCTCTATGGA TGCCTTCACG AGCGCGAGAG GCGCAGCCAG CAAAGACAGA
CCGGCTTGGT TGGAAGCGTG GAGCGGCAAG GAGGATCGCG TTATTAACCG AGCCGGGAAG
GTGAAGGTTA TTCCCCATTG GGCTGCTGGG GTTATCGGCG CTATTCAACC CGAAGTGTTG
GCCCAGCAGT TCAAACGCGG GCATGGCAGC AGTGACGGCA TGATGCAGCG TTTCATGCTC
TACCAACTGC GCCCCGCCGC CGATGGCGAC CTGCTGACCG AGCCGGATAT GCTGGCGGAT
GCCAGTGCTC ACAACGTGTT CCAAGCCGTT GCCGATCTTG CCGAGGAAGG GCCGCAGCAC
TACCAGCTAG ACCGCGAGGC GGTGGCGCTG CTACAGGACT ACATGCAAGC CATTCGGGTG
CTTGCGGCCC GCACGCCGGG AGCACGCTTT GCGGAGCACT TGGGCAAGTT TCCCGCCTTC
GCGATCCGCG TTGCCTTGAC GCTCCACGTT GTCCACGCCG TGGCCGCCAA AGAGCGCCCC
TACACCGTTA TAAGCGCGGA GACGATGGAA CGGGCGCTAA CCATTATGCG CGTGCTCTAT
CGCCATTCTG AGGCCGTTTA TACGGTACTG GATGAGGCCA GCGAGGGAGC CCGCCGCTTG
ACGGTATCTG CTGCCGAGGC GGTGCTGGCC AAGGGTTGGA TGGTGCTAAC CCGTGGCGAC
TTGACCAGGG ATGCCACCGG ATGGCGAGGG GCTAACAGTG GCGATGCCGA AGCCGCTACC
GATCTGCTGA TTGAGTTCGG ATGGTTCGCG GATATTACCG ACCAAGCGCA GAGAGGCAAG
CGGGGGCGGC GCAGTGATGG CCGGTTCGCT GTTAATCCGC GAGTGCATGA GGTGTTTATG
CAGCACAGCC AGCGGATCAA AAAGGAGCGG GCCGAGCGGT TCCAAGCGAT CCACACGGCG
GCCACCGAGA GGCAGGGAAT TAGTTGA
 
Protein sequence
MEAQKRNPET LELGGGGITG RAMSKADCSA GDYTTTIPSN AMPYERHIPD AWELPLDLVD 
MPIWCAWKLV QKPGKPKPDK VPVSAADGLQ GSKAWGGKKN PKPEFCTTAE KAIYYANQAK
DITGVGIILM PGFGLIGGDL DGCCNSGTGA PTEQARRIIE AADTYTEVSP GLEGYRFIAR
GTFGGHTGNN RAEGVEFYED GRFLTITGYH VEGTPHAIEE RDLSELGAEY FDKASSDTKA
GEAEPETGGG RGLEAFELPP HVRGWITDGV EQGTRSDRIF QCAKDLVRAG ATEGEAVAIL
ANPEHGISDK PLQERGGDIQ GARQWVQRYA VAPARRDVEA EPDEFDDETG STETAGWPEP
VDLFASRPVP PFPAQVMPEA WQRHAAGLSA QTGFDPGGYL FVMLAHAGCL MDHRTRVAVN
SSWREPPLHW AALVDSSGGG KTPILGAAGK PAMSIYSEIN KRSARELMEW HKAADSAKKG
EEPPRPIWKQ RNTDDGTIEG MRDALEGNPE GVTLAIDELT AWLGSMDAFT SARGAASKDR
PAWLEAWSGK EDRVINRAGK VKVIPHWAAG VIGAIQPEVL AQQFKRGHGS SDGMMQRFML
YQLRPAADGD LLTEPDMLAD ASAHNVFQAV ADLAEEGPQH YQLDREAVAL LQDYMQAIRV
LAARTPGARF AEHLGKFPAF AIRVALTLHV VHAVAAKERP YTVISAETME RALTIMRVLY
RHSEAVYTVL DEASEGARRL TVSAAEAVLA KGWMVLTRGD LTRDATGWRG ANSGDAEAAT
DLLIEFGWFA DITDQAQRGK RGRRSDGRFA VNPRVHEVFM QHSQRIKKER AERFQAIHTA
ATERQGIS
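
The gene and protein sequences above can be spot-checked against each other. The following is a small sketch, not the database's own pipeline: it builds the codon table by hand instead of relying on a library, and it assumes that translation table 11 assigns internal codons the same amino acids as the standard genetic code (the two tables differ in which start codons they permit, which this sketch does not model). The names `translate` and `gc_fraction` are hypothetical helpers.

```python
from itertools import product

# Codon assignments in NCBI ordering (bases T, C, A, G); '*' marks a stop.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks and the final line of the gene sequence above.
first_blocks = "ATGGAGGCAC AAAAACGAAA CCCCGAGACC"
last_line = "GCCACCGAGA GGCAGGGAAT TAGTTGA"
print(translate(first_blocks))  # MEAQKRNPET
print(translate(last_line))     # ATERQGIS
```

The two printed fragments match the first block and the last block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare with the GC content field.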