Gene Mlg_1742 details

Gene Information

Locus tag: Mlg_1742
Symbol:
ID: 4270849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1991043
End bp: 1995389
Gene Length: 4347 bp
Protein Length: 1448 aa
Translation table: 11
GC content: 71%
IMG OID: 638126500
Product: hypothetical protein
Protein accession: YP_742578
Protein GI: 114320895
COG category:
COG ID:
TIGRFAM ID:
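The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below is a consistency check, not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Number of amino acids, assuming one terminal stop codon is excluded."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(1991043, 1995389)
print(length)                              # 4347
print(expected_protein_length_aa(length))  # 1448
```

Both results agree with the Gene Length (4347 bp) and Protein Length (1448 aa) listed in the record.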


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.416454
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGTGGCCT TCGCCCAGCA ACAGCGCGCG CAACCGGCAC CGGCCCGGCG CCCGCAGCCG 
GTGGCGAAGC CCCATAGCCA GCGCGCGGCC ATCGCCGCGG TGATCGGCCA GGCGGGCCTC
CAGCCCCGGC TCAAGCTCGG TGCCCGCGAC GACCCGCTGG AGCGCGAGGC CGAGCGCACC
GCCGAGCGGG TGACCCGCGG CCCGGCGCCG GCATCGGACG ACCCGACGGC CCCCGGCGCC
GGGTCGCCCG ATGCCGCCCG CCGGGCGCCT CCCGCCACCG CTCCGGTCGC GAGCGGGGCG
CACGGTGCCG CGCCGGCGGA CGCCCCCGTC CCCCTGCCGG AGGAGCGGGA GGAGGAAGCC
CCCGGCGCGG CCGATGCCCT GCCGGAACCA CTGGCCCGGC GCATCGGACA CTTGCAGCAG
GGGGGGCGCC CGCTGCCGGA GGACCTGCGC GCCTTCATGG AGCCGCGCTT CGGGCAGGAT
TTCTCCGCGG TGCGCATACA CACCGACGAC GAGGCCGCCC GCCTCTCCGA GGCCATCCAC
GCCCACGCCT TCACCCTGGG CGAGCACATC GCCTTCAACC GCGGCGCCTT CCGGCCCGAC
AGCCGTGACG GCCGGCGGCT CATCGCCCAC GAACTGACCC ACGTGGTGCA GCAGCGCAGT
GCGGGCGTGG ACCATCAGGA TGCCACTGCC CCGCCGGTGC GGCGGTCCTG GATTGACAGC
TTGCGCCGAG GTGGTCGCCG GTTGGTGGGG GCCGGGATCG ACATGGCCAA GGGGGCGTAC
GACCGCCTCA CCGGGGCCGT GGGCGATATC TTCGACAAGG CCGGCAACTT CATTGCCTCC
AAGGGCATGG CGGTGGTGGA GAAGCTCGCC CCCAACCTGG CCCCCATACT CCAGGAGATC
ATCGACAAGG GCCCGGTGGG GTGGCTCAAG GACCGGGTGG CCGCCGTATT CGACCGGCTG
GTGGAGGCGG TCACCCGGCT CACGCCCGAG GGGTCGGTGG AGCGTCTGAG CGAGCTCTTC
TCCGGCATGC TCGAGCGGGC GGGCGCCATC GTGGAGGCGC TGGCCTCGGG GGACTGCGGG
CCGCTGTTCG ATGCCATCCG GCGGCTGCGC ACCCTGGTGA CCGATGCGGC CGGGGCGGCC
TGGGACCGGC TCACCGACTT TCTCGCCCCC GTCGGTAACT TTTTCGCCGA TCTCTGGGAG
AGCACCGGTG CCCGCGCCCT GGACTTCATC ACCGGGGTGG CCGGTGACCT CTGGGACGGC
ATCAAGGCCC TCGGCCGTCG GATCTGGAAC TGGACGCGGC CCATTCGCGA GACCCTGGGT
GCGGCCTGGG ACTGGGTCAA GGGCAAGCTA TTCGGATCCA GCAACGACAG CAGCGGAAAC
GATCAGGGCG GCCTGGTCGG CTGGGTGCAG GACAAGGCCG GCGAGGCGTG GGACTGGATC
AAGGCACGCA CCCGGCCGCT CTGGCAGCCC ATACAGCAGG GCGTGGAGTG GCTGCGGGAG
CTGGTGCCGC CCGCTTTCGT CAAGCGCCTG GGCGAGGACA TGCAGGCGCT CAGCAACGAC
CTGGGCACGG CCGAGCAGCA GATGGCGGGG GCCGGCGAGG AGGGCGGGCC GGGCGCTGGC
GTGGCGGAGA ACCGGGCGGC GCTGGCCAAC GCGTTGCCCA CCGTGGAAGC GGTGCTAGCG
CGGGTGCGGG GCCTCCTGGT CGACTCCGGG CGTTGGCTGG TGGAGCGGCT TGGCGGGGTG
GGGGATAAGG TGGCGGCCTT CTTCTCCGGT CTGCGCCAGT TGGACATCAC CCGGCCCCTG
GCGCGGGCGC TGGGCTGGCT GGAGCGGGGC ATCGCCGGGC TCAGCCGCTG GGCGGAGAGC
GGGGTAAAGC AGCTGTTCGA CGGCCTGGTC GCCGGCTTTG ACCGGCTGAC GCCCTTCATC
GAGCGGATGA TCGGCGTGGT GCGCCGGTTG CTCTCGGTGG TGACCGACCT GATGCAGTTG
CCGCAACTGG TGTTGTCCGC GGCCTGGCGG CTGATCCCGG AGTGCATCCG CAAGCCGGTG
CAGGACTTCA TCGTCAACCA GATACTGGCG CGCATCCCGG TCTTCGGCCA GCTCCTGGCC
CTCCCGGACC TCTGGGAGAA GGTCAAGGCC AAGGCCATGG AGATACTGCG CAAGCTGTTC
GTGGACGGAG ATCTCGCCGC GGCCGCCTGG GCCTTCTTCC AGGCCGCACT GCGGCTGATC
GGGCTGCCGC CCGAGCTGGT GGTCAGCCTC CTGGCCAACG CCGCGGCGGC CATCGGGCAT
ATCCTGCAGG ACCCCATCGG CTTCCTGCTC AACCTGGTGC GGGCGGCCGG GCGCGGCTTC
TCCCAGTTCT TCAGCAACAT CGTCGGCCAC CTCAAGGCGG GGATAGCCGG CTGGCTGTTC
GGCACCCTGC GCAAGGGCGG GATCGAGCCG CCGGAGGACT TCTCCCTGCG CTCGATCCTC
GGGGTGGTGC TGCAGATCCT GGATATCACC ACCGACCGGA TCTTCGACCG CATCGGCCGC
CAGGTGGGCT CCGGCGTGGC GCTTCGCATG CGGCGGATGC TGGAACACGC CTCCGGCGCC
TGGCAGTTTC TGCGCACCCT GGTCGAGGAG GGGCCGGGCG CGCTGTGGGC GGCGTTGCGG
GAGCGCTTGA GCGATCTGGG CGGGCAGGTG CTGGACAGCA TCATCAACTG GGTGAGCGTC
ACCATCATCA AGCAGGCCAG TATCCGGCTC ACCCCGCTGC TCGCCCCCAC CGGGGTCTCC
AACGTCATCG CCCTGGTGAT GGAGCTCTAC CGGGTGATCA CGGCGCTGAC CGCCCAGTTG
CGGGCGTTGC TGGAGGTGGC GAACCGCTTC CTGGCCGGGG TGGCCGAGAT CGCCTCCGGG
GCGTTGGGCA AGGCCGCCGA CTACCTGGAG GATGCCCTGG GCCGGGCGGT GCCTCCGGCG
CTGGCCTTCC TGGCCCACTA CGCCGGGTTG GGGGATGTCA GCAGCCGCAT CCGCGAGATG
GTGGAGGGGA TCCGCGAGCG GGTCGACGCC GCCCTCGACT GGCTCATCGA GCGCGCCCTG
CGCCTGGGCG AGGGCTTCGT GGAGCTGGCC CGCCGCGGGG GGCGGGCGGT GCGGCGCGGG
GCCCGCGCCC TGGTCAACTG GTGGCAGGCC GAGCGCGAGG TGGAGACCGA GGGCGGGGAA
CGGCATACCC TGTCCATCGA CGCGGAGAAG GAGCGCCAGG CGCTGACCAT CGAGTCGCGG
CCCACCCCCT ACGCCGAATT CATCGCCGCC CTGAAACTGC CGGACGACGC CGCCGACACC
CGCCGACGGG CGTTGGCGGC CGCGGAAGCG GTGGAGGGGC TTATCGGGGA GTCCCGCGAC
CTGGAGGGCC AGTTGGAGGC CTCGGCCATC GACGAGCCGA CCTACACGGA GAAGCGCAAG
GCACTCAAGA ACAGGATGGA TGAGGCCATG GACCAGCTCT CGACCCTGAC CCGGGAGCTG
CTGGAGCAGT CCACCGGGGG TTCGGTCGCC GACCTGCCTT CCACGCCGGC CATCTACGGC
CCCAGAACCG CGGCGGGTTT CGGCAGCTCG GTGCGGGTGG AGTTGCTGAC CCGCGACCAT
CCGACCGGCA GCACGCCGCG TAATGCGCCC GAGAACGAGA GCTGGAACCT GTTGCGGCGG
CGCATGGACG GCGGGGGTAC CTACTATGTC CGTGGGCACC TGTTGAATGA GCATCTGGGC
GGCCCGGGGG ATACCTGGGA CAACCTGACG CCGCTCACGC AGGGTGCCAA CAACCGGGAT
TCACAATCGA TGCTTCACCG GTTTGAAGAC CCGGTCAAGG ACGCGGTGGA AGGGGGACAG
GCCGTTAACT ACATCGTCAC CGCCAACTAT GGCGTATCAC ACCCGCTGGT GGCGGAGGCC
GAAGCCCACC GGACGGAGGA GGGCGATACC GACGCCGACG TGATCGCGGA TATCATCCAG
GCGGAGCAGC GGATCCCCCG GACCCTCGAC TGCAGCTCGG AGAAGATAAC GCCGGACGGC
AAGGCCGCCG GCACGGTGGC GAGCCATCAG GTGGATAACC GTTTCAAGGC GAACGCCCTG
GATGATTACA GCATCCGCGC CCGGCCCAAG ACCCGGTTCT ATATCGACGA CGAGGCCCTT
GCGGCCAAGC GTGCGGACAA TGTCGGCCGG TTGGCCGAGC TGGACGGGGT GGATCACGAC
CTGGCACGGG CGATTGTCGA CAATCGGCCG GATGGCGGTT ACCGGCGGAG CGCTACGCTG
AAGAAGGAGG CCAGGATGAC CGACGCCCAG TGGGAGGCGG CCCGCAAGAC CGATGCCTTT
CATGTCCATT TCTTTCGACG CAGCTAG
 
Protein sequence
MVAFAQQQRA QPAPARRPQP VAKPHSQRAA IAAVIGQAGL QPRLKLGARD DPLEREAERT 
AERVTRGPAP ASDDPTAPGA GSPDAARRAP PATAPVASGA HGAAPADAPV PLPEEREEEA
PGAADALPEP LARRIGHLQQ GGRPLPEDLR AFMEPRFGQD FSAVRIHTDD EAARLSEAIH
AHAFTLGEHI AFNRGAFRPD SRDGRRLIAH ELTHVVQQRS AGVDHQDATA PPVRRSWIDS
LRRGGRRLVG AGIDMAKGAY DRLTGAVGDI FDKAGNFIAS KGMAVVEKLA PNLAPILQEI
IDKGPVGWLK DRVAAVFDRL VEAVTRLTPE GSVERLSELF SGMLERAGAI VEALASGDCG
PLFDAIRRLR TLVTDAAGAA WDRLTDFLAP VGNFFADLWE STGARALDFI TGVAGDLWDG
IKALGRRIWN WTRPIRETLG AAWDWVKGKL FGSSNDSSGN DQGGLVGWVQ DKAGEAWDWI
KARTRPLWQP IQQGVEWLRE LVPPAFVKRL GEDMQALSND LGTAEQQMAG AGEEGGPGAG
VAENRAALAN ALPTVEAVLA RVRGLLVDSG RWLVERLGGV GDKVAAFFSG LRQLDITRPL
ARALGWLERG IAGLSRWAES GVKQLFDGLV AGFDRLTPFI ERMIGVVRRL LSVVTDLMQL
PQLVLSAAWR LIPECIRKPV QDFIVNQILA RIPVFGQLLA LPDLWEKVKA KAMEILRKLF
VDGDLAAAAW AFFQAALRLI GLPPELVVSL LANAAAAIGH ILQDPIGFLL NLVRAAGRGF
SQFFSNIVGH LKAGIAGWLF GTLRKGGIEP PEDFSLRSIL GVVLQILDIT TDRIFDRIGR
QVGSGVALRM RRMLEHASGA WQFLRTLVEE GPGALWAALR ERLSDLGGQV LDSIINWVSV
TIIKQASIRL TPLLAPTGVS NVIALVMELY RVITALTAQL RALLEVANRF LAGVAEIASG
ALGKAADYLE DALGRAVPPA LAFLAHYAGL GDVSSRIREM VEGIRERVDA ALDWLIERAL
RLGEGFVELA RRGGRAVRRG ARALVNWWQA EREVETEGGE RHTLSIDAEK ERQALTIESR
PTPYAEFIAA LKLPDDAADT RRRALAAAEA VEGLIGESRD LEGQLEASAI DEPTYTEKRK
ALKNRMDEAM DQLSTLTREL LEQSTGGSVA DLPSTPAIYG PRTAAGFGSS VRVELLTRDH
PTGSTPRNAP ENESWNLLRR RMDGGGTYYV RGHLLNEHLG GPGDTWDNLT PLTQGANNRD
SQSMLHRFED PVKDAVEGGQ AVNYIVTANY GVSHPLVAEA EAHRTEEGDT DADVIADIIQ
AEQRIPRTLD CSSEKITPDG KAAGTVASHQ VDNRFKANAL DDYSIRARPK TRFYIDDEAL
AAKRADNVGR LAELDGVDHD LARAIVDNRP DGGYRRSATL KKEARMTDAQ WEAARKTDAF
HVHFFRRS
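
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any computation such as length or GC content. The following is a minimal sketch of that cleanup, assuming plain whitespace is the only separator. The helper names are hypothetical and the example uses only the first line of the gene sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence shown in this record.
first_line = "ATGGTGGCCT TCGCCCAGCA ACAGCGCGCG CAACCGGCAC CGGCCCGGCG CCCGCAGCCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of first line: {gc_fraction(seq):.2f}")
```

Applying the same two functions to the full gene sequence should give a length equal to the Gene Length field and a GC fraction comparable to the listed GC content.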