Gene Mlg_0840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0840 
Symbol 
ID4270777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp952764 
End bp955370 
Gene Length2607 bp 
Protein Length868 aa 
Translation table11 
GC content70% 
IMG OID638125592 
Producthypothetical protein 
Protein accessionYP_741684 
Protein GI114320001 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATCG CCCAGTGCCC CTTGCTCGCC GCCATCGTCC CCGTCCGGTA CGCCATCGGG 
GTCAACGGGG TGTCCAGCCC CTTCCTGGAG GACTTCGACC TGCCACCGTT GCAAGGCCGG
CCGGTCACCG AGAGCCAGGG GTCTCCGGAC GAGGCGGCTC CGCTGCGTTA CGTCGCCCGG
CCCCTTCGCA ATGGCTGGTG TTACGTCTGG TTAGACAGCC AGCAACGGCT GGTCGAATAC
CGGGTGCGCG GCAGCGCGCT GGAGGAGACC GATCGGGCCG GCGCCCCCAT CGGGCCCACC
GCCAGGGTCT GCATCTACGT ACCGGCGGGC GAGACGGCCG CCATCGCCTG GTCCCCGGTG
CGCTGGAGCG ATGAGCAGTT CACCGCCATC GGGCGAGACG ACGGCAGGCG CCGGGCCGTG
ATGCGCGAAT TCGTGCCCGG CCAGGGGCCA GCCAGCAGCG CCGATGCCTG GTCGAACGAG
ATCCCTGAAC TCGGCGAGTT TGACGAAAGG GACTTCCACT GGAGTATCGA GCAGCCCGCC
ACCCTGCCCG TATGGGAGGA CATCAAGCGT GCGGTGGACG ATGCCGAGCA GCACGCCACC
GTGCTGGTGG ACGACCCCTG GGGGGTGGTC ATCGAACTGG CCCACCTGGT GCGCCAGGGC
CAGGCGCAGC GCGCCGACTG GCTCGCCAAC GAAGGCGAGG AACGCATTCT GGCCGAGAAC
ATCCTGGCCC TGGACCGTCA GGGCGAGGGC TTTCGCGGGC GACTCCCGCG GCTGGCCGAC
CGCGACCGCC TCGAGCAGGC CATTCACCAC CATGGGCGGG AAATTAGGGC CATCGAGGAG
AACCTCGATA CCCTGGCCGC CGACTGGGCG CGCTGGATGG GCACCCTCTG GGGGGGCGAA
GGCCCCGAGT CCATGGCCTC CGCCCAGAGC CACTTCGACC CCTCGCTTGA CGAACACCAC
GAGGCGATGG AAACCCTCTG GAGCGCGGCC CTGGGCGGTG TCACCCAATA CGAAACCGGT
GCCACCCTGG CCACCAACCT GCTGGATCCC GATACCGGCC CCCCGGTCAT GCCCGGCGGC
CACTCCCTGT GGACGGCACT GTTGGGGCGG TTAGAACCGG TGCAACTGGC CGATGTCCAG
CGCCTGGTCG GGGTCAGCGA GTCCCTGCAG GCCCAGGACT GGGAAACCTG GGCCCACAGC
CTCAACCACC TGGCCGGACA GCTGGGCCAC GGCCTGGCCA CGGCCCGCGA GGGGCTGTTC
CTCGTGCTCG CCACCACCAT TGGGCCCATT CTCCGGGAAC AGGGCGCCAC CTCCGCCCAC
CGGACCCTGA TCGCCGGCTA TCTCGCCGCC GCGCTGGCGC GCAGCCGGCA GCGCCTGAAA
GTGGAGTCGG TCGCCGCGCG CGCCCTGTTG GACTGGATGA ACGAACCCAG CGCCCGCGCC
GCCGGCGCCC CGTCGCCCCT GTACCAGATG CGCCCCGACC TGCTGCCCGA GCTCGACGCC
CGGCAGGTCA TCACCATCCG CGTGGTCGCG GAGGGCGCTT CTTCCGCCGA GGGCAACCCC
TTCCTGCAGC GCGCCCTGCA GGAGGCCCCG CTGAAAAGCC TGCTGGTGCT GATGAATGGG
TTGGTGGTGG TTCATGCGGG CAGCCAATTG GCCGCCGGTG ATCGTTCCGT ACAGACCGCA
ACAGCGGCGG CTGGGGGCGT ATCCGGCACC ATCAGCGCCA CCGCCGCCAC CATTCAGCAC
TTCGCGCAGA TCCGCTCCGA CGACATCCTT GCCCGCCAGG GCCTGTCCGC CGGCTGGCGG
GGCGCCTTCG ACCGCTACCT CCTCTGGGGT CAGGCCACCA ACCTGACTCT GTCGATCACT
GCGGTCTTCG ACGCCGTCTA CTTCGGCTAC GGCGCCTGGG AGAGCGTCCG CCAGGGGGAT
CGCCGATCGG GCACCATCCA GGCCGGCATG GCCACGGCCG CCGCCGGCCA GACCGCCGCC
GGCGCCCACG CCTTCCACAC CTACCGCCAG GCCCGGCAAG CCCTGCTCGT GGGCCGGTCC
ACCCAGGCCG CGCGCACCGC CGCCCGGGCC CGGATCGCAC CGGCCGTGAT CCTCGCCCTG
ACGCTGGTCA TCGTCGCCGG CGCGGTCAGC CTGCGCTTCA CCCGCGACAA CGAACTGGAA
CACTGGCTGC GCCGTACCCG CTTCGGCACC CACCCGGCCG ACTGGGCCGG CGACCTGGAC
GAGGAACTCG GCCACCTCTA CCGGCTGCTC TACCAGCCAC GCATCCGGCT GGAGGCCCGC
CAGGCCCGTA ACCCCCGCTC CGGCGATGTC TACCACTACC GTGTCCTGCT GGTCACCTTC
CCCGGCGCCA CGCCCTTCCC GGGCATGTTC ACCCTGGAGG CCACCGAGCG CTGGCGCACC
GGGCTGGTCC GGCATGACGA ACGCAGCCGG ACCATCACCG AAAAGGATCT CGACCTCGAC
ATCGGCGGGG CGGAGGCGCC CGACGGCCCC GTCTACCGAC TCATCTACCA CAACACCCGG
GAAGGCGACC GGCTGAGCTC CCTTTCCGGC ACCCTGTACT ATCGCCCGTT CCCCGATCTC
ACCCTGCCTC CCATCCGGAT CAACTAG
 
Protein sequence
MTIAQCPLLA AIVPVRYAIG VNGVSSPFLE DFDLPPLQGR PVTESQGSPD EAAPLRYVAR 
PLRNGWCYVW LDSQQRLVEY RVRGSALEET DRAGAPIGPT ARVCIYVPAG ETAAIAWSPV
RWSDEQFTAI GRDDGRRRAV MREFVPGQGP ASSADAWSNE IPELGEFDER DFHWSIEQPA
TLPVWEDIKR AVDDAEQHAT VLVDDPWGVV IELAHLVRQG QAQRADWLAN EGEERILAEN
ILALDRQGEG FRGRLPRLAD RDRLEQAIHH HGREIRAIEE NLDTLAADWA RWMGTLWGGE
GPESMASAQS HFDPSLDEHH EAMETLWSAA LGGVTQYETG ATLATNLLDP DTGPPVMPGG
HSLWTALLGR LEPVQLADVQ RLVGVSESLQ AQDWETWAHS LNHLAGQLGH GLATAREGLF
LVLATTIGPI LREQGATSAH RTLIAGYLAA ALARSRQRLK VESVAARALL DWMNEPSARA
AGAPSPLYQM RPDLLPELDA RQVITIRVVA EGASSAEGNP FLQRALQEAP LKSLLVLMNG
LVVVHAGSQL AAGDRSVQTA TAAAGGVSGT ISATAATIQH FAQIRSDDIL ARQGLSAGWR
GAFDRYLLWG QATNLTLSIT AVFDAVYFGY GAWESVRQGD RRSGTIQAGM ATAAAGQTAA
GAHAFHTYRQ ARQALLVGRS TQAARTAARA RIAPAVILAL TLVIVAGAVS LRFTRDNELE
HWLRRTRFGT HPADWAGDLD EELGHLYRLL YQPRIRLEAR QARNPRSGDV YHYRVLLVTF
PGATPFPGMF TLEATERWRT GLVRHDERSR TITEKDLDLD IGGAEAPDGP VYRLIYHNTR
EGDRLSSLSG TLYYRPFPDL TLPPIRIN