Gene Mlg_1743 details

Gene Information

Locus tag: Mlg_1743
Symbol:
ID: 4270850
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1995414
End bp: 1998317
Gene Length: 2904 bp
Protein Length: 967 aa
Translation table: 11
GC content: 71%
IMG OID: 638126501
Product: hypothetical protein
Protein accession: YP_742579
Protein GI: 114320896
COG category:
COG ID:
TIGRFAM ID:
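
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, ASSUMING the stop codon is part of the CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length(1995414, 1998317)
print(length, protein_length(length))  # prints "2904 967"
```

Under those assumptions the result matches the listed gene length of 2904 bp and protein length of 967 aa.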


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0189343
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.985267
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGACGC AGACCTCGCA ACCGTCCCAC CGCGACCACG GCCAGTACAG CGGCGTCTAC 
CTGCAGCAGG GGCGGATGAT CACCGACGCC GACTGGAACG CCCTCGGCGA GATCGGCCAG
CGCCGGCTGG TGGGCGCCCT GTGGGATGCC ATCGCCAGCG GCGCTCCGCG CGAGGGCGGG
CTGCGGTTGT CGGACGACAG CGGTCTGCGC CTGCACCCCG GCGTGCTTTA CGTGGGCGGC
GTCCCCGCCC GGCTGACCGG CGACGGGCCG CTGGCGCCGG GGGAGCAGCC CGATTACCCG
GATCCGCCCC CCTTCGACGG GCGGGACCTG ACGCTCTACG CCGACGTCTG GGAGCGTCCG
GTCACGGCCC TGGAGGACCC GGCCCTGATG GATCCGGCGC TGCACGGCGC CGATACCAGC
AGCCGGGGCG AGACCCTGCT GCAGGTGAAG TGGTGCCCGC GCGGGCTGGA CCCCGCCGAC
CCGGCGGTCA ACCCGCCGCT GGGCGATGCC CCGCTGGCGC TGCGGCTGCG TCACATCGTC
GTCGGCGATG ACCCCTGCGA CCCCTGCGCC AGCGAGATGA ACCTGGACGA GCGCATAGGC
AACTACCTGT TCCGGGTGGA GGTGCACGAC CTTTTCCTGG ACGAGGCGGG TGAGCGGCAG
CTGGTGCTGA AGTGGTCCCG CGACAACGGT GCCGAGGCCC ACCAGGCCGA TAACGTCCCG
GAGGGCTTCG ACCGCGGCGA CTGGGTCTGG GAGTTCTTCG ACGAGGCCAG CGAGCGGACC
CTGGGCCGGC ATTTCCCGGA GGACTACCGG CCCCGGCGCG GGCGCCTCAC CCCGGCGTTC
GAGCGCCCCC CGGAGGGGGA GCCGCGGGCG TTCGTGCGCC AGTGGGACGG CTTTCTTCAG
CTCAACCTGG ACCGGCCCGC CCTGGTCAGC GGCGTGGACC GGGGCGCCGA GCTGGACCCG
GAACTCGACC CGGACGCCCA CGGCTGGGCG GGGATCGACG CCGGTGTGCT GGCCGTCAAC
GGCGAACGCC TGGAACTCCG GCTCGCCTTC GCCGACCGCC AGTTCCTGCC CGGTGACTGC
TGGCAGGCGG CGGTGCGCGA GGCGGTGCAG GGGCCCGGCG ACTACGTGCT CGGCGACGAG
CATACCGGCG AACCGCCCCG CGGGGTGCGC CACCGCTACC TGCCTTTGGG CGAGCTGGAC
GGCGACGGCG CGCTGGTGCC GCACGGCGAC GCGGAGCGCC GCCGTTTCAA CTTCCCGCCG
CTCACCGACC TGGCCGCCGC CGACGTGGGC TTCAGCGAGC GCTGCGAAGC CCTCTATCGG
GGCGCGGAGA ACGTCCAGCA GGCGCTGGAC GCGCTCTGCG ACATCGCCGC CGAGCACATC
GCTTACCGGT TGCCCGATTG CGAGGGGGAG GAGGTGACGG TCAAGCGGCT GTTGGCCGAG
GCCCTGGCTG AACGTTGGCC GGATATCGAC AGGGACGGCC GGCTGAGCGT GCGGGACATG
CTCGACGGCT TGCTCTGTCA CCTGGACAGC GCGGCGCTGC CCCACGACGT GCCCGAGTGC
CGCGATGGCC GCCGCAGCCT GCGCGAGCGG CTGCGGATTC CCGCAGGGCG GACCACCACC
GCGGAACCGC TCAACCGGTT GCTTTGCGAT ACCACGGCCG ATCACCTCCC CCTGGGCCGC
GAGACCGAAC TCTGTCCGGA CCTGGACCGG GAAGGCGTGG AGACGGTGCA GGACGCCCTC
AACACCCTCT GCGGGCGCAG CGGTGGTGGC GGCTGCGCGC AGGTGGTGGA GCCGGGGGAG
CTGGCTCGGA CGCTGAAGGT GCTGATCGAA GAGCGGCGGG AGAGCATATG GCTGTGTCTT
AAACCCGGCG TGCACGAAAT CCCCGCGGAC CTGGCGCTGC ATACGGCCCG GCATATCCGG
ATCAGTGGTG GCGGGTATCA CGCCTGTGGC ATCCGCCTGG AGGGTGGAGA GTGGGCGTTG
CAGGCACCGC AGATCCAGCT ATTCGATCTC GGTATCGAAC TGCCCGACGA CGGCGAGAGC
CGGGTCCACC TGAGCGGGGA CGACATTACC CTCCAGCGGG TACACGTGCT CGCCCGGGAG
GCCGAGGGGG GCGGCGGCCA GCGCCGGCCA CTGCTCCATA TCGATGGGCA GCTTAAGGAA
GGTATTGGGC TTATCCGCCT GGAGGACTGC CGGCTCACCC CGGTGGATCG GGGCTGGGCG
CTGTTGCTGG AGCGGGTCTA CCAGGTCGCC TACATCCAGA ATAACCATAT CGACGGTCTC
GTGCGCTACC GGCATGGGGT GGGCAAGCCG GTGGACCCGG CGAGCCAGCG CATCGACTAT
GTGGACCTGC GCAGCGGAGA CACGCCCGAG CCGGTGGCCG ACGGGGATCG GGAAGGTGAC
ACCGGGGGTG ATGTGCCCCG GCCCAGGCCG GGGACCGGCG GAGGCCCGGT GGTCCGGCCC
ATCCCCATCC CGATCCGCGA CCCCATCCTG AACCGGCGGC TGCCGGACCG CGTCGCCGAC
GCCGAGGGCA GCCTTCACGT CAACAACAAT TTCATACTGC GCTGGACCAG CGATATGGAA
TCCGGCTTCG TTCGGGTGGA CGACCGGGAT CGGCGCTTCC TGGCCCGGGC GGTGACCGGT
CCGGCCATCT TCACGGTCGC CCAGAACACC TTCGGCATGC GGTCCAGCTT CATTGGCGGC
CGCCTTATGG CCCAGGGTAA TCAGCTGGTT GCGCTGGAGG ACGACAGCCG CAATACGCCG
GCCCTGTGGT TGCTCTGCAA CCAACTGGTG GCCCATGGCC ACATCGGGCC CGGGCTGGAG
GCGCGCCATA CGGCGGTGGA CGATGCGATC GGCAACAATC TGATGGCGTT CGCCTCACTC
AACCAGAACG GGGACCAGAC CTGA
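
The gene sequence is printed in blocks of ten bases, so it has to be joined before any computation. The sketch below strips the whitespace and computes a GC fraction, assuming the GC content field of the record means the share of G and C bases over the full gene sequence. The helper names are hypothetical, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases (0.0 to 1.0) in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence only; paste the full listing to
# compare against the GC content reported in the record.
first_line = "ATGAAGACGC AGACCTCGCA ACCGTCCCAC CGCGACCACG GCCAGTACAG CGGCGTCTAC"
print(f"{gc_content(first_line):.0%}")
```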
 
Protein sequence
MKTQTSQPSH RDHGQYSGVY LQQGRMITDA DWNALGEIGQ RRLVGALWDA IASGAPREGG 
LRLSDDSGLR LHPGVLYVGG VPARLTGDGP LAPGEQPDYP DPPPFDGRDL TLYADVWERP
VTALEDPALM DPALHGADTS SRGETLLQVK WCPRGLDPAD PAVNPPLGDA PLALRLRHIV
VGDDPCDPCA SEMNLDERIG NYLFRVEVHD LFLDEAGERQ LVLKWSRDNG AEAHQADNVP
EGFDRGDWVW EFFDEASERT LGRHFPEDYR PRRGRLTPAF ERPPEGEPRA FVRQWDGFLQ
LNLDRPALVS GVDRGAELDP ELDPDAHGWA GIDAGVLAVN GERLELRLAF ADRQFLPGDC
WQAAVREAVQ GPGDYVLGDE HTGEPPRGVR HRYLPLGELD GDGALVPHGD AERRRFNFPP
LTDLAAADVG FSERCEALYR GAENVQQALD ALCDIAAEHI AYRLPDCEGE EVTVKRLLAE
ALAERWPDID RDGRLSVRDM LDGLLCHLDS AALPHDVPEC RDGRRSLRER LRIPAGRTTT
AEPLNRLLCD TTADHLPLGR ETELCPDLDR EGVETVQDAL NTLCGRSGGG GCAQVVEPGE
LARTLKVLIE ERRESIWLCL KPGVHEIPAD LALHTARHIR ISGGGYHACG IRLEGGEWAL
QAPQIQLFDL GIELPDDGES RVHLSGDDIT LQRVHVLARE AEGGGGQRRP LLHIDGQLKE
GIGLIRLEDC RLTPVDRGWA LLLERVYQVA YIQNNHIDGL VRYRHGVGKP VDPASQRIDY
VDLRSGDTPE PVADGDREGD TGGDVPRPRP GTGGGPVVRP IPIPIRDPIL NRRLPDRVAD
AEGSLHVNNN FILRWTSDME SGFVRVDDRD RRFLARAVTG PAIFTVAQNT FGMRSSFIGG
RLMAQGNQLV ALEDDSRNTP ALWLLCNQLV AHGHIGPGLE ARHTAVDDAI GNNLMAFASL
NQNGDQT
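
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch rather than the database's own procedure. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes the sequence is in frame from its first base. The function name is illustrative.

```python
from itertools import product

# Codon table in the usual TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAGACGC AGACCTCGCA ACCGTCCCAC"))  # prints "MKTQTSQPSH"
```

The same function applied to the final 24 bases of the gene sequence gives NQNGDQT, matching the end of the protein sequence, with the terminal TGA read as the stop codon.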