Gene Mlg_1745 details


Gene Information

Locus tag: Mlg_1745
Symbol:
ID: 4270852
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2000041
End bp: 2003001
Gene Length: 2961 bp
Protein Length: 986 aa
Translation table: 11
GC content: 73%
IMG OID: 638126503
Product: hypothetical protein
Protein accession: YP_742581
Protein GI: 114320898
COG category:
COG ID:
TIGRFAM ID: [TIGR02243] conserved hypothetical protein, phage tail-like region
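
The coordinate and length fields above can be cross-checked with simple arithmetic. A minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2000041, 2003001)
print(length)                  # 2961
print(protein_length(length))  # 986
```

Both results agree with the Gene Length and Protein Length values listed in the record.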


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0343775
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGATG ATGATCTGAC CCGCTGGAAC CGCGCCGGCC TGAGCCGCTT CCGTTACCTG 
GACGGCAATG CCGCCACCTT CCTGGAGGAG CTGCGCGCCG GGCTGCAGGC GCGCTTCCCG
CGTTGGCCGG CGGTGGCCGG GGAGGGACCC CCGGAGGAGG ACGAGCGCGA GTGGCGGGCC
CGCCTGGAGC GACACTACCA GGCCGACCGG GACGACCTCC TGTGGCAGAT CGGGCGTGGC
TTCGCCCGCG CCAGCCACGT GCTGGGCGAG CACCTGGACG TCTACGCCAA CGAGGCCACC
CTGGGCACCG CCGGCGAGTG GGAGAACCTG CGCAAGCTGG TGGCCATGCT CGACTACCAC
CCGCGGCCGC CGGCCTCCGC GCACACCACG CTGGCGGTAC TGGCCAAGAA GGCCGGGCCG
CTGGCGGCCG GCTTTGCGGT CAAACACAGC CCGGCGGACG GTGCCCCGGT GGTGTTCGAG
ACCCTGACCG ACCTGGACCT GGACCCGGTG CTCAACGCCG TGCGCCCCGC CGATCACGAC
CGCAATCCGG CACCGCTGCA GGGGCAGTAC CTGGAACTGG CGGGCGAGCA CGACAAGCTG
ACCCGGGGCA CCCCGCTGGT GGTGGAGGAC ACCCGGCGCG GCAGCAGCCG CGCCCACCTG
ATCCAGTCGG TGACGCTGGA CGAAGCGCGC GGTGTCACCC GGGTCCGGGT GAGCCCGCGG
CTTTCGTCCC GTTACCGGGT GGGCGATGTC CGGGTGCACG TCCTGCCCAA GGAGCGCCTG
GCGGTTACCG GCCCGGTGCT GAAGGGCGCG GTGCTGGGGC ACAGTCTGCG GCTCGGCGAC
GACACCGGAG ACCTCAAACC GGGGGAGACG CTGGTGCTCA GCAACCCGGG CCACAAGGCG
CGCTTCCTGC GCGTCGACCG GGTCCGGCCC CGGCTGCTCA GCTTCAAGAC CCCCCTGGGC
AAGACCTACC TGGCCGGCGC GCGCCTGTCC CGGCCGGTGG AGGTGCCGGT GGTCCGCCGT
GCCGGGGTGC CCTGGCGGCG CCGCATCGAG GCCGGTGACG ACAAGGGCAA GAACCTCTAC
GTGGTGTTCG TCGCCGGCGA CTGGCACCGC CTGCAGAACC AGTGGGTGGC CCGCCGCCCG
TCGGCCGTGG ACACCGCCCT CCGTTCATTC AAGGTGACCC GCGCCCACTA CCAACCGGTG
GGCGTGCCGC CGGAGGCCGA TGACTCGCCG GCCTGGGAGG GCTACACCGC CCTGAGCCTG
GTGGGCGACG AACTGGATAA CAATCCGCAG TACCTGCTGG CGGTGCCCGA GTCACCCGGC
CCCTGGGCCC CCGACCCCCT GCTTGAGCGG GCGGAGGGCG GGGTGCGCGA TCCGCTGATC
AGCGAGCACA GCAAGCACGC GGCGCCGGGC GATTTCGCCG TGCTGGTCTG CGGCGGCGCG
ATCGCCTGGG CCCGGCTGGG GGCGGTGGCC GAGGACGAAG AGGGCGAGCG CACCACCCAC
CATGCCGCCG GCGGCGCCTG GCGGGACAGC GGCGGCGGCC CGATCCATCC CGACAGCTCG
GGGGCGCGTG AGGGCGGGCC CTTCTACCGT GATGCCTCGC AGCTCTTCGT GCACTTCACC
GAGACGGTGC GGCTGCACGA CGGGCAGCGC AACCCCACGC CCCTGCGCGG GCGCACCCTG
CCGGTGTCTG ATCCGGACGG GGTGCTGGCC GCTCGCCTGG GCCAGGGGCA TCGCCTGCTG
CTGGACAACG GCAGCGGGGC GACCACCGCC CGGGTGGTGA AACTGGAGGG CGGCGATCCC
TTGCGCCTGA CCCTGTCCGA GCCGCTGCCG GACGACAGCC GCCACGACAA TCTGGTCCTG
TACGGCAATG CGGTGCCGGC CGGCCACGGC AGCGGCAAAC CGGAACAGGC CCTGGGCAGT
GGCGACGCCA CCGAGCGTCA CCCGGCCTTC GAGCTGGCCG TCAAGGATGT CAGCTTCGTC
GCCGACCCCA GTCAGGCGAG CGGGGTGCGG GCGGCGGTGG AGGTCACGGT GGATGACCGG
CGCTGGACGC AGATCGCCAA CCTCAAGGAC GCCGGGCCGG AGGACGCCGT CTACACCGCG
CGCCTGACCG AGGATGGCAC CCTGCAGGTG CGCTTTGGCG ACGGGCGTCA CGGGCGCCGA
CTGCCCACCG GGACGAATAA CGTGCGCATC CACTACCGCC AGGGCGTGGG CACCCGGGGC
AACCTGCCGC CGGGCTCGCT CACCCAGCCC CAGCGCCCCC ACCCGCGGGT GGCGTCGGTG
CGCCAGCCGC TGCCGGCCGG TGGTGGCGCC GACCGCGAGC CGGAGGCGGA CCTGCGGGAG
AGTGCCCCGG CCACCCTGCT GACCCTGTCC CGGGCCGTCT CGCTGCGCGA TTTCGCCCGC
CTGGCCCGGG CGCACGCCAG CATCTGGCAG GCCAACGCCT TCTCCCGGCC CACCCGGCGG
GAGCGCCGGG AGAGCCTGGA GGTGGTGGTG GTGCCCGCCG AGGGGGCCCG CCTGACCAGT
GAGCTGCGCG ACCAGCTCAC CCGCTACCTG GGTACCCACG GGGTGCCGGG GGTGGACCTG
CGGGTGGAGG ATTACGTGCC GGTGGTGATC GGGCTCGATA TCACCCTGCG CATCGACCTG
GATGCCTTCG ACCCGGAGCC GGTGATCGAG GCGGTGCGCG CCGCGCTGGA AGAGGCCTTC
TCACTGCGAC GCCGGCGCCT GGGCCAGCCA CTCTACCGGG GCGAGGTCTT CCAGGTGGTG
GAAGGGGTGC GGGGGGTGGC CAACTCCAGT TGCGAGATCA GGGTGGTCTC CGTGGGCACG
GAGGGTGAGG CGGATGGCCT GCGCCAGGTG CTCACCTCCG GCGGCGTGGT CCGCGTGCTG
CAGCCCGGAC CCCGCCAGTG CCTGCACCTC GCCCCCGGGC GGCCCGACAT CGCCATCGAG
ACGGAGGCCT ACCAGCCATG A
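
The gene sequence is printed in blocks of ten bases, so it has to be joined into one string before any analysis. The following Python sketch, offered as one possible approach, strips the whitespace and computes the GC fraction, the quantity the record reports as "GC content". The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first two blocks of the sequence are used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two ten-base blocks of the gene sequence above.
first_blocks = "ATGGCCGATG ATGATCTGAC"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))  # 20 0.5
```

Passing the full gene sequence to the same two functions gives a value that can be compared with the GC content listed in the record.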
 
Protein sequence
MADDDLTRWN RAGLSRFRYL DGNAATFLEE LRAGLQARFP RWPAVAGEGP PEEDEREWRA 
RLERHYQADR DDLLWQIGRG FARASHVLGE HLDVYANEAT LGTAGEWENL RKLVAMLDYH
PRPPASAHTT LAVLAKKAGP LAAGFAVKHS PADGAPVVFE TLTDLDLDPV LNAVRPADHD
RNPAPLQGQY LELAGEHDKL TRGTPLVVED TRRGSSRAHL IQSVTLDEAR GVTRVRVSPR
LSSRYRVGDV RVHVLPKERL AVTGPVLKGA VLGHSLRLGD DTGDLKPGET LVLSNPGHKA
RFLRVDRVRP RLLSFKTPLG KTYLAGARLS RPVEVPVVRR AGVPWRRRIE AGDDKGKNLY
VVFVAGDWHR LQNQWVARRP SAVDTALRSF KVTRAHYQPV GVPPEADDSP AWEGYTALSL
VGDELDNNPQ YLLAVPESPG PWAPDPLLER AEGGVRDPLI SEHSKHAAPG DFAVLVCGGA
IAWARLGAVA EDEEGERTTH HAAGGAWRDS GGGPIHPDSS GAREGGPFYR DASQLFVHFT
ETVRLHDGQR NPTPLRGRTL PVSDPDGVLA ARLGQGHRLL LDNGSGATTA RVVKLEGGDP
LRLTLSEPLP DDSRHDNLVL YGNAVPAGHG SGKPEQALGS GDATERHPAF ELAVKDVSFV
ADPSQASGVR AAVEVTVDDR RWTQIANLKD AGPEDAVYTA RLTEDGTLQV RFGDGRHGRR
LPTGTNNVRI HYRQGVGTRG NLPPGSLTQP QRPHPRVASV RQPLPAGGGA DREPEADLRE
SAPATLLTLS RAVSLRDFAR LARAHASIWQ ANAFSRPTRR ERRESLEVVV VPAEGARLTS
ELRDQLTRYL GTHGVPGVDL RVEDYVPVVI GLDITLRIDL DAFDPEPVIE AVRAALEEAF
SLRRRRLGQP LYRGEVFQVV EGVRGVANSS CEIRVVSVGT EGEADGLRQV LTSGGVVRVL
QPGPRQCLHL APGRPDIAIE TEAYQP
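
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal sketch of that translation in Python follows, assuming the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares for internal codons (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
# Standard codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string from its first base, stopping at a stop codon."""
    seq = "".join(dna.split()).upper()  # tolerate the ten-base block layout
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence, then its last four codons.
print(translate("ATGGCCGATG ATGATCTGAC CCGCTGGAAC"))  # MADDDLTRWN
print(translate("TACCAGCCATGA"))                      # YQP
```

The two outputs match the first ten and the last three residues of the protein sequence above.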