Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0840 |
Symbol | |
ID | 4270777 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 952764 |
End bp | 955370 |
Gene Length | 2607 bp |
Protein Length | 868 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638125592 |
Product | hypothetical protein |
Protein accession | YP_741684 |
Protein GI | 114320001 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATCG CCCAGTGCCC CTTGCTCGCC GCCATCGTCC CCGTCCGGTA CGCCATCGGG GTCAACGGGG TGTCCAGCCC CTTCCTGGAG GACTTCGACC TGCCACCGTT GCAAGGCCGG CCGGTCACCG AGAGCCAGGG GTCTCCGGAC GAGGCGGCTC CGCTGCGTTA CGTCGCCCGG CCCCTTCGCA ATGGCTGGTG TTACGTCTGG TTAGACAGCC AGCAACGGCT GGTCGAATAC CGGGTGCGCG GCAGCGCGCT GGAGGAGACC GATCGGGCCG GCGCCCCCAT CGGGCCCACC GCCAGGGTCT GCATCTACGT ACCGGCGGGC GAGACGGCCG CCATCGCCTG GTCCCCGGTG CGCTGGAGCG ATGAGCAGTT CACCGCCATC GGGCGAGACG ACGGCAGGCG CCGGGCCGTG ATGCGCGAAT TCGTGCCCGG CCAGGGGCCA GCCAGCAGCG CCGATGCCTG GTCGAACGAG ATCCCTGAAC TCGGCGAGTT TGACGAAAGG GACTTCCACT GGAGTATCGA GCAGCCCGCC ACCCTGCCCG TATGGGAGGA CATCAAGCGT GCGGTGGACG ATGCCGAGCA GCACGCCACC GTGCTGGTGG ACGACCCCTG GGGGGTGGTC ATCGAACTGG CCCACCTGGT GCGCCAGGGC CAGGCGCAGC GCGCCGACTG GCTCGCCAAC GAAGGCGAGG AACGCATTCT GGCCGAGAAC ATCCTGGCCC TGGACCGTCA GGGCGAGGGC TTTCGCGGGC GACTCCCGCG GCTGGCCGAC CGCGACCGCC TCGAGCAGGC CATTCACCAC CATGGGCGGG AAATTAGGGC CATCGAGGAG AACCTCGATA CCCTGGCCGC CGACTGGGCG CGCTGGATGG GCACCCTCTG GGGGGGCGAA GGCCCCGAGT CCATGGCCTC CGCCCAGAGC CACTTCGACC CCTCGCTTGA CGAACACCAC GAGGCGATGG AAACCCTCTG GAGCGCGGCC CTGGGCGGTG TCACCCAATA CGAAACCGGT GCCACCCTGG CCACCAACCT GCTGGATCCC GATACCGGCC CCCCGGTCAT GCCCGGCGGC CACTCCCTGT GGACGGCACT GTTGGGGCGG TTAGAACCGG TGCAACTGGC CGATGTCCAG CGCCTGGTCG GGGTCAGCGA GTCCCTGCAG GCCCAGGACT GGGAAACCTG GGCCCACAGC CTCAACCACC TGGCCGGACA GCTGGGCCAC GGCCTGGCCA CGGCCCGCGA GGGGCTGTTC CTCGTGCTCG CCACCACCAT TGGGCCCATT CTCCGGGAAC AGGGCGCCAC CTCCGCCCAC CGGACCCTGA TCGCCGGCTA TCTCGCCGCC GCGCTGGCGC GCAGCCGGCA GCGCCTGAAA GTGGAGTCGG TCGCCGCGCG CGCCCTGTTG GACTGGATGA ACGAACCCAG CGCCCGCGCC GCCGGCGCCC CGTCGCCCCT GTACCAGATG CGCCCCGACC TGCTGCCCGA GCTCGACGCC CGGCAGGTCA TCACCATCCG CGTGGTCGCG GAGGGCGCTT CTTCCGCCGA GGGCAACCCC TTCCTGCAGC GCGCCCTGCA GGAGGCCCCG CTGAAAAGCC TGCTGGTGCT GATGAATGGG TTGGTGGTGG TTCATGCGGG CAGCCAATTG GCCGCCGGTG ATCGTTCCGT ACAGACCGCA ACAGCGGCGG CTGGGGGCGT ATCCGGCACC ATCAGCGCCA CCGCCGCCAC CATTCAGCAC TTCGCGCAGA TCCGCTCCGA CGACATCCTT GCCCGCCAGG GCCTGTCCGC CGGCTGGCGG GGCGCCTTCG ACCGCTACCT CCTCTGGGGT CAGGCCACCA ACCTGACTCT GTCGATCACT GCGGTCTTCG ACGCCGTCTA CTTCGGCTAC GGCGCCTGGG AGAGCGTCCG CCAGGGGGAT CGCCGATCGG GCACCATCCA GGCCGGCATG GCCACGGCCG CCGCCGGCCA GACCGCCGCC GGCGCCCACG CCTTCCACAC CTACCGCCAG GCCCGGCAAG CCCTGCTCGT GGGCCGGTCC ACCCAGGCCG CGCGCACCGC CGCCCGGGCC CGGATCGCAC CGGCCGTGAT CCTCGCCCTG ACGCTGGTCA TCGTCGCCGG CGCGGTCAGC CTGCGCTTCA CCCGCGACAA CGAACTGGAA CACTGGCTGC GCCGTACCCG CTTCGGCACC CACCCGGCCG ACTGGGCCGG CGACCTGGAC GAGGAACTCG GCCACCTCTA CCGGCTGCTC TACCAGCCAC GCATCCGGCT GGAGGCCCGC CAGGCCCGTA ACCCCCGCTC CGGCGATGTC TACCACTACC GTGTCCTGCT GGTCACCTTC CCCGGCGCCA CGCCCTTCCC GGGCATGTTC ACCCTGGAGG CCACCGAGCG CTGGCGCACC GGGCTGGTCC GGCATGACGA ACGCAGCCGG ACCATCACCG AAAAGGATCT CGACCTCGAC ATCGGCGGGG CGGAGGCGCC CGACGGCCCC GTCTACCGAC TCATCTACCA CAACACCCGG GAAGGCGACC GGCTGAGCTC CCTTTCCGGC ACCCTGTACT ATCGCCCGTT CCCCGATCTC ACCCTGCCTC CCATCCGGAT CAACTAG
|
Protein sequence | MTIAQCPLLA AIVPVRYAIG VNGVSSPFLE DFDLPPLQGR PVTESQGSPD EAAPLRYVAR PLRNGWCYVW LDSQQRLVEY RVRGSALEET DRAGAPIGPT ARVCIYVPAG ETAAIAWSPV RWSDEQFTAI GRDDGRRRAV MREFVPGQGP ASSADAWSNE IPELGEFDER DFHWSIEQPA TLPVWEDIKR AVDDAEQHAT VLVDDPWGVV IELAHLVRQG QAQRADWLAN EGEERILAEN ILALDRQGEG FRGRLPRLAD RDRLEQAIHH HGREIRAIEE NLDTLAADWA RWMGTLWGGE GPESMASAQS HFDPSLDEHH EAMETLWSAA LGGVTQYETG ATLATNLLDP DTGPPVMPGG HSLWTALLGR LEPVQLADVQ RLVGVSESLQ AQDWETWAHS LNHLAGQLGH GLATAREGLF LVLATTIGPI LREQGATSAH RTLIAGYLAA ALARSRQRLK VESVAARALL DWMNEPSARA AGAPSPLYQM RPDLLPELDA RQVITIRVVA EGASSAEGNP FLQRALQEAP LKSLLVLMNG LVVVHAGSQL AAGDRSVQTA TAAAGGVSGT ISATAATIQH FAQIRSDDIL ARQGLSAGWR GAFDRYLLWG QATNLTLSIT AVFDAVYFGY GAWESVRQGD RRSGTIQAGM ATAAAGQTAA GAHAFHTYRQ ARQALLVGRS TQAARTAARA RIAPAVILAL TLVIVAGAVS LRFTRDNELE HWLRRTRFGT HPADWAGDLD EELGHLYRLL YQPRIRLEAR QARNPRSGDV YHYRVLLVTF PGATPFPGMF TLEATERWRT GLVRHDERSR TITEKDLDLD IGGAEAPDGP VYRLIYHNTR EGDRLSSLSG TLYYRPFPDL TLPPIRIN
|
| |