Gene Information |
Locus tag | Mlg_1743 |
Symbol | |
ID | 4270850 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1995414 |
End bp | 1998317 |
Gene Length | 2904 bp |
Protein Length | 967 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638126501 |
Product | hypothetical protein |
Protein accession | YP_742579 |
Protein GI | 114320896 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
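The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1995414
end_bp = 1998317
reported_gene_length = 2904      # bp
reported_protein_length = 967    # aa


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length):
    """Residue count for a complete CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


gene_length = span_length(start_bp, end_bp)
protein_length = expected_protein_length(gene_length)

print("gene length matches:", gene_length == reported_gene_length)
print("protein length matches:", protein_length == reported_protein_length)
```

Under those assumptions both reported lengths agree with the coordinates.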
Plasmid Coverage Information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0189343 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.985267 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACGC AGACCTCGCA ACCGTCCCAC CGCGACCACG GCCAGTACAG CGGCGTCTAC CTGCAGCAGG GGCGGATGAT CACCGACGCC GACTGGAACG CCCTCGGCGA GATCGGCCAG CGCCGGCTGG TGGGCGCCCT GTGGGATGCC ATCGCCAGCG GCGCTCCGCG CGAGGGCGGG CTGCGGTTGT CGGACGACAG CGGTCTGCGC CTGCACCCCG GCGTGCTTTA CGTGGGCGGC GTCCCCGCCC GGCTGACCGG CGACGGGCCG CTGGCGCCGG GGGAGCAGCC CGATTACCCG GATCCGCCCC CCTTCGACGG GCGGGACCTG ACGCTCTACG CCGACGTCTG GGAGCGTCCG GTCACGGCCC TGGAGGACCC GGCCCTGATG GATCCGGCGC TGCACGGCGC CGATACCAGC AGCCGGGGCG AGACCCTGCT GCAGGTGAAG TGGTGCCCGC GCGGGCTGGA CCCCGCCGAC CCGGCGGTCA ACCCGCCGCT GGGCGATGCC CCGCTGGCGC TGCGGCTGCG TCACATCGTC GTCGGCGATG ACCCCTGCGA CCCCTGCGCC AGCGAGATGA ACCTGGACGA GCGCATAGGC AACTACCTGT TCCGGGTGGA GGTGCACGAC CTTTTCCTGG ACGAGGCGGG TGAGCGGCAG CTGGTGCTGA AGTGGTCCCG CGACAACGGT GCCGAGGCCC ACCAGGCCGA TAACGTCCCG GAGGGCTTCG ACCGCGGCGA CTGGGTCTGG GAGTTCTTCG ACGAGGCCAG CGAGCGGACC CTGGGCCGGC ATTTCCCGGA GGACTACCGG CCCCGGCGCG GGCGCCTCAC CCCGGCGTTC GAGCGCCCCC CGGAGGGGGA GCCGCGGGCG TTCGTGCGCC AGTGGGACGG CTTTCTTCAG CTCAACCTGG ACCGGCCCGC CCTGGTCAGC GGCGTGGACC GGGGCGCCGA GCTGGACCCG GAACTCGACC CGGACGCCCA CGGCTGGGCG GGGATCGACG CCGGTGTGCT GGCCGTCAAC GGCGAACGCC TGGAACTCCG GCTCGCCTTC GCCGACCGCC AGTTCCTGCC CGGTGACTGC TGGCAGGCGG CGGTGCGCGA GGCGGTGCAG GGGCCCGGCG ACTACGTGCT CGGCGACGAG CATACCGGCG AACCGCCCCG CGGGGTGCGC CACCGCTACC TGCCTTTGGG CGAGCTGGAC GGCGACGGCG CGCTGGTGCC GCACGGCGAC GCGGAGCGCC GCCGTTTCAA CTTCCCGCCG CTCACCGACC TGGCCGCCGC CGACGTGGGC TTCAGCGAGC GCTGCGAAGC CCTCTATCGG GGCGCGGAGA ACGTCCAGCA GGCGCTGGAC GCGCTCTGCG ACATCGCCGC CGAGCACATC GCTTACCGGT TGCCCGATTG CGAGGGGGAG GAGGTGACGG TCAAGCGGCT GTTGGCCGAG GCCCTGGCTG AACGTTGGCC GGATATCGAC AGGGACGGCC GGCTGAGCGT GCGGGACATG CTCGACGGCT TGCTCTGTCA CCTGGACAGC GCGGCGCTGC CCCACGACGT GCCCGAGTGC CGCGATGGCC GCCGCAGCCT GCGCGAGCGG CTGCGGATTC CCGCAGGGCG GACCACCACC GCGGAACCGC TCAACCGGTT GCTTTGCGAT ACCACGGCCG ATCACCTCCC CCTGGGCCGC GAGACCGAAC TCTGTCCGGA CCTGGACCGG GAAGGCGTGG AGACGGTGCA GGACGCCCTC AACACCCTCT GCGGGCGCAG CGGTGGTGGC GGCTGCGCGC AGGTGGTGGA GCCGGGGGAG 
CTGGCTCGGA CGCTGAAGGT GCTGATCGAA GAGCGGCGGG AGAGCATATG GCTGTGTCTT AAACCCGGCG TGCACGAAAT CCCCGCGGAC CTGGCGCTGC ATACGGCCCG GCATATCCGG ATCAGTGGTG GCGGGTATCA CGCCTGTGGC ATCCGCCTGG AGGGTGGAGA GTGGGCGTTG CAGGCACCGC AGATCCAGCT ATTCGATCTC GGTATCGAAC TGCCCGACGA CGGCGAGAGC CGGGTCCACC TGAGCGGGGA CGACATTACC CTCCAGCGGG TACACGTGCT CGCCCGGGAG GCCGAGGGGG GCGGCGGCCA GCGCCGGCCA CTGCTCCATA TCGATGGGCA GCTTAAGGAA GGTATTGGGC TTATCCGCCT GGAGGACTGC CGGCTCACCC CGGTGGATCG GGGCTGGGCG CTGTTGCTGG AGCGGGTCTA CCAGGTCGCC TACATCCAGA ATAACCATAT CGACGGTCTC GTGCGCTACC GGCATGGGGT GGGCAAGCCG GTGGACCCGG CGAGCCAGCG CATCGACTAT GTGGACCTGC GCAGCGGAGA CACGCCCGAG CCGGTGGCCG ACGGGGATCG GGAAGGTGAC ACCGGGGGTG ATGTGCCCCG GCCCAGGCCG GGGACCGGCG GAGGCCCGGT GGTCCGGCCC ATCCCCATCC CGATCCGCGA CCCCATCCTG AACCGGCGGC TGCCGGACCG CGTCGCCGAC GCCGAGGGCA GCCTTCACGT CAACAACAAT TTCATACTGC GCTGGACCAG CGATATGGAA TCCGGCTTCG TTCGGGTGGA CGACCGGGAT CGGCGCTTCC TGGCCCGGGC GGTGACCGGT CCGGCCATCT TCACGGTCGC CCAGAACACC TTCGGCATGC GGTCCAGCTT CATTGGCGGC CGCCTTATGG CCCAGGGTAA TCAGCTGGTT GCGCTGGAGG ACGACAGCCG CAATACGCCG GCCCTGTGGT TGCTCTGCAA CCAACTGGTG GCCCATGGCC ACATCGGGCC CGGGCTGGAG GCGCGCCATA CGGCGGTGGA CGATGCGATC GGCAACAATC TGATGGCGTT CGCCTCACTC AACCAGAACG GGGACCAGAC CTGA
|
Protein sequence | MKTQTSQPSH RDHGQYSGVY LQQGRMITDA DWNALGEIGQ RRLVGALWDA IASGAPREGG LRLSDDSGLR LHPGVLYVGG VPARLTGDGP LAPGEQPDYP DPPPFDGRDL TLYADVWERP VTALEDPALM DPALHGADTS SRGETLLQVK WCPRGLDPAD PAVNPPLGDA PLALRLRHIV VGDDPCDPCA SEMNLDERIG NYLFRVEVHD LFLDEAGERQ LVLKWSRDNG AEAHQADNVP EGFDRGDWVW EFFDEASERT LGRHFPEDYR PRRGRLTPAF ERPPEGEPRA FVRQWDGFLQ LNLDRPALVS GVDRGAELDP ELDPDAHGWA GIDAGVLAVN GERLELRLAF ADRQFLPGDC WQAAVREAVQ GPGDYVLGDE HTGEPPRGVR HRYLPLGELD GDGALVPHGD AERRRFNFPP LTDLAAADVG FSERCEALYR GAENVQQALD ALCDIAAEHI AYRLPDCEGE EVTVKRLLAE ALAERWPDID RDGRLSVRDM LDGLLCHLDS AALPHDVPEC RDGRRSLRER LRIPAGRTTT AEPLNRLLCD TTADHLPLGR ETELCPDLDR EGVETVQDAL NTLCGRSGGG GCAQVVEPGE LARTLKVLIE ERRESIWLCL KPGVHEIPAD LALHTARHIR ISGGGYHACG IRLEGGEWAL QAPQIQLFDL GIELPDDGES RVHLSGDDIT LQRVHVLARE AEGGGGQRRP LLHIDGQLKE GIGLIRLEDC RLTPVDRGWA LLLERVYQVA YIQNNHIDGL VRYRHGVGKP VDPASQRIDY VDLRSGDTPE PVADGDREGD TGGDVPRPRP GTGGGPVVRP IPIPIRDPIL NRRLPDRVAD AEGSLHVNNN FILRWTSDME SGFVRVDDRD RRFLARAVTG PAIFTVAQNT FGMRSSFIGG RLMAQGNQLV ALEDDSRNTP ALWLLCNQLV AHGHIGPGLE ARHTAVDDAI GNNLMAFASL NQNGDQT
|
| |
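The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method, of how one might strip the spacing, compute GC content, and translate the gene sequence for comparison with the listed protein. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The amino-acid assignments are those of the standard genetic code, which translation table 11 shares (the two tables differ only in the start codons they permit). Only the first 30 bases are used here to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence and first block of the protein.
gene_excerpt = "ATGAAGACGC AGACCTCGCA ACCGTCCCAC"
protein_excerpt = "MKTQTSQPSH"

print(translate(gene_excerpt) == protein_excerpt)
print(f"GC fraction of excerpt: {gc_fraction(gene_excerpt):.2f}")
```

Running the same functions on the full gene sequence would give a GC fraction to compare with the reported 71% and a translation to compare with the full protein sequence.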