Gene Information |
Locus tag | Mlg_1745 |
Symbol | |
ID | 4270852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2000041 |
End bp | 2003001 |
Gene Length | 2961 bp |
Protein Length | 986 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 638126503 |
Product | hypothetical protein |
Protein accession | YP_742581 |
Protein GI | 114320898 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02243] conserved hypothetical protein, phage tail-like region |
|
|
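The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon (the record lists "Is gene spliced | No"). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(2000041, 2003001)
print(length_bp)                           # 2961
print(expected_protein_length(length_bp))  # 986
```

Under these assumptions, both results agree with the "Gene Length" and "Protein Length" rows in the record.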
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0343775 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGATG ATGATCTGAC CCGCTGGAAC CGCGCCGGCC TGAGCCGCTT CCGTTACCTG GACGGCAATG CCGCCACCTT CCTGGAGGAG CTGCGCGCCG GGCTGCAGGC GCGCTTCCCG CGTTGGCCGG CGGTGGCCGG GGAGGGACCC CCGGAGGAGG ACGAGCGCGA GTGGCGGGCC CGCCTGGAGC GACACTACCA GGCCGACCGG GACGACCTCC TGTGGCAGAT CGGGCGTGGC TTCGCCCGCG CCAGCCACGT GCTGGGCGAG CACCTGGACG TCTACGCCAA CGAGGCCACC CTGGGCACCG CCGGCGAGTG GGAGAACCTG CGCAAGCTGG TGGCCATGCT CGACTACCAC CCGCGGCCGC CGGCCTCCGC GCACACCACG CTGGCGGTAC TGGCCAAGAA GGCCGGGCCG CTGGCGGCCG GCTTTGCGGT CAAACACAGC CCGGCGGACG GTGCCCCGGT GGTGTTCGAG ACCCTGACCG ACCTGGACCT GGACCCGGTG CTCAACGCCG TGCGCCCCGC CGATCACGAC CGCAATCCGG CACCGCTGCA GGGGCAGTAC CTGGAACTGG CGGGCGAGCA CGACAAGCTG ACCCGGGGCA CCCCGCTGGT GGTGGAGGAC ACCCGGCGCG GCAGCAGCCG CGCCCACCTG ATCCAGTCGG TGACGCTGGA CGAAGCGCGC GGTGTCACCC GGGTCCGGGT GAGCCCGCGG CTTTCGTCCC GTTACCGGGT GGGCGATGTC CGGGTGCACG TCCTGCCCAA GGAGCGCCTG GCGGTTACCG GCCCGGTGCT GAAGGGCGCG GTGCTGGGGC ACAGTCTGCG GCTCGGCGAC GACACCGGAG ACCTCAAACC GGGGGAGACG CTGGTGCTCA GCAACCCGGG CCACAAGGCG CGCTTCCTGC GCGTCGACCG GGTCCGGCCC CGGCTGCTCA GCTTCAAGAC CCCCCTGGGC AAGACCTACC TGGCCGGCGC GCGCCTGTCC CGGCCGGTGG AGGTGCCGGT GGTCCGCCGT GCCGGGGTGC CCTGGCGGCG CCGCATCGAG GCCGGTGACG ACAAGGGCAA GAACCTCTAC GTGGTGTTCG TCGCCGGCGA CTGGCACCGC CTGCAGAACC AGTGGGTGGC CCGCCGCCCG TCGGCCGTGG ACACCGCCCT CCGTTCATTC AAGGTGACCC GCGCCCACTA CCAACCGGTG GGCGTGCCGC CGGAGGCCGA TGACTCGCCG GCCTGGGAGG GCTACACCGC CCTGAGCCTG GTGGGCGACG AACTGGATAA CAATCCGCAG TACCTGCTGG CGGTGCCCGA GTCACCCGGC CCCTGGGCCC CCGACCCCCT GCTTGAGCGG GCGGAGGGCG GGGTGCGCGA TCCGCTGATC AGCGAGCACA GCAAGCACGC GGCGCCGGGC GATTTCGCCG TGCTGGTCTG CGGCGGCGCG ATCGCCTGGG CCCGGCTGGG GGCGGTGGCC GAGGACGAAG AGGGCGAGCG CACCACCCAC CATGCCGCCG GCGGCGCCTG GCGGGACAGC GGCGGCGGCC CGATCCATCC CGACAGCTCG GGGGCGCGTG AGGGCGGGCC CTTCTACCGT GATGCCTCGC AGCTCTTCGT GCACTTCACC GAGACGGTGC GGCTGCACGA CGGGCAGCGC AACCCCACGC CCCTGCGCGG GCGCACCCTG CCGGTGTCTG ATCCGGACGG GGTGCTGGCC GCTCGCCTGG GCCAGGGGCA TCGCCTGCTG CTGGACAACG GCAGCGGGGC GACCACCGCC CGGGTGGTGA AACTGGAGGG CGGCGATCCC 
TTGCGCCTGA CCCTGTCCGA GCCGCTGCCG GACGACAGCC GCCACGACAA TCTGGTCCTG TACGGCAATG CGGTGCCGGC CGGCCACGGC AGCGGCAAAC CGGAACAGGC CCTGGGCAGT GGCGACGCCA CCGAGCGTCA CCCGGCCTTC GAGCTGGCCG TCAAGGATGT CAGCTTCGTC GCCGACCCCA GTCAGGCGAG CGGGGTGCGG GCGGCGGTGG AGGTCACGGT GGATGACCGG CGCTGGACGC AGATCGCCAA CCTCAAGGAC GCCGGGCCGG AGGACGCCGT CTACACCGCG CGCCTGACCG AGGATGGCAC CCTGCAGGTG CGCTTTGGCG ACGGGCGTCA CGGGCGCCGA CTGCCCACCG GGACGAATAA CGTGCGCATC CACTACCGCC AGGGCGTGGG CACCCGGGGC AACCTGCCGC CGGGCTCGCT CACCCAGCCC CAGCGCCCCC ACCCGCGGGT GGCGTCGGTG CGCCAGCCGC TGCCGGCCGG TGGTGGCGCC GACCGCGAGC CGGAGGCGGA CCTGCGGGAG AGTGCCCCGG CCACCCTGCT GACCCTGTCC CGGGCCGTCT CGCTGCGCGA TTTCGCCCGC CTGGCCCGGG CGCACGCCAG CATCTGGCAG GCCAACGCCT TCTCCCGGCC CACCCGGCGG GAGCGCCGGG AGAGCCTGGA GGTGGTGGTG GTGCCCGCCG AGGGGGCCCG CCTGACCAGT GAGCTGCGCG ACCAGCTCAC CCGCTACCTG GGTACCCACG GGGTGCCGGG GGTGGACCTG CGGGTGGAGG ATTACGTGCC GGTGGTGATC GGGCTCGATA TCACCCTGCG CATCGACCTG GATGCCTTCG ACCCGGAGCC GGTGATCGAG GCGGTGCGCG CCGCGCTGGA AGAGGCCTTC TCACTGCGAC GCCGGCGCCT GGGCCAGCCA CTCTACCGGG GCGAGGTCTT CCAGGTGGTG GAAGGGGTGC GGGGGGTGGC CAACTCCAGT TGCGAGATCA GGGTGGTCTC CGTGGGCACG GAGGGTGAGG CGGATGGCCT GCGCCAGGTG CTCACCTCCG GCGGCGTGGT CCGCGTGCTG CAGCCCGGAC CCCGCCAGTG CCTGCACCTC GCCCCCGGGC GGCCCGACAT CGCCATCGAG ACGGAGGCCT ACCAGCCATG A
|
Protein sequence | MADDDLTRWN RAGLSRFRYL DGNAATFLEE LRAGLQARFP RWPAVAGEGP PEEDEREWRA RLERHYQADR DDLLWQIGRG FARASHVLGE HLDVYANEAT LGTAGEWENL RKLVAMLDYH PRPPASAHTT LAVLAKKAGP LAAGFAVKHS PADGAPVVFE TLTDLDLDPV LNAVRPADHD RNPAPLQGQY LELAGEHDKL TRGTPLVVED TRRGSSRAHL IQSVTLDEAR GVTRVRVSPR LSSRYRVGDV RVHVLPKERL AVTGPVLKGA VLGHSLRLGD DTGDLKPGET LVLSNPGHKA RFLRVDRVRP RLLSFKTPLG KTYLAGARLS RPVEVPVVRR AGVPWRRRIE AGDDKGKNLY VVFVAGDWHR LQNQWVARRP SAVDTALRSF KVTRAHYQPV GVPPEADDSP AWEGYTALSL VGDELDNNPQ YLLAVPESPG PWAPDPLLER AEGGVRDPLI SEHSKHAAPG DFAVLVCGGA IAWARLGAVA EDEEGERTTH HAAGGAWRDS GGGPIHPDSS GAREGGPFYR DASQLFVHFT ETVRLHDGQR NPTPLRGRTL PVSDPDGVLA ARLGQGHRLL LDNGSGATTA RVVKLEGGDP LRLTLSEPLP DDSRHDNLVL YGNAVPAGHG SGKPEQALGS GDATERHPAF ELAVKDVSFV ADPSQASGVR AAVEVTVDDR RWTQIANLKD AGPEDAVYTA RLTEDGTLQV RFGDGRHGRR LPTGTNNVRI HYRQGVGTRG NLPPGSLTQP QRPHPRVASV RQPLPAGGGA DREPEADLRE SAPATLLTLS RAVSLRDFAR LARAHASIWQ ANAFSRPTRR ERRESLEVVV VPAEGARLTS ELRDQLTRYL GTHGVPGVDL RVEDYVPVVI GLDITLRIDL DAFDPEPVIE AVRAALEEAF SLRRRRLGQP LYRGEVFQVV EGVRGVANSS CEIRVVSVGT EGEADGLRQV LTSGGVVRVL QPGPRQCLHL APGRPDIAIE TEAYQP
|
| |
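The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of one way to do that and to compute a GC fraction; the helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence shown above.
example = "ATGGCCGATG ATGATCTGAC"
seq = clean_sequence(example)
print(seq)               # ATGGCCGATGATGATCTGAC
print(gc_fraction(seq))  # 0.5
```

Applying the same two functions to the full gene sequence would allow a comparison with the "GC content" row, assuming that value was computed over the gene itself.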