Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0041 |
Symbol | |
ID | 4270910 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 43278 |
End bp | 46577 |
Gene Length | 3300 bp |
Protein Length | 1099 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638124767 |
Product | hypothetical protein |
Protein accession | YP_740889 |
Protein GI | 114319206 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACCG ACATGGAAGC CGCCAACCGC GTCGCGGAGA GCCGGAACGA TAACGACCCG GGCCAAGAGC AGGGGCTCTG CCCTCTTACG CGCGGTGAGC TGCAGCTGGT CCCGGTGCGG TATGCCCTGG TCGAGGCGCC CGAGGCGGAT CGGGCCTCGG GCACGCCGGG CTTCCGGCCC GTTCATGACG GCAGTTTCCG TCGCTGTGGC GTACGGCCGG TACGGGAAGG CTGGCTCTAC CTGGTACACA GCTCAACGCC CGATGAGCTG CAGGTCTTCG AGGTCAAACC CGATGGCAGC GGCGATGCGA TCATCGTCGA GCGCGAGGGC AGCATCCAGG TGCTGTTCAG TCCCTTGGCG CTGACGGCCG TGCACCAATC GATGCTGCTT AAACCTGCCT TCCGCGATCA GGTGATGACG CGGGTCAACG TTGGCGCCTA CTGTCCGGGG GCGGGTACCG CTCACCTGCT CGATCCCGAT GCCCTGGCCG ACGTCCTGGC GGACGACCAT GGCGAACACC GCGCGACCCC CAGCGCCCCG ACCGAGCAGG ACGGCGACCC GCTCGACCCG GGCAGCTACG CCTGGTGCGA CGCGGAAGGC GAACACGCCG AGTGGCAGCG TGCGCGCGCC GCCGAGATCA AGGCGGCTAT CCAGGGGGGC TTTCAGCAAG ACAGCGCCTG CCTGGTAGTG GACGATATCG CCGGACGTAT CAAGGACCTC GCCCAGGCGT GGGCCTGCCT GGCGGAACAA CAGGGCCAGT GGGTCGACGA GAACGCGGTG GCGCTCTTCT CGGCACGAAC CATCGAAGGC CTGATGTCGT TGAACCTGGC GCCTCACATT GCCAATGCAG GCGACGGCAA TATTCCTGAG TGGCTGAGCG AAGCCAGTTA TGCCGAGCGT AACGACCTGG AGTCGCTCGC CGAGCTCTAT TCGCAGCATC GCGAAGCCAT AGAGCAGGTC ACTGCCCGTT CAGGCGGCCA CCCGGGCTTC ATTGGCGGGG CGCTGGCGCC CAACGGCATC GTCGACGTGA TCCGCCGGCA GGCGATGGAA GACTTCCTGG CCTATGCCAC CCCGATGCAG GCGCAGTGGC AGAGCGAGTA TCACCGTATC GGGGCCGACC TGGCCGCCAT ACTGCCCACC TGGCACACCC AGGCCCTGCT GCTCGACCGC GAAGAGGAGC ATCACATCCT GCTGACCTGC CTGCTGGAAA AGCAGGCCGT CGAGACCCTG CTGGCCTGCG GGCAGGAGGA CTTCCTGTCG AGCTACTACG CCGGCGACGA CCCGGTACCG GCACACCTGA TGCATTACGT CCCCACCGCG TCCTTCGTGG AGGGATTCCT TTCCCAAAAC ACTGGCCTCC AGAAAGCGCT CACCCAGGCG TCCGCACTGA TGGGCGCCCA GGGCGCACTC AGCCGCTACC AGCAGTGGCG GGGCGAGGTT GAACATCAGA CCGGGCTGCG CTTTCGCAGC GTCGAGGGCC TCTCCGACGA AGCCCGAACC GCCATCGCGG GCGAAGTGCA AATCAAGGAG AAGCTGCTGG GGCAGGCGGT GCTCGGGACG TTGCTCGATG ACGTACAGGA TGTCGACCTG GGGCAGCGCA TCACTACCCT GGCTTCGCGC TTGCCCGATG GGCAACGGCT GATGTTTGCC GAACGTCTGG GGCTGCTCGA GCTCGGCTGG GCCATCCCCG ACCGCTCCGT ACTGGGCCGC ATCCAGCAAG CACTGGACCA GGCGGACTCG GCCGTGGCCA ACCTGGCAAC CCTTGAGCGC AGGCTCGAGC AGCTCTACCA GGAGCGCCAG GTCGAGTTGG CACGGGCGTC CAGGCGCGGC ACCAGCCAGG CGCATCGCCG GGCCGCCGAC CGGTTCAATG CCCGCAAAAT CCGCGAAAGC CAGGCCGACA TCAACCGGCA CAGGCGCCTG CTGGGCGAGG CATTCGACGC CCTGGCCGAA CACAGCTTCC CCGTCGACAA GGCCAATGGC CACGCCGTCC AGGTCGGCGG CCTGAGCCTG GCCGCCACCC GTGCCGCCCT GGCCGAGCGT GCCGCCGACC GGGCCGTTGC CCGTCAACCA GCCGCGACCC TGAACGACAC CTTGAACACC CTGGTGCGCG ACGAACAGGG CGAGCTGTCG GCGAGTCGGA GCCTGGGGGT GCTGGTTAAT GGGAGCCTGT CGATGCTGGG TATGATCATG ACCGGCGTGG CCCTGCGCAA TACGCTTGAT GCCTGGGGCG AAGAACGCCT CATGGAGCAC GCCTTCGCGA CGGGCTCTCA CGCGACAGGG GCGGCAGCGA GCATCATGGC AATACGAGAA ATGATCATCG ACGCCCGCCA TCGCAACCTC TACCAGGGGC GGGGCTTTCA GCAGGTGGCC TTGGCCGAGA CCCGCGCCGC CGCCGGAAGC CCCGCGCAAC TGGAGCGCTG GGCAAGGGTT GCCAATGGGG CGATGGGGGC CGTGGCCCTA CTGGGTGGCT TTGCAGGTCT CTTAGAAACC TACAAGCAGT ACAAACGAAT GCAGGGATCA GAAACCCAGG CGGAGCGACT GGCCTTACAG GTTGCCTTTA CAGGTGCGGC CGGAGTCGCC GGTGGTGGCT TCTTAATCGG CGGCATGAGC GCGCTTGGCA GGATGCTGGG CAAGCCCGCA GTCGCCTGGC GACTGCTGCT GCTGAAATTC GCCGGCCCCG CCGGCTGGGT GGTGGCGGTT GGCACCGCCC TGCTGATCAT CGGCGAGGTG CTGGCCAATC GCTTCTCGTT GAGCCCTGTG CAGCGCTGGT GCCAGCGCAG TCACTGGGGG CGAGAAGATC AGGGCTGGGA TCGCGAGGCC CACGAGCGGG AACTGGCCCG ACTTGGCGAT ACCGATCTCA CGGTGGAACG GCAGGGGCAG GCCGAGCCCC ATGGCGGCCC GGGGCCCGGG CCGGCAGGCA CCGACCTCGC CATACGCATT GGCTTGCCCG GGCTTGACGC CCCCAATGCG GAAAACCTCG CGCTGGGCCT CTGGGGCGTC ACCCCTCGCC TCAAGGAAAT GACCCGAGAC TTTCTCGAAC ATGCCGAGCT CGAAAACCAG GGCTCGAGCT ATGCCCTGCA CTACCATTTC GATCCCGAAA CATTGGCCGA ATGCCACGAG TTCCGCCTCG TCATCCGCAC GAAGGGCCCC GAAGCATCCA CCACCCGGGT CTTCCAGTTG CATCGCCGCG GCACATCGCT CTCCGATGAG TGGAGGGAGA TCTCCGCCCT CGGCGATCGT TTCCTCACGC GGTACCAAGT GGGCAACTGG CCGGACATGC CCCTGACGCC CTGGCCGTGA
|
Protein sequence | MSTDMEAANR VAESRNDNDP GQEQGLCPLT RGELQLVPVR YALVEAPEAD RASGTPGFRP VHDGSFRRCG VRPVREGWLY LVHSSTPDEL QVFEVKPDGS GDAIIVEREG SIQVLFSPLA LTAVHQSMLL KPAFRDQVMT RVNVGAYCPG AGTAHLLDPD ALADVLADDH GEHRATPSAP TEQDGDPLDP GSYAWCDAEG EHAEWQRARA AEIKAAIQGG FQQDSACLVV DDIAGRIKDL AQAWACLAEQ QGQWVDENAV ALFSARTIEG LMSLNLAPHI ANAGDGNIPE WLSEASYAER NDLESLAELY SQHREAIEQV TARSGGHPGF IGGALAPNGI VDVIRRQAME DFLAYATPMQ AQWQSEYHRI GADLAAILPT WHTQALLLDR EEEHHILLTC LLEKQAVETL LACGQEDFLS SYYAGDDPVP AHLMHYVPTA SFVEGFLSQN TGLQKALTQA SALMGAQGAL SRYQQWRGEV EHQTGLRFRS VEGLSDEART AIAGEVQIKE KLLGQAVLGT LLDDVQDVDL GQRITTLASR LPDGQRLMFA ERLGLLELGW AIPDRSVLGR IQQALDQADS AVANLATLER RLEQLYQERQ VELARASRRG TSQAHRRAAD RFNARKIRES QADINRHRRL LGEAFDALAE HSFPVDKANG HAVQVGGLSL AATRAALAER AADRAVARQP AATLNDTLNT LVRDEQGELS ASRSLGVLVN GSLSMLGMIM TGVALRNTLD AWGEERLMEH AFATGSHATG AAASIMAIRE MIIDARHRNL YQGRGFQQVA LAETRAAAGS PAQLERWARV ANGAMGAVAL LGGFAGLLET YKQYKRMQGS ETQAERLALQ VAFTGAAGVA GGGFLIGGMS ALGRMLGKPA VAWRLLLLKF AGPAGWVVAV GTALLIIGEV LANRFSLSPV QRWCQRSHWG REDQGWDREA HERELARLGD TDLTVERQGQ AEPHGGPGPG PAGTDLAIRI GLPGLDAPNA ENLALGLWGV TPRLKEMTRD FLEHAELENQ GSSYALHYHF DPETLAECHE FRLVIRTKGP EASTTRVFQL HRRGTSLSDE WREISALGDR FLTRYQVGNW PDMPLTPWP
|
| |