Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0125 |
Symbol | |
ID | 4269818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 140831 |
End bp | 142432 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638124849 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_740970 |
Protein GI | 114319287 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain [TIGR03098] acyl-CoA ligase (AMP-forming), exosortase system type 1 associated |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTATAC CAATTTTGCT TCACGAGACC CTGCTGAGCA GCGCGGGGCG GGCCCCTTCA TCACCGGCGG TCAGCTACCG GGATTCTTCA CACGACTATG ACACCCTTGC CACTGCCTGC CAAAGCGTGG CCGCAGCGCT GCGCGAATGC GGCCTGCTGC GTCAGGAGCG GGTGGCCGTC TACCTCGACA AGCGACCCGA GACGGTGCAG GCCTTGTTCG GTGCGGCAAT GGCCGGAGGG GTCTTCGTCC CCGTCAATCC GCTGCTGAAG GCTGACCAGG TCGCCTACAT CCTGCGCGAC TGCAATGTCC GTATCCTGGT GACCACCGCT GACCGGCTCA AGGCGATCCG CCAGGCGCTG GTCCAGTGCC CCGACCTGCA CACCGTCCTG GTCGTGGGCC GTTCGGACAC CCTCGGCGAA GAGGAGCCTC CCTACCGGGT CCACGACTGG TACGACGTGG TCTCTACAGC AGGGCCGGCT CGGCCGCATC GCGTGATCGA CAGCGACATG GCCGCCATCC TCTATACCTC CGGCAGTACC GGTCAGCCCA AGGGGGTGGT GCTCTCCCAC CGCAATATGG TTGCCGGTGC CCAGAGCGTG GCCAGCTACC TGGACAACCG GCCCGAGGAC CACCTGCTGG CCGCACTGCC TTTCAGCTTC GATTACGGCC TCAGCCAGCT CACGACGGCC TTCTTGACCG GTGCGCAGGT GTCGCTGCTG AACTACCTGC TGCCGCGGGA TGTGATCAGA GCGGTGGAAA AGCAGGGAAT CACCGGCCTG GCGGCGGTCC CCCCATTGTG GATCCAGTTA GCACAGCTGG AGTGGCCGGA CGCGGCGCGC AAGACCCTGC GCTACGTGAC CAATTCCGGC GGTGCCATGC CCCTGAAGAC CTTGGAGGCG CTCCGGGCGC AACTGCCCGC TACCCGCTTC TACCTGATGT ACGGGCTCAC CGAGGCCTTC CGCTCGACGT ACCTGCCGCC GGAGGAGGTG GACCGACGGC CCGATTCCAT GGGCAAGGCC ATCCCCAATG CAGAGATCAG GGTGGTGCGT GAGGACGGCT CCCCGTGCGC CCCCGGCGAG CCGGGCGAGC TGGTGCACCG GGGGGCGTTG GTGGCCATGG GCTACTGGAA CGACCCGGAA CGGACCGCGC AGCGCTTCCG TCCGGCGCCG GGCCAGCCGG GCGGCTTGCC GCAAAAGGAA TTGGCGGTCT GGTCGGGCGA TACCGTACGC ATGGACGAGG AGGGGTTTCT CTACTTCATC GGTCGACGCG ATGAGATGAT CAAGACCTCG GGTTATCGGG TGAGCCCGAA CGAGGTGGAG GAAGCCGTCT ATGGCACCGG GCTTGTCGCC GAGGCCGCCG CGCTCGGGGT GCCGCACCCG GTGCTGGGGC ACGGCATCGT TTTGGTCGTG TTGCCGCGCT CCACCGGCGT GACCGCGGAG GAGCTTTTGA ATGCCCTGCG GCCCCGGGTA CCGGCATTCA TGTTGCCCGC TCACGTGGAG CTTCGGCAAC AACCCCTGCC GCGTAACGCC AATGGCAAGA TCGACCGCAA GGGGCTATCC ACCGACTTCA CCGACCTGTT TCAGGAGAAC GCTGCACCGT GA
|
Protein sequence | MAIPILLHET LLSSAGRAPS SPAVSYRDSS HDYDTLATAC QSVAAALREC GLLRQERVAV YLDKRPETVQ ALFGAAMAGG VFVPVNPLLK ADQVAYILRD CNVRILVTTA DRLKAIRQAL VQCPDLHTVL VVGRSDTLGE EEPPYRVHDW YDVVSTAGPA RPHRVIDSDM AAILYTSGST GQPKGVVLSH RNMVAGAQSV ASYLDNRPED HLLAALPFSF DYGLSQLTTA FLTGAQVSLL NYLLPRDVIR AVEKQGITGL AAVPPLWIQL AQLEWPDAAR KTLRYVTNSG GAMPLKTLEA LRAQLPATRF YLMYGLTEAF RSTYLPPEEV DRRPDSMGKA IPNAEIRVVR EDGSPCAPGE PGELVHRGAL VAMGYWNDPE RTAQRFRPAP GQPGGLPQKE LAVWSGDTVR MDEEGFLYFI GRRDEMIKTS GYRVSPNEVE EAVYGTGLVA EAAALGVPHP VLGHGIVLVV LPRSTGVTAE ELLNALRPRV PAFMLPAHVE LRQQPLPRNA NGKIDRKGLS TDFTDLFQEN AAP
|
| |