Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0007 |
Symbol | |
ID | 4269538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 9005 |
End bp | 10393 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638124734 |
Product | Alpha,alpha-trehalose-phosphate synthase (UDP-forming) |
Protein accession | YP_740856 |
Protein GI | 114319173 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0380] Trehalose-6-phosphate synthase |
TIGRFAM ID | [TIGR02400] alpha,alpha-trehalose-phosphate synthase [UDP-forming] |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.554922 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTTGA TCACCATATC GAACCGGGTT GCCCTGCCCA GCCAGCTCCA GGGGGCCCAA GGCGGACTGG CGGTGGGTCT GCGCAGTGCA CTGGAGGATG GCGGTGGGCT CTGGTTCGGT TGGGACGGTC GCCACGAACC CCGGTTACCC GATCCCCGCC CACTGGCGGT GCAGAACGCC AACGGCGTGC GCTACGCCAC CCTGCGCCTG ACTCGTGCCG AGTACGAGCG GTTCTATCTC GGCTTCTCCA ACCAGGTGCT CTGGCCCCTG TTCCATTACC GCTTGCCCTA TGTCCACTGC CAACGGGCGG ATCGGGAAGG GTACTGGGCC GTCAACCGCC TGTTCGCGGA TCGGCTGCCC GATCTACCGG ACGCCGAGTC GCCCATCTGG ATCCACGATT ACCACTTCAT TCCCCTGGGC CAGATGTTGC GGGAGCGGGG GATCACTAAC CCCCTGGGCT TTTTCCTCCA CACCCCCTTC CCGGCTTGGG ATGTCTACCG CGCATTGCCG GATCACCAGG CCCTGCTGGA AGCCCTCTGC GCCCATGACC TGGTGGGCTT CCAAACCGAC CTGGATCGTG ACAACTTCCT GGACTGTCTG CGCCACACCG ACGCCATCGA GGTGATCGAC ACGGCCCGGG CGCGCTACCG CGGCCGGGTG GTCCACATCG GCGTCTTCCC GATCGGTATC GACGTGGACT CTGTGGCCGC CCACGCCCGG CACGGCTTCT ACTCGCAGCA GGGACGCCGG TTGCAGGCCA GCCTTGGGGA CAGACGCCTG ATCATCGGTG TTGACCGCCT GGATTACAGC AAGGGGCTGG ATATCCGCTT CGAAGCCTAT CGGCGTATGC TGGAGCGCCG CCGGGAGCAC CGGGCGCGGG TCGTTTACCT GCAGATTGCG CCGATATCGC GGGGTGAGGT GCCGGAGTAC GACGAGATTC GCCGCGAACT GGAGTACCAG GCGGGTCACC TCAACTGCCA GTTCGCCGAG TACGACTGGG CCCCGCTGCG GTACCTGAAC CGCGGCTTCC ACCGGGCGAA CGTGCTCGGG TTTCTGTCCC GGGCCCATGT CGGGCTGGTC ACCCCTATGC GCGACGGTAT GAACCTCGTG GCCAAGGAGT TCGTGGCCGC CCAGGATCCG GACGACCCGG GTGTGTTGGT CCTCTCGCAT TTGGCCGGGG CGGCGCGCGA GCTGGACGCG GCGGTTCTGG TCAACCCCTA CGATCCCGAT GAGGTGGCGG AGCGGCTTCA CCAGGCGCTG ACCATGCCGT TAGCGGAGCG GCGTGAGCGT TGGACCCGCA TGATCCGGTG CCTGCGCGAG CAGGACATTA CCTGTTGGCG TGACGACTAC CTGCAGGCCC TGCAGACGGC GGCCGGTGCG TCCGGGTGA
|
Protein sequence | MSLITISNRV ALPSQLQGAQ GGLAVGLRSA LEDGGGLWFG WDGRHEPRLP DPRPLAVQNA NGVRYATLRL TRAEYERFYL GFSNQVLWPL FHYRLPYVHC QRADREGYWA VNRLFADRLP DLPDAESPIW IHDYHFIPLG QMLRERGITN PLGFFLHTPF PAWDVYRALP DHQALLEALC AHDLVGFQTD LDRDNFLDCL RHTDAIEVID TARARYRGRV VHIGVFPIGI DVDSVAAHAR HGFYSQQGRR LQASLGDRRL IIGVDRLDYS KGLDIRFEAY RRMLERRREH RARVVYLQIA PISRGEVPEY DEIRRELEYQ AGHLNCQFAE YDWAPLRYLN RGFHRANVLG FLSRAHVGLV TPMRDGMNLV AKEFVAAQDP DDPGVLVLSH LAGAARELDA AVLVNPYDPD EVAERLHQAL TMPLAERRER WTRMIRCLRE QDITCWRDDY LQALQTAAGA SG
|
| |