Gene Information |
Locus tag | Cagg_2094 |
Symbol | |
ID | 7267601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 2566073 |
End bp | 2569033 |
Gene Length | 2961 bp |
Protein Length | 986 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643566928 |
Product | maltooligosyl trehalose synthase |
Protein accession | YP_002463417 |
Protein GI | 219848984 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3280] Maltooligosyl trehalose synthase |
TIGRFAM ID | [TIGR02401] malto-oligosyltrehalose synthase |
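
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end coordinate must not precede start coordinate")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon (assumed present) encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Cagg_2094).
length = gene_length_bp(2566073, 2569033)
print(length)                     # 2961
print(protein_length_aa(length))  # 986
```

Under these assumptions the computed values agree with the Gene Length (2961 bp) and Protein Length (986 aa) fields.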
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGATG CAGAATTGCG TATTCCTCGT GCCACGTACC GCCTACAACT GAACGCCGAT CTCACGTTTA CCGATGTCGC CCGTTTGGTG CCTTACTTCG TTGATCTTGG AATCGGTGAT CTCTACTTTT CACCGATCTT GACCCCGCGA GCCGGCAGTC GTCACGGCTA CGATATTACC GATCATTCGC AAATCAACCC TGAACTCGGT GGCGAGGCCG GCTTCACCCA GCTTGCCGAG ACCTTACGCG CCCATGAACT CGGTCTGATC CTTGATGTCG TACCCAACCA CATGGGGATT GGCGATCCGC GGAACGTGTG GTGGCGCGAT GTCCTGGAAA ACGGGCCAAG CTCCATCTTT GCCCCGTACT TTGACATTGA CTGGGATCCG GTGCCGCCCG AATTGCACGG CAAAGTGCTG TTACCGGTGC TCGGTGATCA GTACGGGGTT ATTCTCGAAC GGGGTGAACT ACGTTTGTAC TACGACGACG ATGGCGGGTT CAGTCTGGGC TATTGGGAGC ACCGGTTTCC GTTGAATCCG CGGAGCTACG CCGACATTCT GACGCAACGG CTTGATGATC TCCTGAGTAA CCTCGGTAGC GATCACCCTG ATGCGATTGA GTTGCAGAGT ATCATCACTG CTATCGGTTA CCTGCCTTCA TGTCACGAGG TATCACCTGA ACGGATAATT GAGCGTAACC GCGAGAAAGA AGTGATTAAA CGTCGGATTG CGACGCTGGT CGCGAATAGT GAACCGGTAC GTCAGATGAT CGCGCAAGCA CTGGCCGACT ACAACGGTGA TCCGTCCGAT CCAAAAAGTT TTGACTTGCT TGATACGTTA CTTGCCCGTC AATCGTACCG GTTGGCGTTC TGGCGGGTAG CAACCGAAGA GATTAACTAC CGTCGCTTCT TTGACATCAA CGATCTGGCC GCCATCCGCG TCGAACTTCC CGATGTCTTA CAAGCTACGC ATGATCTGAT CATGCGCTTG TTGGCCGAGG GGATTGCGAC CGGCGCCCGC ATCGACCACC CCGATGGCCT CTGGCAACCG GCCACCTATT TTCGTCAATT GCAAGAGAGT TACCTACGGT ATGCCGCCGT ATTCCGCTTT GGAGGGAGCG CACCTGCCGA TCTCGATGAG CAGATCCGGC GACGACTCGC GCAGGCTGAG CGTGGTGAAC GGCCATGGCC GCTCTACGTA GTCGCCGAGA AGATTCTGAG CCACGGCGAA CCGTTACCCT CGGATTGGGC CGTCGCCGGA ACAACCGGCT ACGATTTTCT GAACCAGATT GGGGGCGTCT TGATCGACCG CAGTAGCCAG CGCGCACTCA ACCGACTGTA TAGCCAATTT GCCGGGCCGC AGCCCACTTT CGCCAATCTG GTCAATAGCA AAAAGAAAGA GATCATGCTC GTCTCGCTCG CCAGTGAAGT CAACACGCTT AGTCATCTGC TTGACCGGCT GGCCGAACGC ACGCGACGTT ACCGTGACTT CACCCTGAAC AGCCTGACGT TTGCTATCCG CGAGGTGATT GCAGGGATGC CGGTGTACCG TACCTACATC AGCTCTGATG GTGTTGTGAG CCAACGTGAT GAGCAGGCAA TCCGCGTGGC GGTGCGCGAG GCAAAGCGAC GTAACCCACG CACAGCGGCA CAGATCTTCG ACTTTATCGA GGATACATTG CTCTTGCGTA ATCTTGACCA CTTTGCGCCG GAGGTACGCG ATGATGTGGT ACGCTTCGTG ATGAAGTTCC AGCAACTCAG TGGGCCGGTG ATGGCGAAGG GCGTGGAAGA TACAGCATTT 
TATGTCTACA ATCGCCTGGT CGCATTGAAT GAGGTAGGTG GCCATCCCGA ACTTTTCGGC TGCGAAGTGA GTGAGCTACA CGCCGCCGCA CAAGAACGGC AGCGCCACTG GCCGCACAGT ATGGTCACCA CTTCTACCCA CGATACCAAG CGTAGCGAAG ATGTGCGCGC GCGGATTAGT GTCCTTAGCG AATTGCCCGA TGAATGGCAC CGACACGTGA TCCGCTGGAG CCGACTAAAC ACGGCTAAGC GCAGTACCAT CGAGGGTGGG ATGGCGCCGA GTCGTAATGA TGAATACTTG CTCTATCAAA CACTGGTCGG TACGTGGGAG TCGATGGATC AGCTTGAAAC CTTTACCCAG CGGATCGCTG CCTACATGGA GAAGGCGACC CGTGAAGCTA AGGTGAATAC GAGCTGGATC AACCCTAACG CCGATTATGA TGCTGCCGTC CAACGTTTTG TACGAGGTAT TCTTGATCCA CGTCGCTCGC GCCGTTTTCT CGATAGCCTC GATGCCTTCG CCCATCGGAT CGCCTTTTTT GGACGGTGGA ATAGTTTGAC CCAGACGATT GTTCGTCTCA CCACACCGGG TGTGCCCGAT CTTTACCAGG GATGCGAATT GTGGGATTTT AGTCTGGTTG ATCCGGATAA TCGGCGTCCG GTCGATTTTC AGCGTCGAGT AGCGCTCTTG GCCGATCTGC GTGCCCGACA GGCGGCCTGC GAGAAGGCTG CACTAGCCGA TGAGCTGTTG GCGTCGGCGG CAGATGGACG GATCAAGCTC TACACGATTG CTACGGCGCT TGATCTCCGT CGCCAACGCC CCGAACTCTT CAGTGCCGGT GAGTATCTAC CGCTGACGGC AAGTGGGCCT ACTGCCGAAC ACGTGATCGC CTTTGCGCGT CGGCATCCGA GTGCCGGTGA AGCGATCACG GTTGCACCGC GGCTCACGGC ACGCCTGAGT AACGGGCGTG AAGTGCCGCC GGTCGGTGCG CTGTGGGGCG AGACATGGTT GCCCTTGCCG CAGAGTACGC CGGGTAGCCG GTATCACAAC CTTTTCACCG GCGAACGTCT CGTTGTGACC GAGTACTCGG CAGCGCCGGG GCTGGCCCTC GCCGAGATAT TGCGGCGCTG GCCGATTGCC CTGTTGGTGC GTGAAGATTA G
|
Protein sequence | MIDAELRIPR ATYRLQLNAD LTFTDVARLV PYFVDLGIGD LYFSPILTPR AGSRHGYDIT DHSQINPELG GEAGFTQLAE TLRAHELGLI LDVVPNHMGI GDPRNVWWRD VLENGPSSIF APYFDIDWDP VPPELHGKVL LPVLGDQYGV ILERGELRLY YDDDGGFSLG YWEHRFPLNP RSYADILTQR LDDLLSNLGS DHPDAIELQS IITAIGYLPS CHEVSPERII ERNREKEVIK RRIATLVANS EPVRQMIAQA LADYNGDPSD PKSFDLLDTL LARQSYRLAF WRVATEEINY RRFFDINDLA AIRVELPDVL QATHDLIMRL LAEGIATGAR IDHPDGLWQP ATYFRQLQES YLRYAAVFRF GGSAPADLDE QIRRRLAQAE RGERPWPLYV VAEKILSHGE PLPSDWAVAG TTGYDFLNQI GGVLIDRSSQ RALNRLYSQF AGPQPTFANL VNSKKKEIML VSLASEVNTL SHLLDRLAER TRRYRDFTLN SLTFAIREVI AGMPVYRTYI SSDGVVSQRD EQAIRVAVRE AKRRNPRTAA QIFDFIEDTL LLRNLDHFAP EVRDDVVRFV MKFQQLSGPV MAKGVEDTAF YVYNRLVALN EVGGHPELFG CEVSELHAAA QERQRHWPHS MVTTSTHDTK RSEDVRARIS VLSELPDEWH RHVIRWSRLN TAKRSTIEGG MAPSRNDEYL LYQTLVGTWE SMDQLETFTQ RIAAYMEKAT REAKVNTSWI NPNADYDAAV QRFVRGILDP RRSRRFLDSL DAFAHRIAFF GRWNSLTQTI VRLTTPGVPD LYQGCELWDF SLVDPDNRRP VDFQRRVALL ADLRARQAAC EKAALADELL ASAADGRIKL YTIATALDLR RQRPELFSAG EYLPLTASGP TAEHVIAFAR RHPSAGEAIT VAPRLTARLS NGREVPPVGA LWGETWLPLP QSTPGSRYHN LFTGERLVVT EYSAAPGLAL AEILRRWPIA LLVRED
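
The gene and protein sequences above are displayed in space-separated blocks of 10 characters. As a minimal sketch, assuming the sequence has been copied as plain text, the code below strips the spacing, computes GC content, and translates the DNA. Translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons, so the standard codon table is used here. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the example uses only the first 30 bases of the record.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Simplification: an alternative start codon (e.g. GTG or TTG) is not
    converted to Met here; the gene in this record starts with ATG.
    """
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
prefix = "ATGATTGATG CAGAATTGCG TATTCCTCGT"
print(translate(prefix))  # MIDAELRIPR
```

The translated prefix matches the first block of the protein sequence listed above. Applying `gc_fraction` to the full gene sequence would give a value to compare against the GC content field.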