Gene Cagg_2094 details

Gene Information

Locus tag: Cagg_2094
Symbol:
ID: 7267601
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2566073
End bp: 2569033
Gene Length: 2961 bp
Protein Length: 986 aa
Translation table: 11
GC content: 57%
IMG OID: 643566928
Product: maltooligosyl trehalose synthase
Protein accession: YP_002463417
Protein GI: 219848984
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3280] Maltooligosyl trehalose synthase
TIGRFAM ID: [TIGR02401] malto-oligosyltrehalose synthase
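
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function name `cds_lengths` is illustrative and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(2566073, 2569033)
print(gene_len, protein_len)  # 2961 986
```

Under these assumptions the result agrees with the listed gene length (2961 bp) and protein length (986 aa).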


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGATG CAGAATTGCG TATTCCTCGT GCCACGTACC GCCTACAACT GAACGCCGAT 
CTCACGTTTA CCGATGTCGC CCGTTTGGTG CCTTACTTCG TTGATCTTGG AATCGGTGAT
CTCTACTTTT CACCGATCTT GACCCCGCGA GCCGGCAGTC GTCACGGCTA CGATATTACC
GATCATTCGC AAATCAACCC TGAACTCGGT GGCGAGGCCG GCTTCACCCA GCTTGCCGAG
ACCTTACGCG CCCATGAACT CGGTCTGATC CTTGATGTCG TACCCAACCA CATGGGGATT
GGCGATCCGC GGAACGTGTG GTGGCGCGAT GTCCTGGAAA ACGGGCCAAG CTCCATCTTT
GCCCCGTACT TTGACATTGA CTGGGATCCG GTGCCGCCCG AATTGCACGG CAAAGTGCTG
TTACCGGTGC TCGGTGATCA GTACGGGGTT ATTCTCGAAC GGGGTGAACT ACGTTTGTAC
TACGACGACG ATGGCGGGTT CAGTCTGGGC TATTGGGAGC ACCGGTTTCC GTTGAATCCG
CGGAGCTACG CCGACATTCT GACGCAACGG CTTGATGATC TCCTGAGTAA CCTCGGTAGC
GATCACCCTG ATGCGATTGA GTTGCAGAGT ATCATCACTG CTATCGGTTA CCTGCCTTCA
TGTCACGAGG TATCACCTGA ACGGATAATT GAGCGTAACC GCGAGAAAGA AGTGATTAAA
CGTCGGATTG CGACGCTGGT CGCGAATAGT GAACCGGTAC GTCAGATGAT CGCGCAAGCA
CTGGCCGACT ACAACGGTGA TCCGTCCGAT CCAAAAAGTT TTGACTTGCT TGATACGTTA
CTTGCCCGTC AATCGTACCG GTTGGCGTTC TGGCGGGTAG CAACCGAAGA GATTAACTAC
CGTCGCTTCT TTGACATCAA CGATCTGGCC GCCATCCGCG TCGAACTTCC CGATGTCTTA
CAAGCTACGC ATGATCTGAT CATGCGCTTG TTGGCCGAGG GGATTGCGAC CGGCGCCCGC
ATCGACCACC CCGATGGCCT CTGGCAACCG GCCACCTATT TTCGTCAATT GCAAGAGAGT
TACCTACGGT ATGCCGCCGT ATTCCGCTTT GGAGGGAGCG CACCTGCCGA TCTCGATGAG
CAGATCCGGC GACGACTCGC GCAGGCTGAG CGTGGTGAAC GGCCATGGCC GCTCTACGTA
GTCGCCGAGA AGATTCTGAG CCACGGCGAA CCGTTACCCT CGGATTGGGC CGTCGCCGGA
ACAACCGGCT ACGATTTTCT GAACCAGATT GGGGGCGTCT TGATCGACCG CAGTAGCCAG
CGCGCACTCA ACCGACTGTA TAGCCAATTT GCCGGGCCGC AGCCCACTTT CGCCAATCTG
GTCAATAGCA AAAAGAAAGA GATCATGCTC GTCTCGCTCG CCAGTGAAGT CAACACGCTT
AGTCATCTGC TTGACCGGCT GGCCGAACGC ACGCGACGTT ACCGTGACTT CACCCTGAAC
AGCCTGACGT TTGCTATCCG CGAGGTGATT GCAGGGATGC CGGTGTACCG TACCTACATC
AGCTCTGATG GTGTTGTGAG CCAACGTGAT GAGCAGGCAA TCCGCGTGGC GGTGCGCGAG
GCAAAGCGAC GTAACCCACG CACAGCGGCA CAGATCTTCG ACTTTATCGA GGATACATTG
CTCTTGCGTA ATCTTGACCA CTTTGCGCCG GAGGTACGCG ATGATGTGGT ACGCTTCGTG
ATGAAGTTCC AGCAACTCAG TGGGCCGGTG ATGGCGAAGG GCGTGGAAGA TACAGCATTT
TATGTCTACA ATCGCCTGGT CGCATTGAAT GAGGTAGGTG GCCATCCCGA ACTTTTCGGC
TGCGAAGTGA GTGAGCTACA CGCCGCCGCA CAAGAACGGC AGCGCCACTG GCCGCACAGT
ATGGTCACCA CTTCTACCCA CGATACCAAG CGTAGCGAAG ATGTGCGCGC GCGGATTAGT
GTCCTTAGCG AATTGCCCGA TGAATGGCAC CGACACGTGA TCCGCTGGAG CCGACTAAAC
ACGGCTAAGC GCAGTACCAT CGAGGGTGGG ATGGCGCCGA GTCGTAATGA TGAATACTTG
CTCTATCAAA CACTGGTCGG TACGTGGGAG TCGATGGATC AGCTTGAAAC CTTTACCCAG
CGGATCGCTG CCTACATGGA GAAGGCGACC CGTGAAGCTA AGGTGAATAC GAGCTGGATC
AACCCTAACG CCGATTATGA TGCTGCCGTC CAACGTTTTG TACGAGGTAT TCTTGATCCA
CGTCGCTCGC GCCGTTTTCT CGATAGCCTC GATGCCTTCG CCCATCGGAT CGCCTTTTTT
GGACGGTGGA ATAGTTTGAC CCAGACGATT GTTCGTCTCA CCACACCGGG TGTGCCCGAT
CTTTACCAGG GATGCGAATT GTGGGATTTT AGTCTGGTTG ATCCGGATAA TCGGCGTCCG
GTCGATTTTC AGCGTCGAGT AGCGCTCTTG GCCGATCTGC GTGCCCGACA GGCGGCCTGC
GAGAAGGCTG CACTAGCCGA TGAGCTGTTG GCGTCGGCGG CAGATGGACG GATCAAGCTC
TACACGATTG CTACGGCGCT TGATCTCCGT CGCCAACGCC CCGAACTCTT CAGTGCCGGT
GAGTATCTAC CGCTGACGGC AAGTGGGCCT ACTGCCGAAC ACGTGATCGC CTTTGCGCGT
CGGCATCCGA GTGCCGGTGA AGCGATCACG GTTGCACCGC GGCTCACGGC ACGCCTGAGT
AACGGGCGTG AAGTGCCGCC GGTCGGTGCG CTGTGGGGCG AGACATGGTT GCCCTTGCCG
CAGAGTACGC CGGGTAGCCG GTATCACAAC CTTTTCACCG GCGAACGTCT CGTTGTGACC
GAGTACTCGG CAGCGCCGGG GCTGGCCCTC GCCGAGATAT TGCGGCGCTG GCCGATTGCC
CTGTTGGTGC GTGAAGATTA G
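
The gene sequence is printed in space-separated groups of 10 bases, so it needs to be joined before any analysis. As a hedged sketch, the helpers below (`clean_sequence` and `gc_percent` are hypothetical names) strip the grouping and compute GC content as a percentage; the full sequence could be pasted in the same way to compare against the reported GC content.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, as printed in this record.
first_line = "ATGATTGATG CAGAATTGCG TATTCCTCGT GCCACGTACC GCCTACAACT GAACGCCGAT"
seq = clean_sequence(first_line)
print(len(seq))          # 60 bases in one full line
print(gc_percent(seq))   # GC percentage of this line only
```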
 
Protein sequence
MIDAELRIPR ATYRLQLNAD LTFTDVARLV PYFVDLGIGD LYFSPILTPR AGSRHGYDIT 
DHSQINPELG GEAGFTQLAE TLRAHELGLI LDVVPNHMGI GDPRNVWWRD VLENGPSSIF
APYFDIDWDP VPPELHGKVL LPVLGDQYGV ILERGELRLY YDDDGGFSLG YWEHRFPLNP
RSYADILTQR LDDLLSNLGS DHPDAIELQS IITAIGYLPS CHEVSPERII ERNREKEVIK
RRIATLVANS EPVRQMIAQA LADYNGDPSD PKSFDLLDTL LARQSYRLAF WRVATEEINY
RRFFDINDLA AIRVELPDVL QATHDLIMRL LAEGIATGAR IDHPDGLWQP ATYFRQLQES
YLRYAAVFRF GGSAPADLDE QIRRRLAQAE RGERPWPLYV VAEKILSHGE PLPSDWAVAG
TTGYDFLNQI GGVLIDRSSQ RALNRLYSQF AGPQPTFANL VNSKKKEIML VSLASEVNTL
SHLLDRLAER TRRYRDFTLN SLTFAIREVI AGMPVYRTYI SSDGVVSQRD EQAIRVAVRE
AKRRNPRTAA QIFDFIEDTL LLRNLDHFAP EVRDDVVRFV MKFQQLSGPV MAKGVEDTAF
YVYNRLVALN EVGGHPELFG CEVSELHAAA QERQRHWPHS MVTTSTHDTK RSEDVRARIS
VLSELPDEWH RHVIRWSRLN TAKRSTIEGG MAPSRNDEYL LYQTLVGTWE SMDQLETFTQ
RIAAYMEKAT REAKVNTSWI NPNADYDAAV QRFVRGILDP RRSRRFLDSL DAFAHRIAFF
GRWNSLTQTI VRLTTPGVPD LYQGCELWDF SLVDPDNRRP VDFQRRVALL ADLRARQAAC
EKAALADELL ASAADGRIKL YTIATALDLR RQRPELFSAG EYLPLTASGP TAEHVIAFAR
RHPSAGEAIT VAPRLTARLS NGREVPPVGA LWGETWLPLP QSTPGSRYHN LFTGERLVVT
EYSAAPGLAL AEILRRWPIA LLVRED
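
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The sketch below is a minimal, dependency-free illustration: it builds the codon table from the compact NCBI amino-acid string (tables 1 and 11 share the same codon-to-amino-acid assignments and differ in permitted start codons) and drops a terminal stop. The function name `translate_cds` is hypothetical, and alternative start codons are not handled, which is adequate here only because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (NCBI tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS codon by codon and strip one terminal stop codon."""
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene and its last 15 bases, as printed in this record.
print(translate_cds("ATGATTGATG CAGAATTGCG TATTCCTCGT"))  # MIDAELRIPR
print(translate_cds("GTGCGTGAAGATTAG"))                   # VRED
```

Both fragments agree with the start (MIDAELRIPR) and end (VRED) of the protein sequence listed above.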