Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2763 |
Symbol | |
ID | 4072385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3269935 |
End bp | 3272958 |
Gene Length | 3024 bp |
Protein Length | 1007 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637984780 |
Product | malto-oligosyltrehalose synthase |
Protein accession | YP_591838 |
Protein GI | 94969790 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3280] Maltooligosyl trehalose synthase |
TIGRFAM ID | [TIGR02401] malto-oligosyltrehalose synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.200258 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAAGCA GCCCCACGAT TTTTGCCGAA CAGAAGAGTG CCTTGGCCGA ATGTATGGAG CGGATTGCCG CTGGTAAGCG GCAGCTGCGT CCAACTTCGA CCTACCGCCT CCAATTTCAT TCCAATTTCA GGTTCACAGA CGCCGAGCAG CTCATCGGCT ATCTGCACGA ACTCGGCATC TCGCATTGTT ATGCCTCGCC GATCTTAAAG GCCCGCGCCG GAAGTACACA CGGCTACGAC ATCACCGATC ACAACTCGCT CAACCCGGAG ATCGGGACGG AAGAGGAGTT TCACCAGCTC TCAACGAAGC TGAAAGAGCA CGGGATCGGA TTCATCCTCG ACGTGGTGCC CAACCACATG GGTGTAGGCA CCGGCGAGAA CCGCTGGTGG CAGGATGTTC TCGAAAACGG CCGCGCCAGC GAGTTCGCCG ACTACTTCGA TATTGACTGG AACCCGCTCA AGCCGGAGCT CCGCAACAAG CTGCTGCTCC CGATCCTCGG CAACTACTAC GGCGATGAAC TGGAGGCCGC CCGGATTAAG TTGTCGCTGC ACGACGGCCT CATCGTCTTC CTTTACTACG AACGAGTCCT GCCCGTGGAT CCTCAGACGA TTCCCATGAT CTACGGCGCG CTCGGAGACC TCCGCCAGCG CCAGGGCCAT CGCATGCCCG AGCTGATCGC GGTTCTCGAA GAGCTGCGCG GATATCCGCC GAACTGGACC GAAGATCACG ACCTTGTCCT CACCCGCCAG CGCGGACTGC CAAATGTCGT AGAGCGACTC TCGGAACTGA TTGCCGGCAG CGAGTCCGTC CGACAGGCCA CTGAGGATGC CATGGCGATC CTCAACGGCG AGGTTGGCGA CACCCGCAGC TTCGACGGTC TCCATCGTCT GCTGGAAGCC CAGGCCTACC GCCTGGCGTT TTGGCGTGTG AGTGGCGAGG AGATCAATTA TCGCCGCTTC TTCGACATCA ACGACCTCGT CGCCATCCGC ATGGAAAACC CGCGCGTTTT CGCCGACACC CATCGCCTGA TTCGCAAGCT GCTCGCAAAC GGCGACGTCA CCGGCCTGCG CCTCGACCAT CCTGACGGCC TGTTTAATCC GCTGCAATAT TTCGTGCGCG CGCAAATGCT CTACACCGCG AGCCAATGCA ACGGCGCCAC TCCGGAAGGC GAGCTCGCGG AAAACGGCAT CGAGCGCGAA ATCCAGAGCG CCTTCGGACA GCGCGACTGG GGCGGTCCGA GCGCACCGCT CTATTTATTG GTCGAGAAAA TTCTCGAACC CGGCGAGCAC CTTCCCGTCG AATGGCCGGT GGACGGCACC GTTGGCTACG ACTTCGCCAA TCTCGTCAAC GGCGTATTGA TTGATCCCGC AGGCGAGAAG CCGCTCACCC AGCTCTACCA TCGCGTGCTG GAGCGCACGG TCGACATCGA CGACCTGATC TACGACAGCA AGAAGCTGAT CATGGACACC GCGTTGGCGA GCGAGATCAA CGTGCTCACC CACATGCTCG ACGACATCTC CGGCCGCGAT CGCCGCGCCC GCGATTACAC CCGCAATGTG CTCTCCGACG CCATCCGCGA AACCATCGCC TGCTTCCCTG TCTATCGAAC CTACATAGAT GAGCGCGGCA ACATGAACGC GCGCGATCGT GAACAGATTG ACAAAGCCAT TGTCACCGCG AAACGCCGCA ACGAAGGCAT GGCTGCCGGC GTCTTCGATT TTTTGCGCGA CATCCTTCTG CTCGAAGGCA ACGACGGCGG AGAGCGTATC CACGGCTATC GCAAAATGCT CTATTTCACG CTGAAGTTCC AGCAACTTAC CGGCCCGGTG ATGGCCAAGG GCCTGGAAGA CACCACGTTC TACGTGTACA ACCGATTTAT ATCGTTAAAC GAGGTAGGCG GCTCGCCCGA AACTTTCGGA ACTTCCCTGC TGCAATTCCA TCGCGCCAAT GCTGCTCGCG CGGGCACTTG GGCGGCGTCC ATGCTCTCGA CTTCCACGCA CGACACCAAG CGTAGCGAAG ATGTCCGTGC GCGCTTGAAC GTGCTCTCAG AGATGCCGCG CGAGTGGTCC ACCCACGTGA TGCGCTTCCG TCGCGTCAAC AAGCCGAAGA AGCTGCAACT CAGCGATGGC CGTGTTCCAC CCGATGCCAA CGAAGAATAC TTGCTCTATC AAACGCTGCT TGGCGCGTGG CCGCTCGAAG GTATCGGCGA CCCGGATTGC CGCGAGAGTT TCGTTCATCG CATCCAGGAA TACATGACCA AGGCGATCCA CGAAGCCAAG GTCAACCTGA GTTGGGTAAA CCAGAATCCG GATTACACCG AAGCTCTTCA GGAATTTGTC GCGAGCATTC TCGAGCCCGG CAGTGTGCGG CGTCCGAACC AGTTCCTCAG TTACATGGAC CAGCTGCTCC CGCAAGTTCA GTTCTTCGGC GCCATCAACT CGCTCTCACA AACGCTGATC AAGCTGACCG CGCCGGGAGT TCCCGACATT TATCAGGGTC AGGAAATGTG GGACTTCAGC CTCGTGGATC CGGACAATCG CCGCCCGGTT GACTTTGAAG CACGCAAACG CGCGGTAAGT GATCTAAACC ATTTCGCGGA CGCAGAATCT GAACTCTGCC GTACTCTTCT CGAAAACTGG CGCGACGGCC ACATTAAACT CTGGACCGTG ATGCAATCGT TGCGTCTGCG CCAGCAGGAG CGCGAACTGT TCATGGAAGG CAGCTACACG CCGCTCTCCG CAAGCTATCT GCACGAGAAG CACGTCATCG CTTATGCGCG CACTCTCAAC GGACGGCACG CGATCGCCGT GGCTCCGCGA CTGAGCTGTA CGTTGATGAA GGGTATCGTG CAGCCGCCAA TCGGGCGCGC CTGGGACCGC GGCTATCTCG AAATTCCGCC GGAGATCACC GGCACATTCC GCAATGTCTT CACCGGTGAA ACGGTGAGCA TCGGCCGCGA ACAGCGCTTG TTATGCAGCG AAATCTTCCG GTCGTTCCCC GTTGCCCTGC TGGTTTCCGC CTGA
|
Protein sequence | MRSSPTIFAE QKSALAECME RIAAGKRQLR PTSTYRLQFH SNFRFTDAEQ LIGYLHELGI SHCYASPILK ARAGSTHGYD ITDHNSLNPE IGTEEEFHQL STKLKEHGIG FILDVVPNHM GVGTGENRWW QDVLENGRAS EFADYFDIDW NPLKPELRNK LLLPILGNYY GDELEAARIK LSLHDGLIVF LYYERVLPVD PQTIPMIYGA LGDLRQRQGH RMPELIAVLE ELRGYPPNWT EDHDLVLTRQ RGLPNVVERL SELIAGSESV RQATEDAMAI LNGEVGDTRS FDGLHRLLEA QAYRLAFWRV SGEEINYRRF FDINDLVAIR MENPRVFADT HRLIRKLLAN GDVTGLRLDH PDGLFNPLQY FVRAQMLYTA SQCNGATPEG ELAENGIERE IQSAFGQRDW GGPSAPLYLL VEKILEPGEH LPVEWPVDGT VGYDFANLVN GVLIDPAGEK PLTQLYHRVL ERTVDIDDLI YDSKKLIMDT ALASEINVLT HMLDDISGRD RRARDYTRNV LSDAIRETIA CFPVYRTYID ERGNMNARDR EQIDKAIVTA KRRNEGMAAG VFDFLRDILL LEGNDGGERI HGYRKMLYFT LKFQQLTGPV MAKGLEDTTF YVYNRFISLN EVGGSPETFG TSLLQFHRAN AARAGTWAAS MLSTSTHDTK RSEDVRARLN VLSEMPREWS THVMRFRRVN KPKKLQLSDG RVPPDANEEY LLYQTLLGAW PLEGIGDPDC RESFVHRIQE YMTKAIHEAK VNLSWVNQNP DYTEALQEFV ASILEPGSVR RPNQFLSYMD QLLPQVQFFG AINSLSQTLI KLTAPGVPDI YQGQEMWDFS LVDPDNRRPV DFEARKRAVS DLNHFADAES ELCRTLLENW RDGHIKLWTV MQSLRLRQQE RELFMEGSYT PLSASYLHEK HVIAYARTLN GRHAIAVAPR LSCTLMKGIV QPPIGRAWDR GYLEIPPEIT GTFRNVFTGE TVSIGREQRL LCSEIFRSFP VALLVSA
|
| |