Gene Acid345_2763 details


Gene Information

Locus tag: Acid345_2763
Symbol:
ID: 4072385
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3269935
End bp: 3272958
Gene Length: 3024 bp
Protein Length: 1007 aa
Translation table: 11
GC content: 59%
IMG OID: 637984780
Product: malto-oligosyltrehalose synthase
Protein accession: YP_591838
Protein GI: 94969790
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3280] Maltooligosyl trehalose synthase
TIGRFAM ID: [TIGR02401] malto-oligosyltrehalose synthase
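
The coordinate and length fields in this record are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and unspliced with a single terminal stop codon. The function name `cds_lengths` is hypothetical and not part of any database tool.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 3269935
end_bp = 3272958


def cds_lengths(start, end):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes the CDS ends with one stop codon that is not counted
    in the protein length.
    """
    gene_len = end - start + 1  # inclusive coordinates, hence the +1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len, gene_len // 3 - 1  # drop the stop codon


gene_len, protein_len = cds_lengths(start_bp, end_bp)
print(gene_len, protein_len)  # 3024 1007
```

The result matches the Gene Length (3024 bp) and Protein Length (1007 aa) fields. The calculation does not depend on the strand, which is blank in this record.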


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.200258
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCGAAGCA GCCCCACGAT TTTTGCCGAA CAGAAGAGTG CCTTGGCCGA ATGTATGGAG 
CGGATTGCCG CTGGTAAGCG GCAGCTGCGT CCAACTTCGA CCTACCGCCT CCAATTTCAT
TCCAATTTCA GGTTCACAGA CGCCGAGCAG CTCATCGGCT ATCTGCACGA ACTCGGCATC
TCGCATTGTT ATGCCTCGCC GATCTTAAAG GCCCGCGCCG GAAGTACACA CGGCTACGAC
ATCACCGATC ACAACTCGCT CAACCCGGAG ATCGGGACGG AAGAGGAGTT TCACCAGCTC
TCAACGAAGC TGAAAGAGCA CGGGATCGGA TTCATCCTCG ACGTGGTGCC CAACCACATG
GGTGTAGGCA CCGGCGAGAA CCGCTGGTGG CAGGATGTTC TCGAAAACGG CCGCGCCAGC
GAGTTCGCCG ACTACTTCGA TATTGACTGG AACCCGCTCA AGCCGGAGCT CCGCAACAAG
CTGCTGCTCC CGATCCTCGG CAACTACTAC GGCGATGAAC TGGAGGCCGC CCGGATTAAG
TTGTCGCTGC ACGACGGCCT CATCGTCTTC CTTTACTACG AACGAGTCCT GCCCGTGGAT
CCTCAGACGA TTCCCATGAT CTACGGCGCG CTCGGAGACC TCCGCCAGCG CCAGGGCCAT
CGCATGCCCG AGCTGATCGC GGTTCTCGAA GAGCTGCGCG GATATCCGCC GAACTGGACC
GAAGATCACG ACCTTGTCCT CACCCGCCAG CGCGGACTGC CAAATGTCGT AGAGCGACTC
TCGGAACTGA TTGCCGGCAG CGAGTCCGTC CGACAGGCCA CTGAGGATGC CATGGCGATC
CTCAACGGCG AGGTTGGCGA CACCCGCAGC TTCGACGGTC TCCATCGTCT GCTGGAAGCC
CAGGCCTACC GCCTGGCGTT TTGGCGTGTG AGTGGCGAGG AGATCAATTA TCGCCGCTTC
TTCGACATCA ACGACCTCGT CGCCATCCGC ATGGAAAACC CGCGCGTTTT CGCCGACACC
CATCGCCTGA TTCGCAAGCT GCTCGCAAAC GGCGACGTCA CCGGCCTGCG CCTCGACCAT
CCTGACGGCC TGTTTAATCC GCTGCAATAT TTCGTGCGCG CGCAAATGCT CTACACCGCG
AGCCAATGCA ACGGCGCCAC TCCGGAAGGC GAGCTCGCGG AAAACGGCAT CGAGCGCGAA
ATCCAGAGCG CCTTCGGACA GCGCGACTGG GGCGGTCCGA GCGCACCGCT CTATTTATTG
GTCGAGAAAA TTCTCGAACC CGGCGAGCAC CTTCCCGTCG AATGGCCGGT GGACGGCACC
GTTGGCTACG ACTTCGCCAA TCTCGTCAAC GGCGTATTGA TTGATCCCGC AGGCGAGAAG
CCGCTCACCC AGCTCTACCA TCGCGTGCTG GAGCGCACGG TCGACATCGA CGACCTGATC
TACGACAGCA AGAAGCTGAT CATGGACACC GCGTTGGCGA GCGAGATCAA CGTGCTCACC
CACATGCTCG ACGACATCTC CGGCCGCGAT CGCCGCGCCC GCGATTACAC CCGCAATGTG
CTCTCCGACG CCATCCGCGA AACCATCGCC TGCTTCCCTG TCTATCGAAC CTACATAGAT
GAGCGCGGCA ACATGAACGC GCGCGATCGT GAACAGATTG ACAAAGCCAT TGTCACCGCG
AAACGCCGCA ACGAAGGCAT GGCTGCCGGC GTCTTCGATT TTTTGCGCGA CATCCTTCTG
CTCGAAGGCA ACGACGGCGG AGAGCGTATC CACGGCTATC GCAAAATGCT CTATTTCACG
CTGAAGTTCC AGCAACTTAC CGGCCCGGTG ATGGCCAAGG GCCTGGAAGA CACCACGTTC
TACGTGTACA ACCGATTTAT ATCGTTAAAC GAGGTAGGCG GCTCGCCCGA AACTTTCGGA
ACTTCCCTGC TGCAATTCCA TCGCGCCAAT GCTGCTCGCG CGGGCACTTG GGCGGCGTCC
ATGCTCTCGA CTTCCACGCA CGACACCAAG CGTAGCGAAG ATGTCCGTGC GCGCTTGAAC
GTGCTCTCAG AGATGCCGCG CGAGTGGTCC ACCCACGTGA TGCGCTTCCG TCGCGTCAAC
AAGCCGAAGA AGCTGCAACT CAGCGATGGC CGTGTTCCAC CCGATGCCAA CGAAGAATAC
TTGCTCTATC AAACGCTGCT TGGCGCGTGG CCGCTCGAAG GTATCGGCGA CCCGGATTGC
CGCGAGAGTT TCGTTCATCG CATCCAGGAA TACATGACCA AGGCGATCCA CGAAGCCAAG
GTCAACCTGA GTTGGGTAAA CCAGAATCCG GATTACACCG AAGCTCTTCA GGAATTTGTC
GCGAGCATTC TCGAGCCCGG CAGTGTGCGG CGTCCGAACC AGTTCCTCAG TTACATGGAC
CAGCTGCTCC CGCAAGTTCA GTTCTTCGGC GCCATCAACT CGCTCTCACA AACGCTGATC
AAGCTGACCG CGCCGGGAGT TCCCGACATT TATCAGGGTC AGGAAATGTG GGACTTCAGC
CTCGTGGATC CGGACAATCG CCGCCCGGTT GACTTTGAAG CACGCAAACG CGCGGTAAGT
GATCTAAACC ATTTCGCGGA CGCAGAATCT GAACTCTGCC GTACTCTTCT CGAAAACTGG
CGCGACGGCC ACATTAAACT CTGGACCGTG ATGCAATCGT TGCGTCTGCG CCAGCAGGAG
CGCGAACTGT TCATGGAAGG CAGCTACACG CCGCTCTCCG CAAGCTATCT GCACGAGAAG
CACGTCATCG CTTATGCGCG CACTCTCAAC GGACGGCACG CGATCGCCGT GGCTCCGCGA
CTGAGCTGTA CGTTGATGAA GGGTATCGTG CAGCCGCCAA TCGGGCGCGC CTGGGACCGC
GGCTATCTCG AAATTCCGCC GGAGATCACC GGCACATTCC GCAATGTCTT CACCGGTGAA
ACGGTGAGCA TCGGCCGCGA ACAGCGCTTG TTATGCAGCG AAATCTTCCG GTCGTTCCCC
GTTGCCCTGC TGGTTTCCGC CTGA
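
The gene sequence above is printed in blocks of 10 bases, so it has to be joined before any downstream use. The following is a minimal sketch of that cleanup plus a GC-content calculation of the kind reported in the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked):
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used here as sample input only.
first_line = "ATGCGAAGCA GCCCCACGAT TTTTGCCGAA CAGAAGAGTG CCTTGGCCGA ATGTATGGAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 3024 bp sequence to `gc_percent` would give the gene-wide value to compare against the GC content field.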
 
Protein sequence
MRSSPTIFAE QKSALAECME RIAAGKRQLR PTSTYRLQFH SNFRFTDAEQ LIGYLHELGI 
SHCYASPILK ARAGSTHGYD ITDHNSLNPE IGTEEEFHQL STKLKEHGIG FILDVVPNHM
GVGTGENRWW QDVLENGRAS EFADYFDIDW NPLKPELRNK LLLPILGNYY GDELEAARIK
LSLHDGLIVF LYYERVLPVD PQTIPMIYGA LGDLRQRQGH RMPELIAVLE ELRGYPPNWT
EDHDLVLTRQ RGLPNVVERL SELIAGSESV RQATEDAMAI LNGEVGDTRS FDGLHRLLEA
QAYRLAFWRV SGEEINYRRF FDINDLVAIR MENPRVFADT HRLIRKLLAN GDVTGLRLDH
PDGLFNPLQY FVRAQMLYTA SQCNGATPEG ELAENGIERE IQSAFGQRDW GGPSAPLYLL
VEKILEPGEH LPVEWPVDGT VGYDFANLVN GVLIDPAGEK PLTQLYHRVL ERTVDIDDLI
YDSKKLIMDT ALASEINVLT HMLDDISGRD RRARDYTRNV LSDAIRETIA CFPVYRTYID
ERGNMNARDR EQIDKAIVTA KRRNEGMAAG VFDFLRDILL LEGNDGGERI HGYRKMLYFT
LKFQQLTGPV MAKGLEDTTF YVYNRFISLN EVGGSPETFG TSLLQFHRAN AARAGTWAAS
MLSTSTHDTK RSEDVRARLN VLSEMPREWS THVMRFRRVN KPKKLQLSDG RVPPDANEEY
LLYQTLLGAW PLEGIGDPDC RESFVHRIQE YMTKAIHEAK VNLSWVNQNP DYTEALQEFV
ASILEPGSVR RPNQFLSYMD QLLPQVQFFG AINSLSQTLI KLTAPGVPDI YQGQEMWDFS
LVDPDNRRPV DFEARKRAVS DLNHFADAES ELCRTLLENW RDGHIKLWTV MQSLRLRQQE
RELFMEGSYT PLSASYLHEK HVIAYARTLN GRHAIAVAPR LSCTLMKGIV QPPIGRAWDR
GYLEIPPEIT GTFRNVFTGE TVSIGREQRL LCSEIFRSFP VALLVSA
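
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below is a plain-Python illustration, not the pipeline that produced this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because this gene begins with ATG). The name `translate` is hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna):
    """Translate a blocked DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "ATGCGAAGCA GCCCCACGAT TTTTGCCGAA CAGAAGAGTG CCTTGGCCGA ATGTATGGAG"
print(translate(first_line))  # MRSSPTIFAEQKSALAECME
```

The output matches the first 20 residues of the protein sequence above (MRSSPTIFAE QKSALAECME). Translating the last line of the gene sequence in the same way gives VALLVSA, the final residues of the protein, followed by the TGA stop codon.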