Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1485 |
Symbol | |
ID | 4071655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1799167 |
End bp | 1802493 |
Gene Length | 3327 bp |
Protein Length | 1108 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637983494 |
Product | trehalose synthase-like |
Protein accession | YP_590561 |
Protein GI | 94968513 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02456] trehalose synthase [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTGA TTTCGACACA GACTTGGTTC AAAGACGCGA TCATCTACGA AGTGCACGTT CGTGCTTTCT ACGACAGCGT GACTGACGGC ATCGGTGACT TCGGCGGTAT CACCCAGAAA CTCGATTACC TCGAAGACCT CGGCGTCACC GCCGTATGGC TCCTGCCCTT TTATCCGTCG CCGCTCAAGG ACGACGGCTA CGACATCGCC GACTACAACA ACGTCCATCC GTCTTACGGA TCGCTACGCG AGTTCCAGCG CTTTCTGCGA GAAGCCCATC GTCGAGGAAT CCGCGTCATC ACCGAGTTGG TGCTGAACCA CACCTCCGAT CAGCACATCT GGTTCCAGCG CTCCCGCCGC GCCGAACCCG GTAGCCGCTG GCGCAACTTC TACGTCTGGA GCGACACCCC CGATCGCTAC CAGGACGCGC GGATCATCTT CAAAGACTTC GAGACCTCCA ACTGGACATG GGACCCGATC GCCAAGGCCT ACTTTTGGCA CCGCTTCTAT TCCCACCAGC CTGACTTGAA CTGGGAAAAC CCTGAAGTTC GCGAGGCTAT GTTCGATGCC ATGGACTTCT GGTTCGACAT GGGCGTGGAC GGCATGCGCC TTGACGCCGT TCCGTACCTT TACGAACGCG AAGGCACCAA CTGCGAAAAT CTCATCGAGA CCCACGGTGC CCTGCGCGAA CTGCGCAAGC ACCTCGACGA AAAATACAAA GACAAAATGC TCCTCGCGGA AGCGAATCAG TGGCCGGAAG ACGCTGTTGC CTATTTCGGT AAGGGCGATG AGTGTCACAT GGCCTTCCAC TTTCCGCTGA TGCCGCGACT CTTTATGTCG CTGCGAATGG AAGACCGTTA CCCGGTCACC GATATTCTTC GCCTCACGCC GCCTATCCCC GAGACGTGCC AGTGGGCGCT CTTCCTGCGC AATCACGACG AGCTCACCCT TGAGATGGTC ACCGACGAAG AGCGCGATTA TATGTACCGC ACCTACGCGC ACGACCGCAC AGCGCGCATC AATCTCGGAA TTCGCCGCCG CCTCGCGCCG CTGCTCGAAA ACGATCGCCG CAAGATCGAG CTCATGAACG CGCTGCTCTT TTCGCTGCCG GGAACACCCG TTGTTTATTA CGGCGACGAG ATTGGCATGG GCGACAACAT CTACCTCGGC GACCGCAACG GTGTTCGCAC GCCGATGCAG TGGAGTGCCG ATCGAAACGC CGGCTTTTCA AAAGCTAACC CGCAAAAGCT CTACCTTCCC GTCAACATCG ACCCCGAATA TCACTACGAG GCGGTCAACG TCGAGAGTCA GCAGAACAAT CCTCACTCGC TGCTCTGGTG GATGAAACGC GTCATCGCGC AGCGCACGCA ATTCAAGGCC TTCGGCCGTG GCACGCTCGA ATTTCTTTAT CCCAGCAATC GCAAGGTCGT CGCCTACATC CGCCAGTACG AAGACGAAAC CATCCTCGTG GTCGCCAACC TCTCGCGTTT CACTCAGTGT GCCGAACTCG ATCTCAGCCG CTTTAACGGT CTGGCGCCAG TCGAAATCTT TGGCCGCGCC CGGTTCCCGA ACATCACTGA ACAGCCTTAC TTTTTCTCGC TCGGTCCACA CGCGTATTAC TGGTTCCACT TGCAGCCGCG CGAGGTCACG CACGAGTCGC TGACCACCAA CGCCGCCGCG CCGTCGGTTC CGACAATACT GGTAGAGTCC ACCGCAGACG TCTTTGCGCC CGCCACACGC GATGCCTTTT TCCGCCTTCT CCCAGCCATG CTGGTCAGCC GCCCCTGGTT CCAGGGGAAG TCGAAAACCA TCCGCAGCCT CGATCTCGGC GACGCCATAC CGCTTCCGCA GACCGGCGCC TACGTCGTGC TACTCAATGT TGATTACGCC GACGGCGACC CAGAAACCTA TCTCACGCCT CTCTCTATCG CGAGTGGAGA AAAGGCCGAT GCCATCCTCC GCGATCGTCC CGACGCGGTC CTCGCCAAGC TTGACGATGG CAAACGCCAG GTCCTGCTCT ATGCCGGTGT TTATGACCGC GAATTTGCCG ACTCGCTCCT CCGGGCCATC GTCAAACGCA AGCGCTTTAA AGGCGAGATT GGCGAGCTCG TGGCCGGTCA CACCCGCTCC TTCCGTAAGG CTTGGGAGAA CCAGCGTGCC GGCTTGGAAC CGAATACTCA GCCCGGTGAA CGTAATACCG TAACAATCAC TTACGGAGAG AACTTTAATC TAAAGCTTTA CCGTAAGCTG GATGCTGGCC CGAACCCCGA TCGTGAGATG GAAGAGTTCC TGACCGAGGA AACCAGCTTT ACTCAAATAC CCCGAGCGCT CGGCTGGCTG GAGTACCGTC GTGGCGAAGA GGAGAGCCTC GAGCAGACCA CCATCGGACT GCTCGTCGGT TACGCTCGCA ACGCTACCAA CGGTTGGACC TACGCCCTCG ACACTCTCGG CATTTTCTTC GAGCGCGCTC TCGCCATTCC GCAAAACGAC CCGCGTCTAA AGGACCTTAC CGACGGTTCT GTCCTTGCCA TGGCCAACCA GCCGTTGCCG CCGATCATGG GGGAACTCCT CGGCAGCCTT GCCGACAACA TGCGTTTGCT TGGCCAACGC ACCGCCGACT TGCACGTCGC GCTCGCCAGT CGACCCGACA TCCCGACCTT CGCTCCCGAG CCGTTCACCG AGTTCTATCG CCACAGCCTG TATCACGGGA TGCTCGGTCA ATCGAGCCGC GCCATGGACA CCCTGCGGGC TCGACTCCGT TCGCTTCCCA CTGCTTCGCA GGACGACGCC CAAGCTCTGA TTAATCGCGA AAATGAACTG CGCGCCAGGC TCCTGCGCCT CCGTGACACT CGAATTTCCG GCACTCGCAT CCGTCACCAC GCCGATTACC AACTCTCGAA TATTCAATAC ACTGGCAGCG ATTTCCTCGT GTCTAACTTT GAGGGAAATC CCGAACGTCC GATCGGCGAG CGGCGCATCA AGCGTTCCCC GTTGCGAGAC ATCGCAAGCA TGGTGCGTTC CTTCCATTAC GTCTCGCATG CCGTACTCTT CGACCAGGTA CCCGGGATCG TGCTGAGCCG CGACGCCTAC CCGCAATTGG AGCGCTGGGC AACCGCGTGG TACCAGTGGG TCAGCGCGTT GTTCTTGAAG GGCTACCTCG AACGCGCCGG TACTGCCGCC TTTCTGCCGC GCTCGCAGGA GGAACGCGCA GTGCTGCTTG AGTCCTATAC GTTGGAGAAG GCGCTGATCG AAATCGAATA CGAACTCACG CACCGGCCGA ACTGGGTCCG CATTCCGGTG CATGGCATTC TCGAACAGCT CCACTGA
|
Protein sequence | MDLISTQTWF KDAIIYEVHV RAFYDSVTDG IGDFGGITQK LDYLEDLGVT AVWLLPFYPS PLKDDGYDIA DYNNVHPSYG SLREFQRFLR EAHRRGIRVI TELVLNHTSD QHIWFQRSRR AEPGSRWRNF YVWSDTPDRY QDARIIFKDF ETSNWTWDPI AKAYFWHRFY SHQPDLNWEN PEVREAMFDA MDFWFDMGVD GMRLDAVPYL YEREGTNCEN LIETHGALRE LRKHLDEKYK DKMLLAEANQ WPEDAVAYFG KGDECHMAFH FPLMPRLFMS LRMEDRYPVT DILRLTPPIP ETCQWALFLR NHDELTLEMV TDEERDYMYR TYAHDRTARI NLGIRRRLAP LLENDRRKIE LMNALLFSLP GTPVVYYGDE IGMGDNIYLG DRNGVRTPMQ WSADRNAGFS KANPQKLYLP VNIDPEYHYE AVNVESQQNN PHSLLWWMKR VIAQRTQFKA FGRGTLEFLY PSNRKVVAYI RQYEDETILV VANLSRFTQC AELDLSRFNG LAPVEIFGRA RFPNITEQPY FFSLGPHAYY WFHLQPREVT HESLTTNAAA PSVPTILVES TADVFAPATR DAFFRLLPAM LVSRPWFQGK SKTIRSLDLG DAIPLPQTGA YVVLLNVDYA DGDPETYLTP LSIASGEKAD AILRDRPDAV LAKLDDGKRQ VLLYAGVYDR EFADSLLRAI VKRKRFKGEI GELVAGHTRS FRKAWENQRA GLEPNTQPGE RNTVTITYGE NFNLKLYRKL DAGPNPDREM EEFLTEETSF TQIPRALGWL EYRRGEEESL EQTTIGLLVG YARNATNGWT YALDTLGIFF ERALAIPQND PRLKDLTDGS VLAMANQPLP PIMGELLGSL ADNMRLLGQR TADLHVALAS RPDIPTFAPE PFTEFYRHSL YHGMLGQSSR AMDTLRARLR SLPTASQDDA QALINRENEL RARLLRLRDT RISGTRIRHH ADYQLSNIQY TGSDFLVSNF EGNPERPIGE RRIKRSPLRD IASMVRSFHY VSHAVLFDQV PGIVLSRDAY PQLERWATAW YQWVSALFLK GYLERAGTAA FLPRSQEERA VLLESYTLEK ALIEIEYELT HRPNWVRIPV HGILEQLH
|
| |