Gene Acid345_1485 details


Gene Information

Locus tag: Acid345_1485
Symbol:
ID: 4071655
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1799167
End bp: 1802493
Gene Length: 3327 bp
Protein Length: 1108 aa
Translation table: 11
GC content: 59%
IMG OID: 637983494
Product: trehalose synthase-like
Protein accession: YP_590561
Protein GI: 94968513
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
        [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis
TIGRFAM ID: [TIGR02456] trehalose synthase
            [TIGR02457] trehalose synthase-fused probable maltokinase
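
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and that the last codon
    is a stop codon, which is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1799167, 1802493)
print(length, "bp")                  # compare with the Gene Length field
print(protein_length(length), "aa")  # compare with the Protein Length field
```

With the values listed in this record, the two results agree with the Gene Length (3327 bp) and Protein Length (1108 aa) fields.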


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTTGA TTTCGACACA GACTTGGTTC AAAGACGCGA TCATCTACGA AGTGCACGTT 
CGTGCTTTCT ACGACAGCGT GACTGACGGC ATCGGTGACT TCGGCGGTAT CACCCAGAAA
CTCGATTACC TCGAAGACCT CGGCGTCACC GCCGTATGGC TCCTGCCCTT TTATCCGTCG
CCGCTCAAGG ACGACGGCTA CGACATCGCC GACTACAACA ACGTCCATCC GTCTTACGGA
TCGCTACGCG AGTTCCAGCG CTTTCTGCGA GAAGCCCATC GTCGAGGAAT CCGCGTCATC
ACCGAGTTGG TGCTGAACCA CACCTCCGAT CAGCACATCT GGTTCCAGCG CTCCCGCCGC
GCCGAACCCG GTAGCCGCTG GCGCAACTTC TACGTCTGGA GCGACACCCC CGATCGCTAC
CAGGACGCGC GGATCATCTT CAAAGACTTC GAGACCTCCA ACTGGACATG GGACCCGATC
GCCAAGGCCT ACTTTTGGCA CCGCTTCTAT TCCCACCAGC CTGACTTGAA CTGGGAAAAC
CCTGAAGTTC GCGAGGCTAT GTTCGATGCC ATGGACTTCT GGTTCGACAT GGGCGTGGAC
GGCATGCGCC TTGACGCCGT TCCGTACCTT TACGAACGCG AAGGCACCAA CTGCGAAAAT
CTCATCGAGA CCCACGGTGC CCTGCGCGAA CTGCGCAAGC ACCTCGACGA AAAATACAAA
GACAAAATGC TCCTCGCGGA AGCGAATCAG TGGCCGGAAG ACGCTGTTGC CTATTTCGGT
AAGGGCGATG AGTGTCACAT GGCCTTCCAC TTTCCGCTGA TGCCGCGACT CTTTATGTCG
CTGCGAATGG AAGACCGTTA CCCGGTCACC GATATTCTTC GCCTCACGCC GCCTATCCCC
GAGACGTGCC AGTGGGCGCT CTTCCTGCGC AATCACGACG AGCTCACCCT TGAGATGGTC
ACCGACGAAG AGCGCGATTA TATGTACCGC ACCTACGCGC ACGACCGCAC AGCGCGCATC
AATCTCGGAA TTCGCCGCCG CCTCGCGCCG CTGCTCGAAA ACGATCGCCG CAAGATCGAG
CTCATGAACG CGCTGCTCTT TTCGCTGCCG GGAACACCCG TTGTTTATTA CGGCGACGAG
ATTGGCATGG GCGACAACAT CTACCTCGGC GACCGCAACG GTGTTCGCAC GCCGATGCAG
TGGAGTGCCG ATCGAAACGC CGGCTTTTCA AAAGCTAACC CGCAAAAGCT CTACCTTCCC
GTCAACATCG ACCCCGAATA TCACTACGAG GCGGTCAACG TCGAGAGTCA GCAGAACAAT
CCTCACTCGC TGCTCTGGTG GATGAAACGC GTCATCGCGC AGCGCACGCA ATTCAAGGCC
TTCGGCCGTG GCACGCTCGA ATTTCTTTAT CCCAGCAATC GCAAGGTCGT CGCCTACATC
CGCCAGTACG AAGACGAAAC CATCCTCGTG GTCGCCAACC TCTCGCGTTT CACTCAGTGT
GCCGAACTCG ATCTCAGCCG CTTTAACGGT CTGGCGCCAG TCGAAATCTT TGGCCGCGCC
CGGTTCCCGA ACATCACTGA ACAGCCTTAC TTTTTCTCGC TCGGTCCACA CGCGTATTAC
TGGTTCCACT TGCAGCCGCG CGAGGTCACG CACGAGTCGC TGACCACCAA CGCCGCCGCG
CCGTCGGTTC CGACAATACT GGTAGAGTCC ACCGCAGACG TCTTTGCGCC CGCCACACGC
GATGCCTTTT TCCGCCTTCT CCCAGCCATG CTGGTCAGCC GCCCCTGGTT CCAGGGGAAG
TCGAAAACCA TCCGCAGCCT CGATCTCGGC GACGCCATAC CGCTTCCGCA GACCGGCGCC
TACGTCGTGC TACTCAATGT TGATTACGCC GACGGCGACC CAGAAACCTA TCTCACGCCT
CTCTCTATCG CGAGTGGAGA AAAGGCCGAT GCCATCCTCC GCGATCGTCC CGACGCGGTC
CTCGCCAAGC TTGACGATGG CAAACGCCAG GTCCTGCTCT ATGCCGGTGT TTATGACCGC
GAATTTGCCG ACTCGCTCCT CCGGGCCATC GTCAAACGCA AGCGCTTTAA AGGCGAGATT
GGCGAGCTCG TGGCCGGTCA CACCCGCTCC TTCCGTAAGG CTTGGGAGAA CCAGCGTGCC
GGCTTGGAAC CGAATACTCA GCCCGGTGAA CGTAATACCG TAACAATCAC TTACGGAGAG
AACTTTAATC TAAAGCTTTA CCGTAAGCTG GATGCTGGCC CGAACCCCGA TCGTGAGATG
GAAGAGTTCC TGACCGAGGA AACCAGCTTT ACTCAAATAC CCCGAGCGCT CGGCTGGCTG
GAGTACCGTC GTGGCGAAGA GGAGAGCCTC GAGCAGACCA CCATCGGACT GCTCGTCGGT
TACGCTCGCA ACGCTACCAA CGGTTGGACC TACGCCCTCG ACACTCTCGG CATTTTCTTC
GAGCGCGCTC TCGCCATTCC GCAAAACGAC CCGCGTCTAA AGGACCTTAC CGACGGTTCT
GTCCTTGCCA TGGCCAACCA GCCGTTGCCG CCGATCATGG GGGAACTCCT CGGCAGCCTT
GCCGACAACA TGCGTTTGCT TGGCCAACGC ACCGCCGACT TGCACGTCGC GCTCGCCAGT
CGACCCGACA TCCCGACCTT CGCTCCCGAG CCGTTCACCG AGTTCTATCG CCACAGCCTG
TATCACGGGA TGCTCGGTCA ATCGAGCCGC GCCATGGACA CCCTGCGGGC TCGACTCCGT
TCGCTTCCCA CTGCTTCGCA GGACGACGCC CAAGCTCTGA TTAATCGCGA AAATGAACTG
CGCGCCAGGC TCCTGCGCCT CCGTGACACT CGAATTTCCG GCACTCGCAT CCGTCACCAC
GCCGATTACC AACTCTCGAA TATTCAATAC ACTGGCAGCG ATTTCCTCGT GTCTAACTTT
GAGGGAAATC CCGAACGTCC GATCGGCGAG CGGCGCATCA AGCGTTCCCC GTTGCGAGAC
ATCGCAAGCA TGGTGCGTTC CTTCCATTAC GTCTCGCATG CCGTACTCTT CGACCAGGTA
CCCGGGATCG TGCTGAGCCG CGACGCCTAC CCGCAATTGG AGCGCTGGGC AACCGCGTGG
TACCAGTGGG TCAGCGCGTT GTTCTTGAAG GGCTACCTCG AACGCGCCGG TACTGCCGCC
TTTCTGCCGC GCTCGCAGGA GGAACGCGCA GTGCTGCTTG AGTCCTATAC GTTGGAGAAG
GCGCTGATCG AAATCGAATA CGAACTCACG CACCGGCCGA ACTGGGTCCG CATTCCGGTG
CATGGCATTC TCGAACAGCT CCACTGA
 
Protein sequence
MDLISTQTWF KDAIIYEVHV RAFYDSVTDG IGDFGGITQK LDYLEDLGVT AVWLLPFYPS 
PLKDDGYDIA DYNNVHPSYG SLREFQRFLR EAHRRGIRVI TELVLNHTSD QHIWFQRSRR
AEPGSRWRNF YVWSDTPDRY QDARIIFKDF ETSNWTWDPI AKAYFWHRFY SHQPDLNWEN
PEVREAMFDA MDFWFDMGVD GMRLDAVPYL YEREGTNCEN LIETHGALRE LRKHLDEKYK
DKMLLAEANQ WPEDAVAYFG KGDECHMAFH FPLMPRLFMS LRMEDRYPVT DILRLTPPIP
ETCQWALFLR NHDELTLEMV TDEERDYMYR TYAHDRTARI NLGIRRRLAP LLENDRRKIE
LMNALLFSLP GTPVVYYGDE IGMGDNIYLG DRNGVRTPMQ WSADRNAGFS KANPQKLYLP
VNIDPEYHYE AVNVESQQNN PHSLLWWMKR VIAQRTQFKA FGRGTLEFLY PSNRKVVAYI
RQYEDETILV VANLSRFTQC AELDLSRFNG LAPVEIFGRA RFPNITEQPY FFSLGPHAYY
WFHLQPREVT HESLTTNAAA PSVPTILVES TADVFAPATR DAFFRLLPAM LVSRPWFQGK
SKTIRSLDLG DAIPLPQTGA YVVLLNVDYA DGDPETYLTP LSIASGEKAD AILRDRPDAV
LAKLDDGKRQ VLLYAGVYDR EFADSLLRAI VKRKRFKGEI GELVAGHTRS FRKAWENQRA
GLEPNTQPGE RNTVTITYGE NFNLKLYRKL DAGPNPDREM EEFLTEETSF TQIPRALGWL
EYRRGEEESL EQTTIGLLVG YARNATNGWT YALDTLGIFF ERALAIPQND PRLKDLTDGS
VLAMANQPLP PIMGELLGSL ADNMRLLGQR TADLHVALAS RPDIPTFAPE PFTEFYRHSL
YHGMLGQSSR AMDTLRARLR SLPTASQDDA QALINRENEL RARLLRLRDT RISGTRIRHH
ADYQLSNIQY TGSDFLVSNF EGNPERPIGE RRIKRSPLRD IASMVRSFHY VSHAVLFDQV
PGIVLSRDAY PQLERWATAW YQWVSALFLK GYLERAGTAA FLPRSQEERA VLLESYTLEK
ALIEIEYELT HRPNWVRIPV HGILEQLH
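
The sequences above are printed in blocks of ten characters. The following is a minimal sketch of how such blocks might be cleaned, checked for GC content, and translated. It assumes the gene sequence is given in the coding orientation and uses the codon assignments of translation table 11, which are the same as those of the standard code (the two tables differ only in permitted start codons). All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; table 11
# uses the same codon-to-amino-acid assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocks: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocks.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean("ATGGATTTGA TTTCGACACA GACTTGGTTC")
print(translate(first_blocks))  # compare with the start of the protein sequence
```

Applied to the full gene sequence, `translate` could be compared against the listed protein sequence and `gc_content` against the GC content field.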