Gene Clim_2378 details


Gene Information

Locus tag: Clim_2378
Symbol:
ID: 6355849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 2603839
End bp: 2607138
Gene Length: 3300 bp
Protein Length: 1099 aa
Translation table: 11
GC content: 54%
IMG OID: 642669970
Product: trehalose synthase
Protein accession: YP_001944380
Protein GI: 189347851
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
        [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis
TIGRFAM ID: [TIGR02456] trehalose synthase
            [TIGR02457] trehalose synthase-fused probable maltokinase


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATCAAC CCGAACCGCT CTGGTACAAA GACGCCATCA TTTACGAAGC ACATGTCAAA 
ACGTTTTACG ACAGTAACAA CGACGGCATC GGTGATTTTC AGGGACTTCG TCAAAAACTC
GGTTATCTGG AAAGTCTCGG TATTACCGCG ATCTGGCTGC TGCCGTTCTA TCCGTCGCCG
CTTCGTGACG ACGGGTACGA TATCGCCGAC TATATGTCCG TAAATCCCGA CTACGGGACG
CTCGAAGATT TCAGACAGTT CCTCGAGGAA GCGCATTCAC GCGGGATGAA AGTCATTACC
GAACTGGTTG TGAACCATAC CTCCGATCAG CACGCGTGGT TTCAGCGGGC TCGCAAAGCC
CCGGCAGGCT CTCCTGAACG GAATTTCTAT GTCTGGAGCG ACGATATGGA CAAATATTCC
GAGACCCGCA TTATTTTTCA GGATTTCGAA GCATCCAACT GGACCTGGGA TCCGGTCGCC
AAGCAGTATT TCTGGCATCG TTTCTATCAC CACCAGCCCG ATCTGAACTT TGAAAATCCG
GAAGTCCATC AGGCCCTTTT CGATGTACTC GATTTCTGGC TTGGTATGGG CGTTGACGGT
CTGAGGCTCG ATGCCGTCCC CTACCTCTAC GAGGCAGAGG GCACCAATTG CGAAAATCTT
CCCCAGACCT ACGATTATCT CAGGAAACTT CGCAGTTATG TCGACGAACA TTATCCGAAC
CGGATGCTGC TTGCCGAAGC GAACCAGTGG CCTGAAGATT CCGCCGCTTA TTTAGGCGAG
GGCGATCTGT GCCATATGAA CTTCCATTTT CCCCTGATGC CGCGCATGTA TATGGCGCTG
GCGACCGAAG ATCGTTTTCC CATTCTCGAC ATTCTCGACC AGACTCCCGA AATTCCTGAA
AACTGCCAGT GGGCCTCTTT TCTGCGCAAC CACGACGAGC TGACCCTTGA AATGGTGACC
GATGAGGAGC GCGACTACAT GCGCCGGGTC TATGCCAACG ATCCCAGAGC GCGCATCAAC
CTCGGTATCC GCAGAAGGCT TGCTCCGCTC ATGACCAACG ACCGGCGGCG TATCGAACTG
ATGAACATCA TGCTGCTCTC GCTACCCGGC ACACCGGTGC TCTACTACGG CGACGAGATC
GGTATGGGCG ACAACTATTA TCTTGGCGAC CGGGACGGTG TTCGCACTCC TATGCAGTGG
AACTCCGATC GTAATGCCGG TTTTTCTCGC GCCAACCCGC AAAAACTGCT GCTTCCGGTC
ATTATCGATC CCGAATACCA TTACGAAGCC GTCAACGTAG AGGTGCAGGA GAGCAACACC
AATTCGCTGC TCTGGTGGAT GCGGCACGCT ATCGCCACGG CAAGGCGCTA CAAGTCGCTC
AGTCGCGGCA GCATCGAGTT TCTGCAGGTG AACAACCCCA AAGTGCTGAT ATTTACCCGC
ACCTTCGAGG ATGAAACCAT GCTCGCAGTG ATCAACCTTT CCCGAAATGC CCAGGCGGTC
ACGATCGATC TGTCCGGCTA CGAAGGGTGT ATCCCCGAAG AGGTCTTCAG TCTGAACCGG
TTCCCGAAAG TTCGCGAAAC GCCCTATATG GTCGCTCTCG GAGCATACGG TTTCTTCTGG
CTTCGGCTTG TCAGAGAGAA CGCCCAGGAA GGCGGGCGTC CCTATCTTGA CTCGCCGTAT
GCGAAAGTTG CCCGCTGGCA GAGCTGTTTT GTAGGTAAAA GCCGGGAACG GCTCGAAACC
GAAATCCTTC CGAAGTACTT TATGGGCAGC CGCTGGTTCG GCGGCAAGGC AAGAACCATC
ATACGCATCG CCATTATCGA TACAATCCCG GTTGAGGGGT TGGAGCATGC CAAACTGCTG
ATTACCGAGG TAAGCTATTC AAGCGGCGAA AACGAGCGTT ATCAGCTTCC GGTCTGCTTT
ACTCCTGAAG GTGTCGTCAG CACTGCCGCC GATAACTTCT CCAGCCGGGT TATTGCGAAA
GTCGTACTTG GCGATGAGGA AGGGTATCTC TGCGATGCCA CCTATGAAAA CAGCTTTCTC
AGTCGGCTTT TCCACCTTGT TATGGGCAAC TCGGAATGGA AGGGCAAGCT CGGCAATGTC
AACGGCGTAA AAGGAGAGGC GTTCGACGGA ATTTCCAGGG ATAAGGGGAC CGGGCCCGAA
CCATTTCTGC TCGGCAACGA GCAGAGTAAC ACCTCCATCC GATATCGTGA CGATCTCTGC
CTGAAACTTT ATCGCAGAAT TGAAACAGGA GTCTCTCCCG AAGTCGAAAT GTGCCGCGCG
TTGTCGGAGA GAAGCTCGTT CGCTCAACTG CCCAATTATC TTGGGACGCT TGATTACGTT
CAGAGCCGTA CGAACAGGTA TTCCATCGGT ATTCTGCAGA GTTTCGTGCC CAATGAAGGC
GATACCTGGA AGTTGTCGCT CGATTATGTA CAGCGGTATT ATGAAGAGGT GCTTTCGAAA
CTGCAGTGCG GTATCGAACT GCCTCCGCTT CCGGCTCTTA GCGGCAATCC CGTTCCTATT
CCTCCGGCTA TGCAGGAGCT TATCGGCGGA ACCTGGCTCG GCCTGGTTGA AAAGCTTGCC
GAACGAACGG CCGAGATGCA TATCGCTCTG GCCTCGATGA CCGACCAGCC CGAGTTTGCA
CCGGAGCCCT TTACGTCGCT CTACCAGCGT TCCATCTATC AGGCGATGTG CGAGCAGGTA
AAGCGCAGTA TGATTCTGCT TCGTGAAGTG AAAGATTTCA TGATCGGCGA ACCGAAGGAA
CTGGTATTGC AGATGCTGGC CAATCAGAAA AAAATTCTGC AGCAGTTCGA ACCGATCAGA
ACCGGGAAAA TAGATACCCT GAAGGTGAGA ATTCATGGCG ATTACCATCT TGGGCAGGTA
CTCTCTACAG GCAATGACGT CGTGATTATC GACTTTGAGG GCGAGCCCTC CCGTCCGATA
TCGGAACGCA AGATCAAACG CTCGGTTTTT CGCGATGTGG CCGGCATGAT GCGTTCGTTC
GATTATGCGG CCTTCAATGT GCTTCTGCAG GCAAATCCGG TGATCAGGCA GGAGGATGTA
GCGCGCCTTG AACCATGGGC CGAGCGCTGG AGCTACTATG TGGGTCAGCA CTTTGTGGAT
TCGTATTTCA GTGCGGCAAA GGGGCATGAA ATCGTTCCCG AAGAGGCCTC TCAGCGAGAG
CACCTGCTTC GCGGCTACCT CATGAACAAG GCGGTTTACG AGTTGAATTA CGAGCTGAAC
AACCGTCCTG AGTGGGCCAC GATTCCCATG CGGGGCATTC TTAAGCTCAT CGAGTTGTAA
 
Protein sequence
MYQPEPLWYK DAIIYEAHVK TFYDSNNDGI GDFQGLRQKL GYLESLGITA IWLLPFYPSP 
LRDDGYDIAD YMSVNPDYGT LEDFRQFLEE AHSRGMKVIT ELVVNHTSDQ HAWFQRARKA
PAGSPERNFY VWSDDMDKYS ETRIIFQDFE ASNWTWDPVA KQYFWHRFYH HQPDLNFENP
EVHQALFDVL DFWLGMGVDG LRLDAVPYLY EAEGTNCENL PQTYDYLRKL RSYVDEHYPN
RMLLAEANQW PEDSAAYLGE GDLCHMNFHF PLMPRMYMAL ATEDRFPILD ILDQTPEIPE
NCQWASFLRN HDELTLEMVT DEERDYMRRV YANDPRARIN LGIRRRLAPL MTNDRRRIEL
MNIMLLSLPG TPVLYYGDEI GMGDNYYLGD RDGVRTPMQW NSDRNAGFSR ANPQKLLLPV
IIDPEYHYEA VNVEVQESNT NSLLWWMRHA IATARRYKSL SRGSIEFLQV NNPKVLIFTR
TFEDETMLAV INLSRNAQAV TIDLSGYEGC IPEEVFSLNR FPKVRETPYM VALGAYGFFW
LRLVRENAQE GGRPYLDSPY AKVARWQSCF VGKSRERLET EILPKYFMGS RWFGGKARTI
IRIAIIDTIP VEGLEHAKLL ITEVSYSSGE NERYQLPVCF TPEGVVSTAA DNFSSRVIAK
VVLGDEEGYL CDATYENSFL SRLFHLVMGN SEWKGKLGNV NGVKGEAFDG ISRDKGTGPE
PFLLGNEQSN TSIRYRDDLC LKLYRRIETG VSPEVEMCRA LSERSSFAQL PNYLGTLDYV
QSRTNRYSIG ILQSFVPNEG DTWKLSLDYV QRYYEEVLSK LQCGIELPPL PALSGNPVPI
PPAMQELIGG TWLGLVEKLA ERTAEMHIAL ASMTDQPEFA PEPFTSLYQR SIYQAMCEQV
KRSMILLREV KDFMIGEPKE LVLQMLANQK KILQQFEPIR TGKIDTLKVR IHGDYHLGQV
LSTGNDVVII DFEGEPSRPI SERKIKRSVF RDVAGMMRSF DYAAFNVLLQ ANPVIRQEDV
ARLEPWAERW SYYVGQHFVD SYFSAAKGHE IVPEEASQRE HLLRGYLMNK AVYELNYELN
NRPEWATIPM RGILKLIEL
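
Consistency check (illustrative)

The coordinate and length fields in this record can be cross-checked against each other with a short script. The Python sketch below is not part of the source database; the function names (clean_sequence, expected_lengths, gc_fraction) are hypothetical. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length.

```python
def clean_sequence(block):
    """Join a sequence printed in space-separated groups of 10 into one string."""
    return "".join(block.split()).upper()


def expected_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa).

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that does not contribute a residue to the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len, gene_len // 3 - 1


def gc_fraction(seq):
    """Fraction of G and C bases in an already cleaned nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Gene Information fields above.
    gene_len, protein_len = expected_lengths(2603839, 2607138)
    print(gene_len, protein_len)  # 3300 1099

    # First printed line of the gene sequence: six groups of 10 bases.
    first_line = ("ATGTATCAAC CCGAACCGCT CTGGTACAAA "
                  "GACGCCATCA TTTACGAAGC ACATGTCAAA")
    print(len(clean_sequence(first_line)))  # 60
```

With the coordinates listed above, the sketch gives 3300 bp and 1099 aa, matching the Gene Length and Protein Length fields. Passing the full cleaned gene sequence to gc_fraction gives a value that can be compared with the GC content field.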