Gene Cphamn1_0240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_0240 
Symbol 
ID6373895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp230044 
End bp233361 
Gene Length3318 bp 
Protein Length1105 aa 
Translation table11 
GC content52% 
IMG OID642682754 
Producttrehalose synthase 
Protein accessionYP_001958690 
Protein GI189499220 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases
[COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis 
TIGRFAM ID[TIGR02456] trehalose synthase
[TIGR02457] trehalose synthase-fused probable maltokinase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGAG CATCTGCATC CTATCAGCCT GAACCGCTCT GGTACAAGGA CGCCATCATT 
TATGAGGCAC ATGTAAAGAC TTTTTTTGAC AGTAACAATG ACGGTGTCGG TGATTTTGAA
GGGTTGCGCC AGAAGCTGCC CTATCTGGAA AGTCTCGGTA TAACCGCAAT CTGGCTGCTT
CCTTTTTATC CTTCACCCCT GAGAGATGAC GGCTACGATA TTGCCGATTA CATGGAGGTC
AATCCTGACT ACGGTACCAT CGAGGATTTC AAAGCCTTTC TCGATGACGC GCATAAGCTC
GGACTGAAGG TGATTACCGA GCTTGTCATC AACCATACGT CCGATCAGCA CGCATGGTTC
CAGAGGGCCA GACAGGCAGA GCCGGGATCG GTCGAACGGG ATTTTTACAT GTGGAGCAGT
GATCCCAAGA AATACTCCGG CGTCCGCATC ATTTTCCAGG ATTTCGAGGC CTCGAACTGG
ACATGGGACC CTGTCGCCGG AGAGTATTAC TGGCATCGGT TCTATCATCA TCAGCCTGAT
CTGAACTTCG AAAATCCCGC GGTTGAAAAA GCCATTTACA AGGTGCTTGA TTACTGGCTG
GAGATGGGCG TCGACGGGTT GCGGCTCGAC GCGGTTCCCT ATCTCTATGC AGAGGAAGGA
ACCAACTGTG AGAACCTTCC CCGCACGCAC AAGTTTCTGC AGAGGCTGCG CAAGCATGTA
GACGGCAAGT TCCCGAACCG TATGCTTCTT GCCGAGGCAA ATCAGTGGCC GGAAGATGCC
GCCGAGTATT TCGGCGAAGG TGACGAATGT CATATGAATT TCCATTTCCC TCTGATGCCG
AGGATGTACA TGGCGCTGGA AATGGAAGAT CGTTTTCCGA TCATAGATAT TCTCGACCAG
ACACCCGGGA TTCCGGAAGA GTGCCAGTGG GCTTCCTTCC TTCGCAATCA TGATGAACTT
ACTCTTGAGA TGGTGACCGA TGAGGAGCGT GACTATATGC GCCGGGTGTA CGCGCATGAT
CCGAAGGCGC GCATCAATCT CGGTATACGC CGCCGTCTGG CGCCGCTTAT GTCGAATGAC
CGCAGAAAAA TCGAGCTGAT GAACATCATG CTCCTTTCCC TGCCCGGCAC TCCGGTTCTC
TACTACGGTG ACGAGATAGG TATGGGCGAT AATTTTTACC TCGGTGATCG TGACGGCGTT
CGTACTCCCA TGCAGTGGAA CGGTGACCGG AATGCAGGTT TTTCCAGAGC CAATCCCCAG
CAGTTGCAGC TGCCGGTGAT CATTGATCCG GAATACCATT ACGAGGGAGC CAATGTCGAG
GTACAGGAAA GCAACATCAA TTCGCTGCTC TGGTGGACAC GCCACATGCT CTCCACCTCC
CGCAGGTACA AAGCCCTCAG TCGCGGGGAT ATTATCTTTA TTCAGTCTCA GAATCCTCAG
GTCCTGATTT TTACCAGGAC ATACAAGGAT GAGACCATGC TGTGCATCAT CAACCTGTCG
CGTAACGCAC AGGCGGTCAC CATGGATCTG TCGGAATACG AAGGATATAT TCCTGAAGAG
GTGTTCAGCC TCAGTCATTT TCCCGGGATC TCTGCAAGGC CGTATACGGT TACGCTGGGT
CCTTACGGAT ATTTCTGGTT CAAGCTTGTC AGGAGTGAAG ATGAGATCGG GAGCCGACGC
TATATCGACA AGCCGTTTGC GAAAGTAGCC GCCATGGATG ACCTCTTTTC CGGCAAGGTT
CTTGACCGTC TTGAATCCAG AGTGCTGCCT CAGTATATAC GGGGTTGCAG ATGGTTTGGC
GGCAAGGCCC GCAAGATCGT CAGGGTCAGC GTCAACGATA GCATTCCTGT GCCAGCCTGT
CAAAACACGG TCTACCTGAT TGTCGAGGTA CGCTATCCGA GCGGCTCGAA CGATCTGTAT
CAGCTTCCGG TGACATTTCT GCCTACAGGA GAGTTCAATC CTGACGAAGA CTTTTTCATG
AAGCAGGTTA TCTGCAGTGT GAAGATCGGA GAGAACGAGG GGTATCTCTG CGATGCGACC
TATCAGAAGG AGTTCCATCG TTTCCTTCTC GACGTTATCA TCGCCGGAAA AGGCCTGAAG
GGGGGGATTT TCAAACTGAC AGCCGAAAAG GGCTCGACTC TGGAGGAGTA TCTGCCGCAG
GAAGAGGATG ATAGTATGAA CTCCGTGATT TTCGGTCTGG AGCAGAGCAA TACGTCGATC
ATGTATGATG ACAAGCTCTG TCTGAAGCTC TACCGCAAGA TCTCTTCAGG GATTTCTCCT
GAAGTTGAAA TCTGCCGCAC CTTGACTGAA AAGACATCGT TTGAGAGTTC TCCGGGCTAT
CTTGGAGCGC TTTACCTTTC CAGAAGCCGC AAGGATACCT CTTCTCTGGG CATCCTGCAG
AACTTTATCC CCAATGAGGG AGATGCCTGG AGCCAGACCC TGCACTATGT GCACCGTTAC
TATGAAGAGG TGCTTGTTCT GTTGCCGCAG CTCGAAGAGA TCCCGGAAAT TCCCCCGATA
GGAGGAGAGA CAGTCGAGAT GCCGGAGATC ATGCACGGGC TGATAGGTGA AATCTACCTC
GGGATGGTTA ACAAGCTTGC TGAGCGAACA GCAGAAATGC ATCTTTCTCT GGCCTCGCCG
GATCTTGGTC CTGATTTTCT GCCTGAAGCA TTTACCACGC TCTATCAGCG CTCCATATAC
CAGTCCATGC GTGAACAGGT GAAAAGAGGT ATGGTGATGC TCAAGGAGCA GATGAAAGGG
ATCGCGAAAG ATTATAAGGG AATCGCGGCT GATCTGCTTG GACGGGAGCA GGAGATACTG
GACCGGCTTT CACATATCAA AGCTCGCAGG ATCCCGGCAT CAAAGATCAG GATTCATGGT
GACTATCATC TCGGTCAGGT ACTCTGGACC GGTAAGGATT TTGTGATCAT TGACTTTGAA
GGCGAACCGG CACGCTCTAT CAGCGAGCGC AGGATCAAGC GTGCCGTGTT CCGTGATCTT
GCAGGAATGA TGCGGTCGTT CCATTACGCT GCCTTCAACG TCCTGATCCA GGATCGTTCT
ATAAGGCCTG AGGATGCTGA AAAGCTTGAG CCATGGGCGG AGTTGTGGAG TTTTTATACC
GGGCAGCATT TCTATGATGT GTATGCGGCC GCTGTTGGAG GACACGGTCT GATTCCTGAA
AATATTACAG AACAGCACCT TCTGCTTCGC TCCTATCTCA TGGACAAGGC TATTTATGAA
TTGAACTATG AGCTGAACAA CCGTCCTGAG TGGGTAGGCA TAGCCCTGAA GGGTCTGCAG
CGGCTGCTCG AATCCTGA
 
Protein sequence
MPRASASYQP EPLWYKDAII YEAHVKTFFD SNNDGVGDFE GLRQKLPYLE SLGITAIWLL 
PFYPSPLRDD GYDIADYMEV NPDYGTIEDF KAFLDDAHKL GLKVITELVI NHTSDQHAWF
QRARQAEPGS VERDFYMWSS DPKKYSGVRI IFQDFEASNW TWDPVAGEYY WHRFYHHQPD
LNFENPAVEK AIYKVLDYWL EMGVDGLRLD AVPYLYAEEG TNCENLPRTH KFLQRLRKHV
DGKFPNRMLL AEANQWPEDA AEYFGEGDEC HMNFHFPLMP RMYMALEMED RFPIIDILDQ
TPGIPEECQW ASFLRNHDEL TLEMVTDEER DYMRRVYAHD PKARINLGIR RRLAPLMSND
RRKIELMNIM LLSLPGTPVL YYGDEIGMGD NFYLGDRDGV RTPMQWNGDR NAGFSRANPQ
QLQLPVIIDP EYHYEGANVE VQESNINSLL WWTRHMLSTS RRYKALSRGD IIFIQSQNPQ
VLIFTRTYKD ETMLCIINLS RNAQAVTMDL SEYEGYIPEE VFSLSHFPGI SARPYTVTLG
PYGYFWFKLV RSEDEIGSRR YIDKPFAKVA AMDDLFSGKV LDRLESRVLP QYIRGCRWFG
GKARKIVRVS VNDSIPVPAC QNTVYLIVEV RYPSGSNDLY QLPVTFLPTG EFNPDEDFFM
KQVICSVKIG ENEGYLCDAT YQKEFHRFLL DVIIAGKGLK GGIFKLTAEK GSTLEEYLPQ
EEDDSMNSVI FGLEQSNTSI MYDDKLCLKL YRKISSGISP EVEICRTLTE KTSFESSPGY
LGALYLSRSR KDTSSLGILQ NFIPNEGDAW SQTLHYVHRY YEEVLVLLPQ LEEIPEIPPI
GGETVEMPEI MHGLIGEIYL GMVNKLAERT AEMHLSLASP DLGPDFLPEA FTTLYQRSIY
QSMREQVKRG MVMLKEQMKG IAKDYKGIAA DLLGREQEIL DRLSHIKARR IPASKIRIHG
DYHLGQVLWT GKDFVIIDFE GEPARSISER RIKRAVFRDL AGMMRSFHYA AFNVLIQDRS
IRPEDAEKLE PWAELWSFYT GQHFYDVYAA AVGGHGLIPE NITEQHLLLR SYLMDKAIYE
LNYELNNRPE WVGIALKGLQ RLLES