Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0240 |
Symbol | |
ID | 6373895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 230044 |
End bp | 233361 |
Gene Length | 3318 bp |
Protein Length | 1105 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642682754 |
Product | trehalose synthase |
Protein accession | YP_001958690 |
Protein GI | 189499220 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02456] trehalose synthase [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCGAG CATCTGCATC CTATCAGCCT GAACCGCTCT GGTACAAGGA CGCCATCATT TATGAGGCAC ATGTAAAGAC TTTTTTTGAC AGTAACAATG ACGGTGTCGG TGATTTTGAA GGGTTGCGCC AGAAGCTGCC CTATCTGGAA AGTCTCGGTA TAACCGCAAT CTGGCTGCTT CCTTTTTATC CTTCACCCCT GAGAGATGAC GGCTACGATA TTGCCGATTA CATGGAGGTC AATCCTGACT ACGGTACCAT CGAGGATTTC AAAGCCTTTC TCGATGACGC GCATAAGCTC GGACTGAAGG TGATTACCGA GCTTGTCATC AACCATACGT CCGATCAGCA CGCATGGTTC CAGAGGGCCA GACAGGCAGA GCCGGGATCG GTCGAACGGG ATTTTTACAT GTGGAGCAGT GATCCCAAGA AATACTCCGG CGTCCGCATC ATTTTCCAGG ATTTCGAGGC CTCGAACTGG ACATGGGACC CTGTCGCCGG AGAGTATTAC TGGCATCGGT TCTATCATCA TCAGCCTGAT CTGAACTTCG AAAATCCCGC GGTTGAAAAA GCCATTTACA AGGTGCTTGA TTACTGGCTG GAGATGGGCG TCGACGGGTT GCGGCTCGAC GCGGTTCCCT ATCTCTATGC AGAGGAAGGA ACCAACTGTG AGAACCTTCC CCGCACGCAC AAGTTTCTGC AGAGGCTGCG CAAGCATGTA GACGGCAAGT TCCCGAACCG TATGCTTCTT GCCGAGGCAA ATCAGTGGCC GGAAGATGCC GCCGAGTATT TCGGCGAAGG TGACGAATGT CATATGAATT TCCATTTCCC TCTGATGCCG AGGATGTACA TGGCGCTGGA AATGGAAGAT CGTTTTCCGA TCATAGATAT TCTCGACCAG ACACCCGGGA TTCCGGAAGA GTGCCAGTGG GCTTCCTTCC TTCGCAATCA TGATGAACTT ACTCTTGAGA TGGTGACCGA TGAGGAGCGT GACTATATGC GCCGGGTGTA CGCGCATGAT CCGAAGGCGC GCATCAATCT CGGTATACGC CGCCGTCTGG CGCCGCTTAT GTCGAATGAC CGCAGAAAAA TCGAGCTGAT GAACATCATG CTCCTTTCCC TGCCCGGCAC TCCGGTTCTC TACTACGGTG ACGAGATAGG TATGGGCGAT AATTTTTACC TCGGTGATCG TGACGGCGTT CGTACTCCCA TGCAGTGGAA CGGTGACCGG AATGCAGGTT TTTCCAGAGC CAATCCCCAG CAGTTGCAGC TGCCGGTGAT CATTGATCCG GAATACCATT ACGAGGGAGC CAATGTCGAG GTACAGGAAA GCAACATCAA TTCGCTGCTC TGGTGGACAC GCCACATGCT CTCCACCTCC CGCAGGTACA AAGCCCTCAG TCGCGGGGAT ATTATCTTTA TTCAGTCTCA GAATCCTCAG GTCCTGATTT TTACCAGGAC ATACAAGGAT GAGACCATGC TGTGCATCAT CAACCTGTCG CGTAACGCAC AGGCGGTCAC CATGGATCTG TCGGAATACG AAGGATATAT TCCTGAAGAG GTGTTCAGCC TCAGTCATTT TCCCGGGATC TCTGCAAGGC CGTATACGGT TACGCTGGGT CCTTACGGAT ATTTCTGGTT CAAGCTTGTC AGGAGTGAAG ATGAGATCGG GAGCCGACGC TATATCGACA AGCCGTTTGC GAAAGTAGCC GCCATGGATG ACCTCTTTTC CGGCAAGGTT CTTGACCGTC TTGAATCCAG AGTGCTGCCT CAGTATATAC GGGGTTGCAG ATGGTTTGGC GGCAAGGCCC GCAAGATCGT CAGGGTCAGC GTCAACGATA GCATTCCTGT GCCAGCCTGT CAAAACACGG TCTACCTGAT TGTCGAGGTA CGCTATCCGA GCGGCTCGAA CGATCTGTAT CAGCTTCCGG TGACATTTCT GCCTACAGGA GAGTTCAATC CTGACGAAGA CTTTTTCATG AAGCAGGTTA TCTGCAGTGT GAAGATCGGA GAGAACGAGG GGTATCTCTG CGATGCGACC TATCAGAAGG AGTTCCATCG TTTCCTTCTC GACGTTATCA TCGCCGGAAA AGGCCTGAAG GGGGGGATTT TCAAACTGAC AGCCGAAAAG GGCTCGACTC TGGAGGAGTA TCTGCCGCAG GAAGAGGATG ATAGTATGAA CTCCGTGATT TTCGGTCTGG AGCAGAGCAA TACGTCGATC ATGTATGATG ACAAGCTCTG TCTGAAGCTC TACCGCAAGA TCTCTTCAGG GATTTCTCCT GAAGTTGAAA TCTGCCGCAC CTTGACTGAA AAGACATCGT TTGAGAGTTC TCCGGGCTAT CTTGGAGCGC TTTACCTTTC CAGAAGCCGC AAGGATACCT CTTCTCTGGG CATCCTGCAG AACTTTATCC CCAATGAGGG AGATGCCTGG AGCCAGACCC TGCACTATGT GCACCGTTAC TATGAAGAGG TGCTTGTTCT GTTGCCGCAG CTCGAAGAGA TCCCGGAAAT TCCCCCGATA GGAGGAGAGA CAGTCGAGAT GCCGGAGATC ATGCACGGGC TGATAGGTGA AATCTACCTC GGGATGGTTA ACAAGCTTGC TGAGCGAACA GCAGAAATGC ATCTTTCTCT GGCCTCGCCG GATCTTGGTC CTGATTTTCT GCCTGAAGCA TTTACCACGC TCTATCAGCG CTCCATATAC CAGTCCATGC GTGAACAGGT GAAAAGAGGT ATGGTGATGC TCAAGGAGCA GATGAAAGGG ATCGCGAAAG ATTATAAGGG AATCGCGGCT GATCTGCTTG GACGGGAGCA GGAGATACTG GACCGGCTTT CACATATCAA AGCTCGCAGG ATCCCGGCAT CAAAGATCAG GATTCATGGT GACTATCATC TCGGTCAGGT ACTCTGGACC GGTAAGGATT TTGTGATCAT TGACTTTGAA GGCGAACCGG CACGCTCTAT CAGCGAGCGC AGGATCAAGC GTGCCGTGTT CCGTGATCTT GCAGGAATGA TGCGGTCGTT CCATTACGCT GCCTTCAACG TCCTGATCCA GGATCGTTCT ATAAGGCCTG AGGATGCTGA AAAGCTTGAG CCATGGGCGG AGTTGTGGAG TTTTTATACC GGGCAGCATT TCTATGATGT GTATGCGGCC GCTGTTGGAG GACACGGTCT GATTCCTGAA AATATTACAG AACAGCACCT TCTGCTTCGC TCCTATCTCA TGGACAAGGC TATTTATGAA TTGAACTATG AGCTGAACAA CCGTCCTGAG TGGGTAGGCA TAGCCCTGAA GGGTCTGCAG CGGCTGCTCG AATCCTGA
|
Protein sequence | MPRASASYQP EPLWYKDAII YEAHVKTFFD SNNDGVGDFE GLRQKLPYLE SLGITAIWLL PFYPSPLRDD GYDIADYMEV NPDYGTIEDF KAFLDDAHKL GLKVITELVI NHTSDQHAWF QRARQAEPGS VERDFYMWSS DPKKYSGVRI IFQDFEASNW TWDPVAGEYY WHRFYHHQPD LNFENPAVEK AIYKVLDYWL EMGVDGLRLD AVPYLYAEEG TNCENLPRTH KFLQRLRKHV DGKFPNRMLL AEANQWPEDA AEYFGEGDEC HMNFHFPLMP RMYMALEMED RFPIIDILDQ TPGIPEECQW ASFLRNHDEL TLEMVTDEER DYMRRVYAHD PKARINLGIR RRLAPLMSND RRKIELMNIM LLSLPGTPVL YYGDEIGMGD NFYLGDRDGV RTPMQWNGDR NAGFSRANPQ QLQLPVIIDP EYHYEGANVE VQESNINSLL WWTRHMLSTS RRYKALSRGD IIFIQSQNPQ VLIFTRTYKD ETMLCIINLS RNAQAVTMDL SEYEGYIPEE VFSLSHFPGI SARPYTVTLG PYGYFWFKLV RSEDEIGSRR YIDKPFAKVA AMDDLFSGKV LDRLESRVLP QYIRGCRWFG GKARKIVRVS VNDSIPVPAC QNTVYLIVEV RYPSGSNDLY QLPVTFLPTG EFNPDEDFFM KQVICSVKIG ENEGYLCDAT YQKEFHRFLL DVIIAGKGLK GGIFKLTAEK GSTLEEYLPQ EEDDSMNSVI FGLEQSNTSI MYDDKLCLKL YRKISSGISP EVEICRTLTE KTSFESSPGY LGALYLSRSR KDTSSLGILQ NFIPNEGDAW SQTLHYVHRY YEEVLVLLPQ LEEIPEIPPI GGETVEMPEI MHGLIGEIYL GMVNKLAERT AEMHLSLASP DLGPDFLPEA FTTLYQRSIY QSMREQVKRG MVMLKEQMKG IAKDYKGIAA DLLGREQEIL DRLSHIKARR IPASKIRIHG DYHLGQVLWT GKDFVIIDFE GEPARSISER RIKRAVFRDL AGMMRSFHYA AFNVLIQDRS IRPEDAEKLE PWAELWSFYT GQHFYDVYAA AVGGHGLIPE NITEQHLLLR SYLMDKAIYE LNYELNNRPE WVGIALKGLQ RLLES
|
| |