Gene Information |
Locus tag | Cpha266_2536 |
Symbol | |
ID | 4569726 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2908009 |
End bp | 2911305 |
Gene Length | 3297 bp |
Protein Length | 1098 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639767101 |
Product | trehalose synthase |
Protein accession | YP_912948 |
Protein GI | 119358304 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases; [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02456] trehalose synthase; [TIGR02457] trehalose synthase-fused probable maltokinase |
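The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal consistency check, not part of the record itself. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon; the function name `check_gene_record` is hypothetical.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check coordinate and length fields of a CDS record.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the protein length does not count the stop codon.
    """
    # Inclusive span covered by the gene on the replicon.
    span = end_bp - start_bp + 1
    # A CDS should consist of whole codons.
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        # One codon is the stop codon, so it is not translated.
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields above.
result = check_gene_record(2908009, 2911305, 3297, 1098)
print(result)
```

Under those assumptions, all three checks hold for this record: 2911305 - 2908009 + 1 = 3297 bp, and 3297 / 3 - 1 = 1098 aa.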
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.605311 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATCAAC CTGAAGCACT CTGGTACAAA GACGCAATTA TTTACGAAGC GCATGTCAAA ACTTTTTACG ACAGCAATAA TGATGGCATA GGCGATTTTC AGGGGCTGCG CCAGAAACTC GGTTATCTGG AAAGCCTTGG CATTACCGCT ATCTGGCTGT TGCCGTTCTA TCCCTCTCCG TTGCGCGATG ATGGTTATGA TATTGCCGAT TATATGAAGG TTAATCCCGA CTATGGCACG CTTGATGATT TCAGGGAGTT TCTTGAAGAG GCGCACAGCC GGGGTCTCAA GGTTATTACC GAGCTTGTGG TCAATCACAC TTCCGACCAG CATGCATGGT TTCAGCGTGC GCGCAGAGCG CCGGTCGGTT CGCCTGAACG CAACTTTTAT GTCTGGAGCG ACGATCCGAA CAAGTATTCC GAAACCCGCA TTATTTTTCA GGACTTCGAA GCGAGCAACT GGACGTATGA TCCGGTTGCC GGCCAGTATT TCTGGCACCG CTTTTACCAC CATCAGCCGG ATCTGAATTT TGAGAACCCC GAGGTGCACC AGGCGTTGTT TGATGTACTG GATTTCTGGC TTGGTATGGG CGTTGACGGT TTGCGACTTG ATGCTGTGCC ATATCTTTAT GAGGCCGAGG GCACCAATTG CGAAAATCTT CCGCAGACAT ACGAGTACCT GAAATCGTTG CGTTCCTATG TGGATGAGCA CTATCCCAAC AGGATGCTGC TTGCCGAAGC CAATCAGTGG CCGGAGGATG CTGCGGCATA TTTCGGCAAT GGGGATCAGT GTCATATGAA TTTCCACTTT CCTCTTATGC CGAGGATGTA TATGGCTCTG GCAACCGAAG ACCGCTTTCC TCTTCTTGAC ATTCTCGACC AGACACCTGA TATTCCGGAA AATTGCCAGT GGGCCTCCTT TCTTCGCAAT CATGACGAGC TGACTCTTGA AATGGTGACG GACGAAGAGC GGGACTATAT GCGCAGGGTC TATGCCAATG ATTCCAAGGC TCGCATTAAT CTTGGTATCC GCCGAAGGCT GGCGCCGCTT ATGACCAATG ATCGCCGGAA GATCGAGCTG ATGAACATCA TGCTTCTGTC GCTGCCGGGA ACGCCGGTGC TTTATTATGG CGATGAGATT GGTATGGGCG ATAACTATTA TCTCGGCGAT CGGGATGGAG TACGTACCCC CATGCAGTGG AACGCCGATC GCAATGCAGG GTTTTCGCGC GCCAATCCCC AGAAACTTCT GCTTCCGGTG ATTATCGATC CCGAGTACCA TTATGAAGCA ATCAATGTTG AGGTACAGGA GGGCAATACC AACTCGCTGC TCTGGTGGAT GCGCCATACT ATTGCGACGG CAAGAAAATA CAAGGCGCTT AGTCGCGGAA CTATCGAGTT TCTTCAGGTC AGCAATCCCA AGGTGCTGAT TTTTACCCGC ACGTTTGAGG ATGAAACGGT TCTGACCGTG ATCAATCTTT CACGAAACGC GCAGGCGGTC ATGATCGATC TGGCTGCTTA TGCCGATTGT ATCCCGGAAG AGGTGTTCAG TCTTAACCGG TTTCCGAAAA TCAGGAAAAC TCCCTATATG GTGGCGCTCG GGGCGTATGG TTATTTCTGG CTTAAACTGG TCAAAGAGAG CGATGAGTCG GAAGGAAGGC CTTTTCTTGA AGAGCCGTAT GCGTCGGTTA CGCGCTGGAC AAACCTGTTT GCTTCAAAAA ATCGCGAAAA GCTCGAGATC GATATTCTTC CGAAATATTT TATGAGCAGT CGCTGGTTCG GCGGCAAGGC TCGCACGATT 
ATACGGGTAG TGATTGCTGA TACGATTCCG GTGGAGGGGA TGGAGAATGC GAAAATGCTT CTTGCCGAGG TTCGTTATTC GAGCGGCGAA AATGAGCTTT ATCAGTTGCC GGTCTGCTTC GTTCCCGACG CAATGATCAG TCGGCATGAT GATAATTTTT ACAAGCGTGC CATTGCTCGG GCTGTGCTTG GCGACGAAAA GGGCTTTTTG TGCGATGCTA CTTATGAAAC GGCTTTTCTC AATCAGCTTT TTCGTCTTGT TATGGGTCAG CATCCATGGC ACGGCAAGTC GGGTCAGATT TCAGGTTGCA AGGGCGAAAA AGCTGATGTT CTTTGCAGCG AGGAGTGTCA CTCTTCTCCT GAACCGGCAC TTCTTGGCAA CGAACAGAGC AATACCTCAA TTCGGTATCG CGACAGGCTT TGTTTGAAAC TCTATCGCAG AATTGAAATC GGGGTTTCTC CGGAAATCGA TATGTGCAGG GCGCTGACAG AAAAAACCGG GTTCAGTAAT CTTCCCGAAT ATCTCGGATC GCTTGATTAT GCTCAGAATC GGGCGAGCCA CTATTCTCTG GGTATTCTTC AGAGTTATGT GCAGAATGAA GGCAATGCCT GGAATATATC GCTTGATTAT GCTCAGCGGT ATTATGAGGA TGTTCTTTCT AAAATCCAGA GCGTTTCCGA GTTGCCTGCG TTGCCGGCAT TAGGCGGGAA TCCCGTGCCG CTTCCTTCTA TTATGCTCGA ACTGATCGGA GGGCCCTATC TCGGACTGGT CGAAAAGCTT GGCGAAAGAA CTGCCGAGAT GCACATCGCT CTGGCTTCGA TTAACGATGA TCCTCAGTTT GCCCCTGAAC CGTTTACCTC ACTGTATCAG CGTTCCATCT ATCAGGCGAT GTGCGAACAG GTCAAACGCA GTATGATGCT GCTTCGACTG ATCAAGGGGA ATATTCCTGA AGAGCAGCAG GAGTTGGTTT CAAGGCTGCT GATGAGCGAG GAGGATATTC TTCTCCAGTT CGATCCTATC CGTCGTGAAA AGATTGATGC TGTTAAAATA AGGATTCATG GCGACTATCA TCTTGGTCAG GTGCTCTATA CCGGGCGGGA TGTGGTCGTT ATCGATTTTG AAGGGGAGCC TGCCAGGCCG GTTTCGGAAA GAAAAATCAA GCGCTCCGCT TTTCGCGATG TTGCCGGTAT GATGCGCTCA TTCGACTATG CCGCATTTAA TGCGCTTCTT CTTAACCCGG TGATCAGGCC TGAAGATCGG CATCGTCTTG AACCGTGGGC TGATCTTTGG AGTTTTTATG TGTCGCAGCA CTTTATCGAT GCTTATTTCA ATGCGGTGAA AGGTCACGAT ATTATTCCTG AAGAACCCGA TCAGCGGGAG CACCTGCTGC GCGGTTATCT GATGAACAAG GCGATCTACG AGCTTAATTA TGAACTGAAT AATCGTCCCG AATGGGCGAT GATTCCGCTC AGAGGGATTT TGAGGATGCT TGAGTGA
|
Protein sequence | MYQPEALWYK DAIIYEAHVK TFYDSNNDGI GDFQGLRQKL GYLESLGITA IWLLPFYPSP LRDDGYDIAD YMKVNPDYGT LDDFREFLEE AHSRGLKVIT ELVVNHTSDQ HAWFQRARRA PVGSPERNFY VWSDDPNKYS ETRIIFQDFE ASNWTYDPVA GQYFWHRFYH HQPDLNFENP EVHQALFDVL DFWLGMGVDG LRLDAVPYLY EAEGTNCENL PQTYEYLKSL RSYVDEHYPN RMLLAEANQW PEDAAAYFGN GDQCHMNFHF PLMPRMYMAL ATEDRFPLLD ILDQTPDIPE NCQWASFLRN HDELTLEMVT DEERDYMRRV YANDSKARIN LGIRRRLAPL MTNDRRKIEL MNIMLLSLPG TPVLYYGDEI GMGDNYYLGD RDGVRTPMQW NADRNAGFSR ANPQKLLLPV IIDPEYHYEA INVEVQEGNT NSLLWWMRHT IATARKYKAL SRGTIEFLQV SNPKVLIFTR TFEDETVLTV INLSRNAQAV MIDLAAYADC IPEEVFSLNR FPKIRKTPYM VALGAYGYFW LKLVKESDES EGRPFLEEPY ASVTRWTNLF ASKNREKLEI DILPKYFMSS RWFGGKARTI IRVVIADTIP VEGMENAKML LAEVRYSSGE NELYQLPVCF VPDAMISRHD DNFYKRAIAR AVLGDEKGFL CDATYETAFL NQLFRLVMGQ HPWHGKSGQI SGCKGEKADV LCSEECHSSP EPALLGNEQS NTSIRYRDRL CLKLYRRIEI GVSPEIDMCR ALTEKTGFSN LPEYLGSLDY AQNRASHYSL GILQSYVQNE GNAWNISLDY AQRYYEDVLS KIQSVSELPA LPALGGNPVP LPSIMLELIG GPYLGLVEKL GERTAEMHIA LASINDDPQF APEPFTSLYQ RSIYQAMCEQ VKRSMMLLRL IKGNIPEEQQ ELVSRLLMSE EDILLQFDPI RREKIDAVKI RIHGDYHLGQ VLYTGRDVVV IDFEGEPARP VSERKIKRSA FRDVAGMMRS FDYAAFNALL LNPVIRPEDR HRLEPWADLW SFYVSQHFID AYFNAVKGHD IIPEEPDQRE HLLRGYLMNK AIYELNYELN NRPEWAMIPL RGILRMLE
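The relationship between the two sequences above can be illustrated by translating the first block of the gene sequence. The sketch below is illustrative only. It assumes the gene sequence is shown on the coding strand (it begins with ATG even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for sense codons. The names `CODON_TABLE` and `translate` are hypothetical helpers.

```python
from itertools import product

# Standard genetic code in TCAG order. Translation table 11 uses the same
# amino acid assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Drop the spaces used to group the sequence into blocks of ten.
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above (30 bases, 10 codons).
prefix = translate("ATGTATCAAC CTGAAGCACT CTGGTACAAA")
print(prefix)  # MYQPEALWYK, the first block of the protein sequence
```

The same function applied to the final codons of the gene sequence (ATG CTT GAG TGA) yields MLE, matching the end of the protein sequence, with TGA as the stop codon.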