Gene Information |
Locus tag | Plut_2029 |
Symbol | |
ID | 3746076 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium luteolum DSM 273 |
Kingdom | Bacteria |
Replicon accession | NC_007512 |
Strand | - |
Start bp | 2256153 |
End bp | 2259455 |
Gene Length | 3303 bp |
Protein Length | 1100 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637770060 |
Product | alpha amylase domain-containing protein |
Protein accession | YP_375914 |
Protein GI | 78187871 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases; [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02456] trehalose synthase; [TIGR02457] trehalose synthase-fused probable maltokinase |
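The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon, which is consistent with the listed values.

```python
# Values copied from the Gene Information table.
start_bp = 2256153
end_bp = 2259455

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS with one terminal stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the results agree with the listed Gene Length (3303 bp) and Protein Length (1100 aa).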
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.269645 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATCAAC CTGAACCGCT CTGGTACAAA GACGCCATCA TCTATGAGGC GCATGTCAAG ACGTTCTACG ACAGCAACAA TGACGGCATC GGCGATTTTC AGGGGCTCCG TCGCAAGCTT CCCCATCTGG AGAGCCTCGG TGTAACCGCC ATATGGCTGC TTCCCTTCTA CCCCTCTCCG CTGCGCGATG ACGGATACGA CATTGCCGAT TACATGACGG TCAACCCCGA CTACGGGACG CTGGACGATT TCAGGGAGTT CCTCGAAGAG GCCCACAGCC GCGGGCTGAA GGTCATCACC GAGCTGGTGG TCAACCATAC CTCAGACCAG CACGCATGGT TCCAGCAGGC ACGCAAGGCG CCGGCAGGTT CCCCGGAGCG GAACTTCTAT GTGTGGAGCG ACGATCCCAA CAAGTATTCC GAAACCCGCA TCATCTTTCA GGACTTCGAG GCCTCCAACT GGACATGGGA TCCTGTTGCC GGCCAGTATT ACTGGCACCG GTTCTTCCAC CACCAGCCGG ACTTGAACTT CGAGAACCCC GATGTCCAGC AGGCGCTCTT CGACGTGCTC GATTTCTGGC TCGGCATGGG TGTCGACGGC CTCCGGCTCG ACGCCGTCCC CTATCTCTAC GAGGCTGAGG GCACCAACTG CGAAAACCTT CCCGAGACCT ACGAGTTCCT GAAGAAGCTC CGCCGCCATG TCGATGAGCA TTTCCCGAAC CGCATGCTGC TTGCCGAGGC CAACCAGTGG CCCGAGGATT CCGCAGCCTA CTTCGGCAAT GGCGACCAGT GCCACATGAA CTTCCACTTC CCCTTGATGC CGAGGATGTA CATGGCCCTT GCCACCGAGG ACCGGTTCCC CATCATCGAC ATTCTCGAGC AGACCCCGGA AATCCCGGAG GGCTGCCAGT GGGCATCGTT CCTGCGCAAC CATGACGAGC TGACCCTCGA AATGGTGACC GACGAGGAGC GCGACTACAT GCGCCGCGTC TACGCCAACG ACCCCAAGGC GCGCATCAAC CTCGGCATCC GCCGCCGCCT GGCTCCGCTC ATGACCAACG ATCGCCGGCG CATCGAGCTC ATGAACATCA TGCTCCTTTC GCTTCCCGGC ACCCCGGTGC TCTACTACGG CGACGAAATC GGCATGGGCG ACAACTACTA CCTCGGTGAT CGCGACGGTG TGCGCACCCC CATGCAGTGG AACAGCGACC GCAATGCCGG GTTCTCCCGC GCCAACCCCC AGCAGCTGCT GCTGCCTGTC ATCATCGATC CCGAGTACCA CTACGAGGCG GTGAACGTCG AGGTGCAGGA GAGCAACGTC AACTCGCTGC TCTGGTGGAC CCGCCACGCC ATCGCCACCG CGCGTCGCTA CAAGTCGCTC AGCCGCGGCT CGATAGAGTT CCTGCAGGTC AGCAATCCCA AGGTGCTCAT CTTCATCCGC CGTTACGAAG ACGAGACCAT CCTGTCGGTC ATAAACCTTT CGCGCAACGC ACAGGCGGTC TCCGTCGACC TCTCCGCCTT CGAAGGCTAC ACTCCCGAAG AGGTGTTCAG CATGAACCGC TTCCCGAAAA TCCGTCAGGC GCCCTATATG CTCTCGCTCG GCTCCTATGG GTATTTCTGG CTGAAGATGG TGCCGGATAC CGCCGATGTA TCCTGCCGTC CCGGTCTGGA AACGGCGTTC GCCGAAGTGT ATAACTGGCC TGCGCTCTTT GCCGGCAGAA GCCGAGAGAA GCTTGAAAAC TCGGTCCTGC CCTGCTACTA CCAGGCTATG CGCTGGTTCG GCGGCAAGGC GCGCAATGTC 
CTTCGCATTT CGATCGTCGA CACCATTCCG GTTGAGGGTA TGGAGCATGC CAAGCTTCTC GTCACCGAAG TCCGCTACTC CAGCGGAGAG AACGAACTCT ATCAGCTGCC TGTCAGCTTC GTTCCACTGG AGGAGGTATC GCCTTCCGAC GACAACTTCT ACCGGAGGGT CATCGGCCGC GCCGTTGTCG GGGAAGTGGA CGGGTACCTG TGCGACGCTA CCTACGAGAA CGCGTTCCTC AGCCGCCTCT TCAGCATCAT GGTCGGCGGT GAGCAGTGGA AAGGAAAGGC GGGTATTGTT TCCGGCATGA AAGGATCCGC TCTCGAAGTG TTCCGCGACG TGACGGACGG GTCGCAGCCC GAACCCTTCC TCATGGGGGT CGAGCAGACC AACACCTCAA TCCGCTTCCA CGATTCGCTC TGTTTGAAGC TCTACCGCCG GATCGAGAAG GGCGTTTCTC CCGAAATCGA GATGTGCGGC GCCCTGACGG AAAAAACCGG TTTTGCAAGC CTCCCGAAAT ACCTCGGTTC GCTCAACTAC GATCAGGGCC GCAATGCCGG TTACTCCATC GGCATCCTGC AGAATTTCGT GGAGAATGAA GGTGATGCAT GGCAGCTCTC CCTCGGCCAG GTTGCCCGTT ACTTCGGAGA CGTACAGGCC AAGGTTGGCG CCGGCATCGA GCTTCCGTCC ATGCAGCAGC TCAGCGGCAG CCCCGTCGAG GTCCCTGAGC TCATGCATGA GCTTATCGGT GGAGCATATC TCGGCATGAT CGAGAAGCTG GCCGAGCGTA CCGCTGAAAT GCACCTCTCG CTGGCCTCGC TTGACGCCGA CCCTGCGTTC GCCCCGGAGC CCTTCACCAC CCTCTACCAG CGCTCAATCT ATCAGGCGAT GTGCGAGCAG GTAAAGCGTG CCATGATCCT CATCGGGGAG CTTATGCCTT CGATGGAGCC GGAACAGCGT GAGCTCTGCT CGCTGCTCGT CAGGAACCAG AAGGACATCC TGCAGCAGTT CGAGCCGGTC CGCCAGGAGA AGATCGATAC CCTGAAAATC AGGATCCACG GCGACTATCA CCTCGGCCAG GTGCTCTTCA CCGGCAAGGA CTTCGTCATC ATCGACTTCG AGGGCGAGCC GGCCCGTCCG ATTTCCGAAC GCAAGATCAA GCGCTCGGTC TTCCGCGATA TCTCCGGCAT GCTCCGTTCA TTCGACTATG CGGCCTTCCA TGTGCTCCAC CTCAACGAAT CGGTTGTTCG CGCTGAAGAC CGTCACCAGA TGGAACCCTG GGCCGACCGC TGGAGCAATG CCGTCGGGCA GCACTTCCTC GACTCCTACT TCAAGAGCAC CGAGGGCAGT GCCATCGTCC CTGAGGATCC CCGTCAGCGG GAACACCTGA TGAACGCCTA CCTTATGAAC AAGGCGGTCT ACGAGCTCAA CTACGAGCTC AACAACCGCC CGCAGTGGGT TGGTATCCCG ATCAGGGGCA TTCTGAAGAT GCTCGATATG TAA
|
Protein sequence | MYQPEPLWYK DAIIYEAHVK TFYDSNNDGI GDFQGLRRKL PHLESLGVTA IWLLPFYPSP LRDDGYDIAD YMTVNPDYGT LDDFREFLEE AHSRGLKVIT ELVVNHTSDQ HAWFQQARKA PAGSPERNFY VWSDDPNKYS ETRIIFQDFE ASNWTWDPVA GQYYWHRFFH HQPDLNFENP DVQQALFDVL DFWLGMGVDG LRLDAVPYLY EAEGTNCENL PETYEFLKKL RRHVDEHFPN RMLLAEANQW PEDSAAYFGN GDQCHMNFHF PLMPRMYMAL ATEDRFPIID ILEQTPEIPE GCQWASFLRN HDELTLEMVT DEERDYMRRV YANDPKARIN LGIRRRLAPL MTNDRRRIEL MNIMLLSLPG TPVLYYGDEI GMGDNYYLGD RDGVRTPMQW NSDRNAGFSR ANPQQLLLPV IIDPEYHYEA VNVEVQESNV NSLLWWTRHA IATARRYKSL SRGSIEFLQV SNPKVLIFIR RYEDETILSV INLSRNAQAV SVDLSAFEGY TPEEVFSMNR FPKIRQAPYM LSLGSYGYFW LKMVPDTADV SCRPGLETAF AEVYNWPALF AGRSREKLEN SVLPCYYQAM RWFGGKARNV LRISIVDTIP VEGMEHAKLL VTEVRYSSGE NELYQLPVSF VPLEEVSPSD DNFYRRVIGR AVVGEVDGYL CDATYENAFL SRLFSIMVGG EQWKGKAGIV SGMKGSALEV FRDVTDGSQP EPFLMGVEQT NTSIRFHDSL CLKLYRRIEK GVSPEIEMCG ALTEKTGFAS LPKYLGSLNY DQGRNAGYSI GILQNFVENE GDAWQLSLGQ VARYFGDVQA KVGAGIELPS MQQLSGSPVE VPELMHELIG GAYLGMIEKL AERTAEMHLS LASLDADPAF APEPFTTLYQ RSIYQAMCEQ VKRAMILIGE LMPSMEPEQR ELCSLLVRNQ KDILQQFEPV RQEKIDTLKI RIHGDYHLGQ VLFTGKDFVI IDFEGEPARP ISERKIKRSV FRDISGMLRS FDYAAFHVLH LNESVVRAED RHQMEPWADR WSNAVGQHFL DSYFKSTEGS AIVPEDPRQR EHLMNAYLMN KAVYELNYEL NNRPQWVGIP IRGILKMLDM
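The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is a minimal example, not the method used to produce this record: the names `CODON_TABLE` and `translate` are hypothetical, and it relies on translation table 11 sharing its amino-acid assignments with the standard code (the tables differ in permitted start codons, which this sketch does not handle). Only the first 30 bases of the gene sequence are used.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown in this record.
prefix = "ATGTATCAAC CTGAACCGCT CTGGTACAAA"
print(translate(prefix))
```

The translated prefix can be compared with the first ten residues of the protein sequence above (MYQPEPLWYK).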