Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1983 |
Symbol | |
ID | 3747362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 2515296 |
End bp | 2518592 |
Gene Length | 3297 bp |
Protein Length | 1098 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637774520 |
Product | alpha amylase domain-containing protein |
Protein accession | YP_380274 |
Protein GI | 78189936 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02456] trehalose synthase [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATCAAC CCGAACCCCT CTGGTACAAA GACGCCATTA TTTACGAGGC GCACGTTAAA ACCTTTTACG ATAGCGATAA TGATGGCATT GGCGATTTTC AAGGATTGCG CCAAAAGCTT GGTTACTTGC AAAGTCTTGG TATTACGGCA ATTTGGTTGC TTCCCTTTTA TCCCTCGCCA CTGCGTGATG ATGGATACGA TATTGCTGAT TACATGACGG TTAACCCCGA TTATGGCACT ATGGATGATT TTCGTGCCTT TCTTGAGGAG GCGCATTCAT TGGGCTTAAA GGTGATTACC GAGTTGGTTG TAAACCACAC CTCCGACCAA CATGCGTGGT TCCAGCGTGC GCGCCATGCA CCAAAGGATT CGCCTGAGCG CAATTTTTAT GTGTGGAGCG ATGATCCCAA CAAATATTCC GAAACGCGCA TCATTTTTCA AGATTTTGAA GCCTCTAACT GGACGTATGA TTCCGTTGCA GGGCAATATT ATTGGCATCG CTTTTACCAC CATCAGCCCG ATTTAAATTT TGAAAATCCT GCGGTTCATG CTGCCTTGCT CCATGTGCTT GACTTCTGGC TTGGCATGGG CGTTGATGGG CTTCGTCTCG ATGCGGTGCC TTACCTGTAT GAGGAAGAGG GCACCAATTG CGAAAATCTC CCCAGAACCT ATCAGTACCT GCGTGATTTA CGCTCTTATA TTGATGAAAA ATATCCCAAC CGCATGTTGC TTGCCGAAGC CAATCAGTGG CCCGAAGATT CGGCAGCATA TTTAGGCAAT GGCGATATGT GCCATATGAA CTTCCACTTC CCGCTCATGC CACGTATGTA CATGGCGTTA GCAACGGAAG ATCGCTTCCC TATTCTTGAT ATTCTTGAGC AGACGCCCGA AATTCCAGAA AGTTGCCAAT GGGCATCTTT TTTGCGTAAC CACGATGAGC TAACGCTTGA AATGGTAACC GACGAAGAGC GCGACTACAT GCGCCGTGTG TATGCCAATG ATCCTCGTGC CCGCATTAAC CTTGGTATTC GTCGCCGTCT TGCGCCGCTC ATGTCGAATG ATCGCCGCAA AATTGAGCTG ATGAACATTA TGCTGCTCTC TTTGCCCGGC ACCCCTGTGC TTTACTACGG CGATGAAATT GGTATGGGTG ATAACTTCTA CCTTGGCGAT CGTGATGGCG TGCGTACCCC AATGCAGTGG AATGCTGACC GCAACGCTGG CTTTTCGCGT GCTAATCCGC AACGCTTGCA ACTGCCTGTT ATTATTGATC CCGAATACCA TTACGAAGCC GTAAACGTGG AGGTGCAAGA GAGCAACATC CATTCGCTTT TGTGGTGGAT GCGCCAAACT ATTTCCACAG CTCATCGTCA TAAAGCTTTT AGCCGTGGCA CTATTGAGTT CCTACCCGTC AAAAATTCTA AGGTACTCTC TTTTATTCGT CAATATGAAG ACGAAACCAT GCTCTGCGTT ATTAACCTTT CGAAAAATGC ACAGGCTGTA ACAGTTGATC TTTCTCGTTT TAATGGTTAC ACGCCCGAAG AGGTTTTTAG CTTAAACCGT TTCCCCAAAA TTAGAAGCAC GCCTTACATG TTGGCGCTTG GCGCTTATGG ATACTTCTGG CTCAAATTAA TTAAGGAGGA AAAAGAGGTT GATCGCCATG CGTTGCTTGA TGGCTCTGTA GTAAGCGTGA ACCGTTGGCA ATCACTCTTT ATTGGGAAAA ATCGCGAAAA GCTTGAAACG GCTGTTTTTT CAAGCTATTA CATGGCAGCA CGCTGGTTTG GAGGCAAAGC ACGCACCATC ATCCGCATCT CAATTACCGA TACCATTCCT ATTGCGAATG TAGCTAATAC CAAGTTGTTA GTAACTGAAG TGCGCTATTC AAGTGGTGAA AATGAGAACT ATCAGTTGCC AGTAACCTTT GTACCGCTTG CAAACCTTCA GCCATCCGAT GAGTACTTTA GCAAGCAAGT TATTGCTCGC ATAACTGTTG GCGATGAAGA GGGTTACCTT TGCGATGCTA CCTTTACACC TGCATTTTTG CAAGAGCTGT ATAGCGTTGC AACCGCTAAA GGCTCATGGC AAGGCAAACA AGGCGTGGTA AACGGTAGTT CGGCTCCAAA GCTTGCCGCT TTTCTTGCCA ATGTAGCTGA TGCAGCGCCC GAGCTGATGG GTGCAGAGCA AAGCAACACC TCAATTCGCT ATGCCGATAA TCTTTGCTTA AAGCTTTATC GCCGCATTGA ATCGGGTGTT TCGCCTGAGG TTGAAATGTG CAGCGCCTTG AGTGAGCGCA CAAGTTTTAC CAATTTGCCA ACCTATCTTG GCACAGTGAA CTATAGCCGC AGCCGTAGCA GCCGCTGTTC CATTGGCATT TTACAAACCT ACGTGCCAAA CCAAGGCGAC GCATGGCAAC TTTCGCTTGA CCAAGCACGC CGCTACTTCG ATGCCATCCA TTCAGCCTTA CCAAATGCGC TTGCCATGCC AGCTTTACCT GCATTAAGTG GCAATCCAGC TCCACTGCCC GAATTAATGC AAGAGCTTAT TGGGGGGCAT TATCTTGGTA TGATTGAAAA GCTTGCCGAG CGCACTGCCG AAATGCACCT TGCCCTTGCA ACGCTCGAAA GCGATCCCGC TTTTGCGCCC GAAGCCTTTA CTTCGCTTTA TCAGCGCTCC ATTTACCAAG CCATGTGCGA ACAAGTAAAG CGTTCGGTTA TTTTAATTCG TGAGCTACTT CCATCCCTAA ACGGAGAGCA GCAAACGCTT GCTACACAGT TCGTGCAAAA GCAAAAGCAA ATTCTGCAAC AGTTTGATCC TATTCGTACC GAAAAAATTG AAGCCCTAAA AATTCGCATT CACGGCGATT ACCATCTTGG GCAGGTGCTC TTTACGGGTA AGGATTTCAC CATTATTGAT TTTGAAGGTG AGCCAGCACG TCCGCTTTCG GAGCGTAAAA TTAAGCGCTC AGTTTTTCGT GATGTGGCTG GAATGCTTCG CTCATTTGAT TACGCCGCCT TTAGCGCATT GCGTCAAATT GCACCAACCC TTCGCCCCGA CGAGTTGCCA ATGCTTGACG CATGGGCAGA GCGCTGGAGT TTTTACGTGG GGCAGCACTT TATTAACCGC TATTTTGAAG CCACTAATGG TAGCTCTATT GTGCCCGTTG AGGCTCCACA GCGTGAGCAC TTGCTGCGCG GCTACTTAAT GAACAAAGCG ATTTATGAGT TGAATTATGA GCTAAACAAC CGTCCCGATT GGGCAGCAAT TCCATTACGT GGTATTTTAA AGCTCATAGA GCAATAA
|
Protein sequence | MYQPEPLWYK DAIIYEAHVK TFYDSDNDGI GDFQGLRQKL GYLQSLGITA IWLLPFYPSP LRDDGYDIAD YMTVNPDYGT MDDFRAFLEE AHSLGLKVIT ELVVNHTSDQ HAWFQRARHA PKDSPERNFY VWSDDPNKYS ETRIIFQDFE ASNWTYDSVA GQYYWHRFYH HQPDLNFENP AVHAALLHVL DFWLGMGVDG LRLDAVPYLY EEEGTNCENL PRTYQYLRDL RSYIDEKYPN RMLLAEANQW PEDSAAYLGN GDMCHMNFHF PLMPRMYMAL ATEDRFPILD ILEQTPEIPE SCQWASFLRN HDELTLEMVT DEERDYMRRV YANDPRARIN LGIRRRLAPL MSNDRRKIEL MNIMLLSLPG TPVLYYGDEI GMGDNFYLGD RDGVRTPMQW NADRNAGFSR ANPQRLQLPV IIDPEYHYEA VNVEVQESNI HSLLWWMRQT ISTAHRHKAF SRGTIEFLPV KNSKVLSFIR QYEDETMLCV INLSKNAQAV TVDLSRFNGY TPEEVFSLNR FPKIRSTPYM LALGAYGYFW LKLIKEEKEV DRHALLDGSV VSVNRWQSLF IGKNREKLET AVFSSYYMAA RWFGGKARTI IRISITDTIP IANVANTKLL VTEVRYSSGE NENYQLPVTF VPLANLQPSD EYFSKQVIAR ITVGDEEGYL CDATFTPAFL QELYSVATAK GSWQGKQGVV NGSSAPKLAA FLANVADAAP ELMGAEQSNT SIRYADNLCL KLYRRIESGV SPEVEMCSAL SERTSFTNLP TYLGTVNYSR SRSSRCSIGI LQTYVPNQGD AWQLSLDQAR RYFDAIHSAL PNALAMPALP ALSGNPAPLP ELMQELIGGH YLGMIEKLAE RTAEMHLALA TLESDPAFAP EAFTSLYQRS IYQAMCEQVK RSVILIRELL PSLNGEQQTL ATQFVQKQKQ ILQQFDPIRT EKIEALKIRI HGDYHLGQVL FTGKDFTIID FEGEPARPLS ERKIKRSVFR DVAGMLRSFD YAAFSALRQI APTLRPDELP MLDAWAERWS FYVGQHFINR YFEATNGSSI VPVEAPQREH LLRGYLMNKA IYELNYELNN RPDWAAIPLR GILKLIEQ
|
| |