Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4624 |
Symbol | |
ID | 4246278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 7104047 |
End bp | 7107124 |
Gene Length | 3078 bp |
Protein Length | 1025 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638109493 |
Product | Alpha-glucosidase |
Protein accession | YP_724069 |
Protein GI | 113478008 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAGTG AAATTTTTAA AAGTGATCGC CTCTATAAAT TTATCAAAGT CGAAGAATAC TTCGATCACT ACCAAGAATG GGATAAACTG GGACCTGCAA AAAATCCTGT GTTCGACGGG AAGTACACTT TAAGCCTTAC CTTTGATAAG GCCGGCGGAG GTGAGTGCTC AATGTTGCTT GAGATCACTC AGAACGATGT CTTACGGATG CGCTTTAACC CCAACAAAAA ATTAGCAGGG GATTATGCAA GGGGCACTCG CGTGGATAAT GATGAAACAG ATGAAGAGAT CAAAAGGAAT AATCAACGTA CTGTAACTTA CAGACAAGTT AAGGATATTA AGTTAGGAGA TCAGGTTATA TTGACGGCAA AAAGCAATTA TAACCAGGTT GGTCGTAATA CACCCTTCAA TATGGAAGTA GTGATCACTT TAAATCCATT TGGTATCAAA GTATTCAACA AGTCCGAATC GAATTCAGAG TATGCCAATA TTCCTGTCTG GGAAACCGCA AACACTTCAA TTTACTATAC CGCTAATGGC ACAGATGATT ATGCCATCAT TCAGTCGGTC AAGAAGTCGG CCGATGCCAA ATATATTGGT TTTGGGGAAC AGGGGGGAAC TAAACTCAGC AAAAATATGG ATCAGTTGAA TTATTTCAAC TTTGATAATA TGCGCTATCG ACAGGTCTAT AATCGTGGAC CTTTAGATAA CCGCGAACCT CTTTACCACT CTGAGCCTTT CTTTTATGAG TTCAATGGCG TTCCTGGTAG CGACAATGTT AATGCAGTTC TAGTGGATAA CCCAAGTCAA GTATTCATGG ACATAGGCTA TAGTAATTCT GGTCGCTATA TGTTTGGGAC TCGCTTTGGT GATCTTGACT ATTATGTGTT TTTTGGAGAA GACCCGAAGA ATATTTTAGA TAGCTATACG GCCGTGATCG GTCGCCCAGA GTTGAAACCG CGCTATGCTT TGGGATATCA CCAAGGTTGC TATGGATACG AAAAGCGGAG TGACTTAGAA TGGGTTGTGG CGCGATATCG GGATTGGGGT ATACCCATTG ATGGTCTCGC TGTTGATGTT GACTTACAAG CAAATTATCG GACTTTTACT ATCAACATCA ATAATTTCTG GGAACCAGAT AAAATGTTTG ATAACTTGCG AAAGCAGGGG ATAAAATGCT GTACGAACAT TACTCCCGTG ATCAGCAGTC AAGATAAGCT TGGCAAGACA GATTATGACT ACTCAACCTA CGTTGAGGGA AAGAATAACA ACTACTTTGT TGTTGATAAG CGCTACGATC CATATAATCC AATTAGTAAA GAGTATCAGA TCTATAATGG GGGTATTGAA GATCGTTCCA ACAAAGATGA TAATTCCGAT CCCGAAGGGT TTGATAGTAG TGAGCCTTAC ATTGGCGAAG TATATTACGG CAAGGATGCC AATGGTAAAG AGCTGGGAAG CCCTGGTCAT TATCCTGATT TAGGCCGTCA AGAAGTACGC GAATGGTGGG GCAAGCAATA TCAATATCTG TATGAAATGG GACTAGAATT TGTCTGGCAG GATATGACTA CCCCAGCTAT CCGTGACTTC CGTGGGGATA TGAAAGGTTT CCCCTTCCGT CTGTATGTGA CTGACGACTT CTATCCCTCG GATGTCAAAC TAACCCCAGC TCTGAAAGTT TGGAATTTGT ATTCATACAA TTTGCACAAA GCCACCTACG AAGGACTCAA TAACCTGTAC AAACTATCCA AAGGACTAGA GTGGCGCGAA AACAAGCGCA ACTATATCAT CGGACGAGGG AGTTTTTCTG GTTCTCACAG ATATTCTGGT TTATGGACTG GGGATAACTC TTCCGAATGG GCTTTTCTTC AGATGAATAT CTCTCAGGTT CTGTCTCTAG GCATGAACGC TTTGGCAGTT ACCGGGCAAG ATATCGGAGG TTTCGAGCAA GAATATGGTA ACGACAAACA GCAGTGGGCG AGTCCGGAAC TGGTAATTCG CTGGACTGCT GCAGGTGCCT TCTTACCATG GTTCCGTAAC CACTATGTCC GTAAGGGTCG TAAAGAATTC CAAGAACCAT TCCAATATAT AGAGTGGTTT GAGACATGGA ACAAGCCAAT TCCCGAGCCC CAGGACTTGT ACAGGATGGT GCCAGAGATT TGTAAGCATT ATATTGAATT GCGCTACCGT CTGATGCAGT TATTCTACGA CACATTGTTT GAGAACACTC TGGATGGTCT GCCTATCTGT CGTCCTTTGT TCCTCAACGA TCCCCAGGAC AAGTCTCTTT ACAATGATAA AGACGAATTT TTAAACAACG AATTTTTCGT AGGCAAGGAT TTTCTCGTTG CTCCGGTATT GCTTCCTCAG AGTGAAACCA ATGGTGGCAA GCGGGATATC TATTTACCTA AGCCTAGCTA CTGGTACAAC TTTGTCAACA ACGTTATGCC TCTTAATAAT GCTTTGGAAG GGGGAACAAC GATACGAGAT TTTGATGCTA ATATCAATAC TCGTGACCAG CACATTAACT TTATTGTACC CATCTACGTT CGCTCGGCAG CAATTATTCC CACAATTGAG TTGGAACAAT ATGTGGGAGA AAAGAACGCC AAAGGAGAAA AAAATCCTAT TACCCTGAAT ATCTATCCAG ACTATCAGAA GGAGAATGGA GGAGAATACC ACATGTACTT GGATGATGGG GAGAGCCGTT CTTCTGCCCC GAAATCACAA GTGGACGATC CCAAGGCAAA CGACGAATAT CGCGAGATTT TGATTACTAA TAAATACACT GGGGAAAAGA CCAGAGAGAT TAAAGTTAAT CGTGTTCACG ATGGATATAC GCCTATGGAA GACTTTTTCT TTGTGGCAAT ATTGCACGAT CCTACTGAAC AAAAGGGAGA ACACGGCCCT TTGCAGGAGG TGACTATGGA AGGTAAACCG CAACAGATGG TTAGTGACAA TGGTGCCCTC TGGGGTAGCC CTAACAACGC ATGGTACTAT AATGCAGACA TCAATATCAG TTTTATCAAG GTGTTTGATA ATATGCCTAA GACAACTATC ATGTTAGGTT ATGTCTAA
|
Protein sequence | MPSEIFKSDR LYKFIKVEEY FDHYQEWDKL GPAKNPVFDG KYTLSLTFDK AGGGECSMLL EITQNDVLRM RFNPNKKLAG DYARGTRVDN DETDEEIKRN NQRTVTYRQV KDIKLGDQVI LTAKSNYNQV GRNTPFNMEV VITLNPFGIK VFNKSESNSE YANIPVWETA NTSIYYTANG TDDYAIIQSV KKSADAKYIG FGEQGGTKLS KNMDQLNYFN FDNMRYRQVY NRGPLDNREP LYHSEPFFYE FNGVPGSDNV NAVLVDNPSQ VFMDIGYSNS GRYMFGTRFG DLDYYVFFGE DPKNILDSYT AVIGRPELKP RYALGYHQGC YGYEKRSDLE WVVARYRDWG IPIDGLAVDV DLQANYRTFT ININNFWEPD KMFDNLRKQG IKCCTNITPV ISSQDKLGKT DYDYSTYVEG KNNNYFVVDK RYDPYNPISK EYQIYNGGIE DRSNKDDNSD PEGFDSSEPY IGEVYYGKDA NGKELGSPGH YPDLGRQEVR EWWGKQYQYL YEMGLEFVWQ DMTTPAIRDF RGDMKGFPFR LYVTDDFYPS DVKLTPALKV WNLYSYNLHK ATYEGLNNLY KLSKGLEWRE NKRNYIIGRG SFSGSHRYSG LWTGDNSSEW AFLQMNISQV LSLGMNALAV TGQDIGGFEQ EYGNDKQQWA SPELVIRWTA AGAFLPWFRN HYVRKGRKEF QEPFQYIEWF ETWNKPIPEP QDLYRMVPEI CKHYIELRYR LMQLFYDTLF ENTLDGLPIC RPLFLNDPQD KSLYNDKDEF LNNEFFVGKD FLVAPVLLPQ SETNGGKRDI YLPKPSYWYN FVNNVMPLNN ALEGGTTIRD FDANINTRDQ HINFIVPIYV RSAAIIPTIE LEQYVGEKNA KGEKNPITLN IYPDYQKENG GEYHMYLDDG ESRSSAPKSQ VDDPKANDEY REILITNKYT GEKTREIKVN RVHDGYTPME DFFFVAILHD PTEQKGEHGP LQEVTMEGKP QQMVSDNGAL WGSPNNAWYY NADINISFIK VFDNMPKTTI MLGYV
|
| |