Gene Tery_4624 details

Gene Information

Locus tag: Tery_4624
Symbol:
ID: 4246278
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 7104047
End bp: 7107124
Gene Length: 3078 bp
Protein Length: 1025 aa
Translation table: 11
GC content: 42%
IMG OID: 638109493
Product: Alpha-glucosidase
Protein accession: YP_724069
Protein GI: 113478008
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases
TIGRFAM ID:
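
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record itself. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(7104047, 7107124)
print(length_bp)                  # 3078
print(protein_length(length_bp))  # 1025
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of the record.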


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGAGTG AAATTTTTAA AAGTGATCGC CTCTATAAAT TTATCAAAGT CGAAGAATAC 
TTCGATCACT ACCAAGAATG GGATAAACTG GGACCTGCAA AAAATCCTGT GTTCGACGGG
AAGTACACTT TAAGCCTTAC CTTTGATAAG GCCGGCGGAG GTGAGTGCTC AATGTTGCTT
GAGATCACTC AGAACGATGT CTTACGGATG CGCTTTAACC CCAACAAAAA ATTAGCAGGG
GATTATGCAA GGGGCACTCG CGTGGATAAT GATGAAACAG ATGAAGAGAT CAAAAGGAAT
AATCAACGTA CTGTAACTTA CAGACAAGTT AAGGATATTA AGTTAGGAGA TCAGGTTATA
TTGACGGCAA AAAGCAATTA TAACCAGGTT GGTCGTAATA CACCCTTCAA TATGGAAGTA
GTGATCACTT TAAATCCATT TGGTATCAAA GTATTCAACA AGTCCGAATC GAATTCAGAG
TATGCCAATA TTCCTGTCTG GGAAACCGCA AACACTTCAA TTTACTATAC CGCTAATGGC
ACAGATGATT ATGCCATCAT TCAGTCGGTC AAGAAGTCGG CCGATGCCAA ATATATTGGT
TTTGGGGAAC AGGGGGGAAC TAAACTCAGC AAAAATATGG ATCAGTTGAA TTATTTCAAC
TTTGATAATA TGCGCTATCG ACAGGTCTAT AATCGTGGAC CTTTAGATAA CCGCGAACCT
CTTTACCACT CTGAGCCTTT CTTTTATGAG TTCAATGGCG TTCCTGGTAG CGACAATGTT
AATGCAGTTC TAGTGGATAA CCCAAGTCAA GTATTCATGG ACATAGGCTA TAGTAATTCT
GGTCGCTATA TGTTTGGGAC TCGCTTTGGT GATCTTGACT ATTATGTGTT TTTTGGAGAA
GACCCGAAGA ATATTTTAGA TAGCTATACG GCCGTGATCG GTCGCCCAGA GTTGAAACCG
CGCTATGCTT TGGGATATCA CCAAGGTTGC TATGGATACG AAAAGCGGAG TGACTTAGAA
TGGGTTGTGG CGCGATATCG GGATTGGGGT ATACCCATTG ATGGTCTCGC TGTTGATGTT
GACTTACAAG CAAATTATCG GACTTTTACT ATCAACATCA ATAATTTCTG GGAACCAGAT
AAAATGTTTG ATAACTTGCG AAAGCAGGGG ATAAAATGCT GTACGAACAT TACTCCCGTG
ATCAGCAGTC AAGATAAGCT TGGCAAGACA GATTATGACT ACTCAACCTA CGTTGAGGGA
AAGAATAACA ACTACTTTGT TGTTGATAAG CGCTACGATC CATATAATCC AATTAGTAAA
GAGTATCAGA TCTATAATGG GGGTATTGAA GATCGTTCCA ACAAAGATGA TAATTCCGAT
CCCGAAGGGT TTGATAGTAG TGAGCCTTAC ATTGGCGAAG TATATTACGG CAAGGATGCC
AATGGTAAAG AGCTGGGAAG CCCTGGTCAT TATCCTGATT TAGGCCGTCA AGAAGTACGC
GAATGGTGGG GCAAGCAATA TCAATATCTG TATGAAATGG GACTAGAATT TGTCTGGCAG
GATATGACTA CCCCAGCTAT CCGTGACTTC CGTGGGGATA TGAAAGGTTT CCCCTTCCGT
CTGTATGTGA CTGACGACTT CTATCCCTCG GATGTCAAAC TAACCCCAGC TCTGAAAGTT
TGGAATTTGT ATTCATACAA TTTGCACAAA GCCACCTACG AAGGACTCAA TAACCTGTAC
AAACTATCCA AAGGACTAGA GTGGCGCGAA AACAAGCGCA ACTATATCAT CGGACGAGGG
AGTTTTTCTG GTTCTCACAG ATATTCTGGT TTATGGACTG GGGATAACTC TTCCGAATGG
GCTTTTCTTC AGATGAATAT CTCTCAGGTT CTGTCTCTAG GCATGAACGC TTTGGCAGTT
ACCGGGCAAG ATATCGGAGG TTTCGAGCAA GAATATGGTA ACGACAAACA GCAGTGGGCG
AGTCCGGAAC TGGTAATTCG CTGGACTGCT GCAGGTGCCT TCTTACCATG GTTCCGTAAC
CACTATGTCC GTAAGGGTCG TAAAGAATTC CAAGAACCAT TCCAATATAT AGAGTGGTTT
GAGACATGGA ACAAGCCAAT TCCCGAGCCC CAGGACTTGT ACAGGATGGT GCCAGAGATT
TGTAAGCATT ATATTGAATT GCGCTACCGT CTGATGCAGT TATTCTACGA CACATTGTTT
GAGAACACTC TGGATGGTCT GCCTATCTGT CGTCCTTTGT TCCTCAACGA TCCCCAGGAC
AAGTCTCTTT ACAATGATAA AGACGAATTT TTAAACAACG AATTTTTCGT AGGCAAGGAT
TTTCTCGTTG CTCCGGTATT GCTTCCTCAG AGTGAAACCA ATGGTGGCAA GCGGGATATC
TATTTACCTA AGCCTAGCTA CTGGTACAAC TTTGTCAACA ACGTTATGCC TCTTAATAAT
GCTTTGGAAG GGGGAACAAC GATACGAGAT TTTGATGCTA ATATCAATAC TCGTGACCAG
CACATTAACT TTATTGTACC CATCTACGTT CGCTCGGCAG CAATTATTCC CACAATTGAG
TTGGAACAAT ATGTGGGAGA AAAGAACGCC AAAGGAGAAA AAAATCCTAT TACCCTGAAT
ATCTATCCAG ACTATCAGAA GGAGAATGGA GGAGAATACC ACATGTACTT GGATGATGGG
GAGAGCCGTT CTTCTGCCCC GAAATCACAA GTGGACGATC CCAAGGCAAA CGACGAATAT
CGCGAGATTT TGATTACTAA TAAATACACT GGGGAAAAGA CCAGAGAGAT TAAAGTTAAT
CGTGTTCACG ATGGATATAC GCCTATGGAA GACTTTTTCT TTGTGGCAAT ATTGCACGAT
CCTACTGAAC AAAAGGGAGA ACACGGCCCT TTGCAGGAGG TGACTATGGA AGGTAAACCG
CAACAGATGG TTAGTGACAA TGGTGCCCTC TGGGGTAGCC CTAACAACGC ATGGTACTAT
AATGCAGACA TCAATATCAG TTTTATCAAG GTGTTTGATA ATATGCCTAA GACAACTATC
ATGTTAGGTT ATGTCTAA
 
Protein sequence
MPSEIFKSDR LYKFIKVEEY FDHYQEWDKL GPAKNPVFDG KYTLSLTFDK AGGGECSMLL 
EITQNDVLRM RFNPNKKLAG DYARGTRVDN DETDEEIKRN NQRTVTYRQV KDIKLGDQVI
LTAKSNYNQV GRNTPFNMEV VITLNPFGIK VFNKSESNSE YANIPVWETA NTSIYYTANG
TDDYAIIQSV KKSADAKYIG FGEQGGTKLS KNMDQLNYFN FDNMRYRQVY NRGPLDNREP
LYHSEPFFYE FNGVPGSDNV NAVLVDNPSQ VFMDIGYSNS GRYMFGTRFG DLDYYVFFGE
DPKNILDSYT AVIGRPELKP RYALGYHQGC YGYEKRSDLE WVVARYRDWG IPIDGLAVDV
DLQANYRTFT ININNFWEPD KMFDNLRKQG IKCCTNITPV ISSQDKLGKT DYDYSTYVEG
KNNNYFVVDK RYDPYNPISK EYQIYNGGIE DRSNKDDNSD PEGFDSSEPY IGEVYYGKDA
NGKELGSPGH YPDLGRQEVR EWWGKQYQYL YEMGLEFVWQ DMTTPAIRDF RGDMKGFPFR
LYVTDDFYPS DVKLTPALKV WNLYSYNLHK ATYEGLNNLY KLSKGLEWRE NKRNYIIGRG
SFSGSHRYSG LWTGDNSSEW AFLQMNISQV LSLGMNALAV TGQDIGGFEQ EYGNDKQQWA
SPELVIRWTA AGAFLPWFRN HYVRKGRKEF QEPFQYIEWF ETWNKPIPEP QDLYRMVPEI
CKHYIELRYR LMQLFYDTLF ENTLDGLPIC RPLFLNDPQD KSLYNDKDEF LNNEFFVGKD
FLVAPVLLPQ SETNGGKRDI YLPKPSYWYN FVNNVMPLNN ALEGGTTIRD FDANINTRDQ
HINFIVPIYV RSAAIIPTIE LEQYVGEKNA KGEKNPITLN IYPDYQKENG GEYHMYLDDG
ESRSSAPKSQ VDDPKANDEY REILITNKYT GEKTREIKVN RVHDGYTPME DFFFVAILHD
PTEQKGEHGP LQEVTMEGKP QQMVSDNGAL WGSPNNAWYY NADINISFIK VFDNMPKTTI
MLGYV
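
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few in-frame codons. This is a minimal sketch using only the Python standard library. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). The `translate` function and the constant names are hypothetical and introduced only for illustration.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino acid assignments ("*" marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # remove the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGCCGAGTG AAATTTTTAA AAGTGATCGC"))  # MPSEIFKSDR
print(translate("ATGTTAGGTT ATGTCTAA"))               # MLGYV
```

The two outputs match the first block and the final residues of the protein sequence listed above.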