Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1771 |
Symbol | |
ID | 4242236 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 2695073 |
End bp | 2697295 |
Gene Length | 2223 bp |
Protein Length | 740 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638106896 |
Product | SMP-30/gluconolaconase/LRE-like region |
Protein accession | YP_721504 |
Protein GI | 113475443 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3386] Gluconolactonase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.212253 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAAAAG TAGCAACAGT AACTGTAATT CAGGAAATTA ATTCAGCCAA TGAAAAAATT CCAGTATTAG CATTATTACC ATCTGAAAAT GAAAAGCAGT TATTTTCTAG CCCTGATTTT TCAGAATATT GCTTTCATTG TTTAGAATAT TCATCACTAA AAACAATACC ACCATCACTC AAAGGTCCAG TCAACTTAGT GGCCTTTTGC CAAGACGCGA TCGCTTATGC CCAAAGCCAT CATATTACAG TTGTTTACTA CAGCTTTGAT ATTAGCAACT TAATAGCCGC TGTAGTTTGT CAAAACCTCA ATCTTTTTGG TCCTTCCCTG GAAAGTGTAT TACAGTGTTT TCATAAATTT TATGGTCGTG AAATTGACTC ATACCCCCCC GGCTACAGTT TTGTGCAACT GGAAAATGAA AAAAAGGTTT CCATTGATCA ACCCACTCTC AACACCCTAA CTTTTCCAGT ATTTATAAAA CCAGTTTGTT CTAGTTACGG AATGTTTTGT GCTCCGGTTG ATACCCCTGA GCAATTAGAA CAGGTATGTA CAGAACTAGC GACAGCTTAT CAACCTTATT GGCAAATGTA TTCAACTTTC TTCCAAGAAT TTGTTGATGC TACAAAATAC CCCTTAGCTG TAACTTCTCA AGTTCCACTT TTAGTTGAAG AACTAATTTC TGGTGAACCC TTAACCTGTG ATGGCTTCGT TTATAAAGGA GAGATTAATT TTCTGGGTTT AGTAGATACT GTAGAAGCGC CAAATGGATC CGTAGACTGT TATATTTTTC CCTCTCAAGT CAGTGAACTC CAAAAAAAAG CTATTTATGA GCGAGTCACT AATTTTATTC AAAATAGCGG TTTAGATAAC AGCTTTTTCA GCGCTGAATT TTGGCTTCAA GGGCAAACGC CTCCCATCTT AATTGAAATG AATGCGCGTA TGAGTGCAAC ATTCAGCTTT CTATATAAAC AAAGTATTAA CTTCAACTTA CCATTAGCTG CCCTAAAATT AGCTCAGGGA ATTTGCCCCC AAATTCCTAA ACAGCCATTA CAACAACAAA TACCCACAAG TATTCGCCTA TATCTTTCAA CTCGTAAAAG TGGCCTCGCC TCAACACTAC TTGATTTTAG TCTGGCTCAA CAAACCCTAG GCAGCTCACA GCTCATTACC TTTAACTGCA AACCAGAAGA GCAAATCAAA AATAGCTCCT ATCATTCAAC TCCACTAGCA GAGCTCAACC TTTTTGGGAC CAGCCAGTTT CCCCTAAAAT TTTATGGAGA GCACCTACGG AATGCACTAC TACTAGAACG ACGCTCACCA GCCTATCAAA TTACCGTAGT ACCAGACCTT CTTCTCCAGG GAGGTGAAGG ACTCTGCTAT GACTCCAGAA ATAATCAACT ACTTTGTCTA GACTATGGCA ATTTTCAGCT CATTCAACTG TGCTTGACAA CCCGCAATAT ACAAAGGTTT TCATTACCTG CAGCATTTTA TGGCATTGCA GTATCTGCAA CAGGAACCGT ATTATTATCT GGGGAGTTAG GGTTGATGGA GTTTAACCTC AACACCCAAG AGCTAGTGCC CATTGTTCAA GAATATCAAG GTCAAAAATT AGTTTGTAAT GATGTAATTA TTGACAAAAC CGGATGCATA TGGTTTAATA CAATAAATGA AGACACCACA GGAATTGGAA AAGAAGGAGC ACTGCTATGC TGTAATTCCC AGGGGCAAGT TCAAGAAGTT TTTAAAGGGT TTGGTTATGC TAATAGCATG GGAACGTCAC CAGATGGGCG ATCGCTTTAT GTTGTAGACA GTCTCGAACG GGTGATCCAT GTCTTCACCC TCAAATCATC TAAAATGGAA GCAACCCTAA AAGATGTTCA GTACCATCAC AAATTTATTG TTGCTAATGA AAATGAAGGA GTACTTGATG GACTAGCTGT AGATAGATCT GGTTATCTTT GGCTCACATT TTGGTTTGGT GGTAAAATCA TCTGTGTTCA CCCGCGATCA GAAGCAAGGC TGAGAGAGGT GACTTTACCT GTAATGAACA TTACTAGTGT GAGCGTTGTT GATTCTGAAT GTAATCCCAA TGCTAAAGCA GCCAACAAAT TGTTTGCCTG CTCATCTGTA GTTTCTTGGC TCGGAGGAAA GAGCCTTTCA CCCTGGTATG CAAGTGAATT GAAAGAGTCA AAACAAGGAT ACCTTTTTGA AATTGATTTA TAA
|
Protein sequence | MLKVATVTVI QEINSANEKI PVLALLPSEN EKQLFSSPDF SEYCFHCLEY SSLKTIPPSL KGPVNLVAFC QDAIAYAQSH HITVVYYSFD ISNLIAAVVC QNLNLFGPSL ESVLQCFHKF YGREIDSYPP GYSFVQLENE KKVSIDQPTL NTLTFPVFIK PVCSSYGMFC APVDTPEQLE QVCTELATAY QPYWQMYSTF FQEFVDATKY PLAVTSQVPL LVEELISGEP LTCDGFVYKG EINFLGLVDT VEAPNGSVDC YIFPSQVSEL QKKAIYERVT NFIQNSGLDN SFFSAEFWLQ GQTPPILIEM NARMSATFSF LYKQSINFNL PLAALKLAQG ICPQIPKQPL QQQIPTSIRL YLSTRKSGLA STLLDFSLAQ QTLGSSQLIT FNCKPEEQIK NSSYHSTPLA ELNLFGTSQF PLKFYGEHLR NALLLERRSP AYQITVVPDL LLQGGEGLCY DSRNNQLLCL DYGNFQLIQL CLTTRNIQRF SLPAAFYGIA VSATGTVLLS GELGLMEFNL NTQELVPIVQ EYQGQKLVCN DVIIDKTGCI WFNTINEDTT GIGKEGALLC CNSQGQVQEV FKGFGYANSM GTSPDGRSLY VVDSLERVIH VFTLKSSKME ATLKDVQYHH KFIVANENEG VLDGLAVDRS GYLWLTFWFG GKIICVHPRS EARLREVTLP VMNITSVSVV DSECNPNAKA ANKLFACSSV VSWLGGKSLS PWYASELKES KQGYLFEIDL
|
| |