Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2102 |
Symbol | |
ID | 4243936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 3276390 |
End bp | 3277850 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 638107209 |
Product | hypothetical protein |
Protein accession | YP_721812 |
Protein GI | 113475751 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0719505 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.288843 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACTAA TGGTGACATT TTATATTAGT TTTATGACGG AGATACCAGA TTTAGATATA CTGTCACAAT TTTTGTTAGT TTTTTTACTA ATTACTATTA ATGCATTTTT TGTATTGGCC GAATTTTCTA TTGTTTCTGT GCGAAGGTCA CGTATTAATC AATTAGTAGA TGCAGGTGAT GTTCAAGCAA AAATTGTTCA ACAACTTCAA AGACACATAG AGCGACTTCT ATCGACAACT CAGTTAGGTA TTACTTTATC TAGTTTTACT TTGGGTTGGA TAGGTGCAAA AACTATGGTA GAAATCGTTA CAAATGCTAT AAAAACCATA CCATTACCTA TTCATATTGG TCAAGGGAAC GATGACTCCG TAGCTATTAC TATTGTCACC TTTTTGATTT TGGCTTACCT ACAAATTGTG CTCGGAGAGT TATGCCCAAA GTCAGTAGCA CTCATTTATT CAGAACAGTT ATCTAGATTA TTAGGCCCTT TAAGTATAGC CATCTCTCGT TTGTTTAATC CGTTTATCTG GATTTTAAAT CAATCTACTT ATTGGTTATT AAGACTGCTG GGAATTAGAT CTACTGGTAG TGCCTGGAAA GCACCTGTTA CTTCAGAAGA ATTGCAATTA ATTATCTCTA CTTCTACAGA ATCAATAGGT TTAGAAGCAG AAGAACGACA ACTACTTAGT AATGTATTTG AATTTGGGGA AGTATTAGCT GTAGAAATTA TGGTCCCACG GACAAATATA GATACTATTT CTAGTACAGC TACTTTTCAA GATTTGTTAA ATGAAGTTCA GCTATCTGGT CATTCCAGAT ATCCTGTAGT TGGTGAATCA ATAGATGATA TTCAGGGCAT TATTGACTTT AAAGAACTAG CTAAACCTTT AGCGAAAGGA TTATTATGCC CAAAAACTTC TATTTTATCT TGGGTTCGAC CTGCAAGATT TGTTTCTGAA CAAACTTATC TGAATGAGTT ATTATCATTA ATGCAACGAA TGCGTCAAAT TTCAAATAGT CCTCAGCACC CAGAAATGGT AATAGTGGTA GATGAGTTTG GTGGAACTGC TGGATTAATT ACACGGGAAG ATTTAATTGC TGAAATTATT AGTAGTGATA GTTATGAAGT AACAGGCTCT GAAGAACTTA CCTTACAGAT GTTAGATGAT CAGACTTTTA TTGTACAGGC ACAATTAAGT GTTGAAGAAG TTAATGAACT TTTAAATTTA GATTTGCCAG TGACTGAGGA CTATCAGACT TTGGGAGGGT TTTTGATTTA TCAATTACAA AAAATTCCCA CTCAAGGAGA AAAGTTACAC TACAAAAATC TTGAATTTAC TGTCATCTCT GCTGAAGGTC AACGTCTAGA TAAAATCAAT ATTTGTCAAT TAGAAGGGTC AGAAACTGAA ACAACTATTA ATCTAGTTTT AGAGGATGAG GCGGGCAAAA AGCAAAGCTA A
|
Protein sequence | MSLMVTFYIS FMTEIPDLDI LSQFLLVFLL ITINAFFVLA EFSIVSVRRS RINQLVDAGD VQAKIVQQLQ RHIERLLSTT QLGITLSSFT LGWIGAKTMV EIVTNAIKTI PLPIHIGQGN DDSVAITIVT FLILAYLQIV LGELCPKSVA LIYSEQLSRL LGPLSIAISR LFNPFIWILN QSTYWLLRLL GIRSTGSAWK APVTSEELQL IISTSTESIG LEAEERQLLS NVFEFGEVLA VEIMVPRTNI DTISSTATFQ DLLNEVQLSG HSRYPVVGES IDDIQGIIDF KELAKPLAKG LLCPKTSILS WVRPARFVSE QTYLNELLSL MQRMRQISNS PQHPEMVIVV DEFGGTAGLI TREDLIAEII SSDSYEVTGS EELTLQMLDD QTFIVQAQLS VEEVNELLNL DLPVTEDYQT LGGFLIYQLQ KIPTQGEKLH YKNLEFTVIS AEGQRLDKIN ICQLEGSETE TTINLVLEDE AGKKQS
|
| |