Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1785 |
Symbol | |
ID | 4243767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 2726086 |
End bp | 2727363 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 638106909 |
Product | hypothetical protein |
Protein accession | YP_721517 |
Protein GI | 113475456 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000770102 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACGCA ATCAATTTTA TCCCGATAAA TCTGTTCCTC CAGAAAAATT TGTAGGTAGA ACATCTGAAC TATCCACGAT TTTTGACAAA ATTAATAGTC GCGATCATGT TGCTATTTTT GGTAGTTCTG GTATGGGTAA AACATCTTTG CTTCAATATA TAGAAAACTC TAAATTTTGG GAAGAAAGAG ACTTAGATTT TTCAGAAGCT TTGATTGTTT ATCACAACTG CGAGCTTTCA GTTATAGATA GTTTTTGGCA AGAAGTCCTC AGAACATTAA TAGATAAAGC TACAGGTGAT CAAGATTTAG TGAGTAAGAT TAATGCTTTA TTAGGGTTGG AGAAAATAGA AATAACAGAT ATACGGGAGC TTCTCAGAGA GATTGGGAAA AGAGGTAAGT TTTTATTATT ATTATTAGAT GACTATCATA GAATACTTGG TACACGAGAA GAGTATCCGG AAAACCAGGG AAAAAAATCT AAAAAAGTGC TGACTTTTTT AAGTGAGTTG CGTAACCTAG CAGTTCATAA TAGAGAAGGT CAATATTTCT CAACTATTGT TGCTACGTTT CAAAAGTTGC ATGAACTAGG TCCAACAATT GTTCCTGGTG GTTCTCCTTG GTATAATCAT TATGCCTATC TACCTTTAAA ACCTTTTTGT AAAAGCGATA TTGAGGGTCA TTTTTTTAAT CGCGATAGTC ATTTTTTCAT TTCAGATGCT CCGAAAGAAG AAGTTTTAAA AATGACTGGT GGGTATCCAG CGTTACTTCA GTTTACAGGT TATATATTTT CTCGCTTGGA ACCAGTTAAT GTTGATACTC TGAATACAAT GTTAAAAAAC GATGCTGATA GAATTTTTCA GGATGTCTGG AACAATTTTG AAAAAAATGA GCAAGAAATT TTGCAGTTAA TTTTAATTGA TAAATATAAG GGTAAATTCA GGGAAATTTC TTATTCTATT GCTGGCATAG AAAAAGAGTT TATTCGCAAT ATTAGCATAT TGAAGAGTCT TGAAGAAAAA GGATTTATCA GTCAGGTTAA ACAAGCAAAT AAATATAGTT TTACTTCTTC TTTAATGGAA GATTTTATTG GTGATCAACT TGCAGAAAAA AATGTTTCAA ACGCTAAAGA CCGCAAGATA GTGATTAATT TATTTATTAT CAAGATTACT CTTGGACTAT GGAAGAAAGT TAAAGAAAAA ATACAGCCTG TTACTAAGTT CATATCACCT CTTGCTAAAA TCATCGATTT AATCGCTAAC AAAATAGAAG GCAAATAA
|
Protein sequence | MPRNQFYPDK SVPPEKFVGR TSELSTIFDK INSRDHVAIF GSSGMGKTSL LQYIENSKFW EERDLDFSEA LIVYHNCELS VIDSFWQEVL RTLIDKATGD QDLVSKINAL LGLEKIEITD IRELLREIGK RGKFLLLLLD DYHRILGTRE EYPENQGKKS KKVLTFLSEL RNLAVHNREG QYFSTIVATF QKLHELGPTI VPGGSPWYNH YAYLPLKPFC KSDIEGHFFN RDSHFFISDA PKEEVLKMTG GYPALLQFTG YIFSRLEPVN VDTLNTMLKN DADRIFQDVW NNFEKNEQEI LQLILIDKYK GKFREISYSI AGIEKEFIRN ISILKSLEEK GFISQVKQAN KYSFTSSLME DFIGDQLAEK NVSNAKDRKI VINLFIIKIT LGLWKKVKEK IQPVTKFISP LAKIIDLIAN KIEGK
|
| |