Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2617 |
Symbol | |
ID | 4244685 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 4041422 |
End bp | 4043224 |
Gene Length | 1803 bp |
Protein Length | 600 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638107686 |
Product | GUN4-like |
Protein accession | YP_722285 |
Protein GI | 113476224 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000986346 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.185793 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATCTC TACTGCCTAT TATCAGTATA ACTTTCTTAT CAATAACTCT GACAAATTTT TCTTCCTCTG TCAAAAAAAC AGAGGTCGAA GACATAGCTT CTCGACTAGA GGATAGTATT GTTAAACTAT CTTATAAAAA TCAACCAGGA GATGGAACTG GTTTTTTTGT AGAGGTGGAA GGAAAGTCAG AAGTCTGTAG TGTACTAACA GCAGCTCATG TTGTGGAAAA AGAAGGACAA AAAATCTTAT GGACTGAAAA AGATGAAAAA GTCTGGGATG TTGCTACGGT AGAAAGATTT CCTGGTAGTA TAGACTTGGC TTTGATTACT TTTAAGCCAC ACACAAAAAG ATGTAATTAT CCCGCACTAA AAATAGGTAA ACCAGAAAAC CTGAAGATAG GTAGTTCTAT TTTTGTTTAT GGTTTTCCTC GTCTGGATCA ACATTTAGTG GCCCAGTTTG TTGGGGGTCA GGTTTCAGCT TTGAAGAAAA AAGCCCGGGG TTATGGGGTT GCTTATAAAG CTTTGACTGT TGAAGGAATG AGTGGAGCCC CTGTTGTGGA TACAAAGGGT GAGGTTGTGG CAGTCCATGG AAGTAGTAAC TCTAAAATGG TGCCAAGTTT GATATCTCAG CAGATAAGTA GGCCTGATAT AGAATGGCAG CTTGATCGGC AAACTTTCAA TAGAATTAAT AATAGTAGTT TGACCTGTGC TTGGGGTGTA CCTATTAATT ATTTTCGGGA GTCTAAGTTT TATAATGCTA AGTTATATAG TAATCTTCTA CCTTTAGACT TGAATAAGTG GATACTTTCG ATATTTTCCA TTAGTGGGTT GATGTTTAGC TCTGGTATTG TTTCTTTTCG GTTCAAGCGT TTTCAAGCTT CACCAGTTTC GGGACAAGAA CAAAATGAGC AGGAAAGAGA GTTTGAGGAT GTAGTATTTA GGGGAGAAGA AGAGCACAGA CCGTTGAGTT CCCCGGCAAA TATTCCGACT CAGCTACAAA ATGAGCAGGA AAGAGAGTTT GAGGATGTAG TATTTAGGGG AGAAGAAGAG CGCAGACCGT TGAGTTCCCC GGCAAATATT CAGACTCAGC TACAAAATGA GCAGGAAAGA GAGTTTGAGG ATGTAGTATT TAGGGGAGAA GAAGAGTACA GACCGTTGAG TTCCCTGGCA AATATTCCGA CTCAGCTACA AAATGAGCAG GAAAGAGAGT TTGAGGATGT AGTATTTAGG GGAGAAGAAG AGCACAGACC GTTGAGTTCC CTGGCAAATA TTCAGACTCA GGAGCAGCAA AGGCAAGGAG GGGTTCAGGT TCTTGAACCT CCGTCTTCTG TGGTTTCTGT GCCTCTGGTT TCTGCAGCTG GAGTTGATTA TACTAGGTTG CATGAGTTAT TGGTGGCTAA AAGGTGGAAG GAAGCAGACG ATGAAACATA TCAAAGAATG ATAGAAGTGG CGGACCGGAA GTCCCAAGGA TGGTTGAGAA TTGAGGATAC AAAGAATTTT CCTAGTCAAG ATTTAGGGAT TATTGATAAG CTATGGCTCA GATATAGTAA TGGTATGTTT GGTTTTTCTG TTCAGAAGCA AGTTTATCAG AGTTTGGGTG GTACCCAGAG GTATAATCCA AAAGTAGTAG AGGATTTCGG AGATAAGGTG GGATGGCGCC TGGAGGGAAA ATGGTTGAGT TATGATGGTT TGACTGTGAG TGATAATTAT TACAGGGGAC ACCTGCCGTG TTGTGGGAAT GAGGGGTCTT TGTATGGCTG GGCAGCAGTT CTTTGGTCTC TTCTCTCTCA TAAAGATTTG TAA
|
Protein sequence | MKSLLPIISI TFLSITLTNF SSSVKKTEVE DIASRLEDSI VKLSYKNQPG DGTGFFVEVE GKSEVCSVLT AAHVVEKEGQ KILWTEKDEK VWDVATVERF PGSIDLALIT FKPHTKRCNY PALKIGKPEN LKIGSSIFVY GFPRLDQHLV AQFVGGQVSA LKKKARGYGV AYKALTVEGM SGAPVVDTKG EVVAVHGSSN SKMVPSLISQ QISRPDIEWQ LDRQTFNRIN NSSLTCAWGV PINYFRESKF YNAKLYSNLL PLDLNKWILS IFSISGLMFS SGIVSFRFKR FQASPVSGQE QNEQEREFED VVFRGEEEHR PLSSPANIPT QLQNEQEREF EDVVFRGEEE RRPLSSPANI QTQLQNEQER EFEDVVFRGE EEYRPLSSLA NIPTQLQNEQ EREFEDVVFR GEEEHRPLSS LANIQTQEQQ RQGGVQVLEP PSSVVSVPLV SAAGVDYTRL HELLVAKRWK EADDETYQRM IEVADRKSQG WLRIEDTKNF PSQDLGIIDK LWLRYSNGMF GFSVQKQVYQ SLGGTQRYNP KVVEDFGDKV GWRLEGKWLS YDGLTVSDNY YRGHLPCCGN EGSLYGWAAV LWSLLSHKDL
|
| |