Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0178 |
Symbol | |
ID | 4242928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 264281 |
End bp | 265960 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 638105524 |
Product | GUN4-like |
Protein accession | YP_720143 |
Protein GI | 113474082 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000497221 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.66578 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATCTC TACTTCGTGC TTTTGGTTTG ACTTTCCTGT CACTAACTCT GACAAATTGT TCTTTGTCTC CAGAAAAAAT AGCTTCTCGG TTAGAGCCTA GTATTGTTAA AGTATTTTAT CAAGGTGAAC CAGGACATGG CACTGGTTTT TTTGTGCCTG GAGAAAAAGG GGTCTGTACT GTACTCACGG CGGCTCATGT TGTGAAAAAA GAGGGGGAAA ATTTATTAAA AACTAATGAT GGGAAGGAGT GGCCTGCTTC TCAGGTCGAA ATATTTCCTA ATAATATAGA TTTGGCTTTG GTCAGTTTTC AGCCGGAAAA AGGAAATTGT AATTATTCGC CACTGAAAAT AGGTAACTCA GATGGCCTCA AACAAGGTAG TTCTATCTAT ATTTCTGGTT TTCCTATTAG GGATGGGTAC TTGGTATCAC AGTTTGTTTT GGGCAGTGTT TCTCGTTTGG ATAGGTTGGC CCAGGGTTAT GGGGTTTCTT ATCAGGCTTT GACTGTGGGG GGTATGAGTG GTGCCCCTGT TGTTGATGTT AAGGGTGAGG TTGTGGCTGT TCATGGGATG AGTGATGTGG AGGTTGTTAA GAATTTTGCT TCACTACAGG CAAGTTTGTC TGAGTCGGAA CGCTCTGTTT ATCAGGATGC TGTGCGACGG GTTGAGGCGG GTGTTCAACG TTACACTTTT TCTTGGGGTA TACCTATTAA TATGCTTGAA AATTATCGCG GCGAGGCGAT CGCTCTGGGC ATGGAAGGAC AGGAGGTTGA GTTACAAAAA CAACTTGAGT TGGAGCGGCG GAAGCGTGAG GAGGCGGAAC GACGAGTGGA AAAGCTGGAG GCGGATCTAC AAAAGGAACT TCGAGAGGCT GAAAAGAGGG CTAAAGAGGA ACGGGCAGTT GAACTACAAA GGCAACTAGA GGAGGAAAAA CGCCAGCGTG AAGAGGCGGA GCGACGAGAA CGAGAGCTGG AAGCTGAACG ACAACGGCAG AAAAAAAATG AGGTTTCTCT AGTTTCAGCT AAGGGGGTTG ACTATCGGAA ACTGCGTGAC TTATTGAAGG CTAAAAAGTG GCAGGAGGCA GACGCAGAAA CAGAGAGGGT AATTTTAAAA GCTGCGAGTA GGGAGTCGGA AGGATGGTTG AGAGGGTCGG ATGCTAAAAA TTTTTCTTGT CAAGATTTAG GCACGATTGA CAAACTTTGG GTAAAATATA GTAATGGAAA ATTTGGATTT TCTGTGCAGA AGCAAATTTA TCTGAGTTTG GGTGGTACAA AAGAGTATAA TAGAGATGTG TGGGAAAAGT TTGGAGACAA AGTAGGATGG CGTAAAGGAG GTGAATGGTT GTCGTATAGT GAATTAACTT TTGATGATAA ACATTATGTG GGGCACCTGC CGTGCCGGGC GTTGGGCGTT TGGGGTATTT TTTTTATTGG CTGGGTATAC CGGCTGCACC TTTTCGGTCG CTGGGTTATT CCACCTGTAG ATTGTAACAT ATATTATATC TGGCGTCAAG GGGAAAAATT GGGGAAAGCT TTTTTAACGG TTTTTCTTAA TTTCGGCGAT CTGATACCTG GCACCAGGGG GAAAAAAAAG GAAAAAAAGA GGGGGAGGGC TCAGAGAAAA AAGTCAAAAG TAGAATTCAG TCAGCCAGAA GTTGGAAATG TTGTCAGAAG TCTCACCTAA
|
Protein sequence | MKSLLRAFGL TFLSLTLTNC SLSPEKIASR LEPSIVKVFY QGEPGHGTGF FVPGEKGVCT VLTAAHVVKK EGENLLKTND GKEWPASQVE IFPNNIDLAL VSFQPEKGNC NYSPLKIGNS DGLKQGSSIY ISGFPIRDGY LVSQFVLGSV SRLDRLAQGY GVSYQALTVG GMSGAPVVDV KGEVVAVHGM SDVEVVKNFA SLQASLSESE RSVYQDAVRR VEAGVQRYTF SWGIPINMLE NYRGEAIALG MEGQEVELQK QLELERRKRE EAERRVEKLE ADLQKELREA EKRAKEERAV ELQRQLEEEK RQREEAERRE RELEAERQRQ KKNEVSLVSA KGVDYRKLRD LLKAKKWQEA DAETERVILK AASRESEGWL RGSDAKNFSC QDLGTIDKLW VKYSNGKFGF SVQKQIYLSL GGTKEYNRDV WEKFGDKVGW RKGGEWLSYS ELTFDDKHYV GHLPCRALGV WGIFFIGWVY RLHLFGRWVI PPVDCNIYYI WRQGEKLGKA FLTVFLNFGD LIPGTRGKKK EKKRGRAQRK KSKVEFSQPE VGNVVRSLT
|
| |