Gene Information |
Locus tag | Tery_0174 |
Symbol | |
ID | 4242924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 259770 |
End bp | 261428 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638105520 |
Product | GUN4-like |
Protein accession | YP_720139 |
Protein GI | 113474078 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
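The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the stop codon is part of the gene length but not of the protein length; the record itself does not state either convention, though its numbers agree with both. Variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 259770
end_bp = 261428
gene_length_bp = 1659
protein_length_aa = 552

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
computed_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon
# is a stop codon, which does not contribute an amino acid.
codons, remainder = divmod(computed_length, 3)
computed_protein_length = codons - 1

print(computed_length == gene_length_bp)             # compare with "Gene Length"
print(remainder == 0)                                # whole number of codons?
print(computed_protein_length == protein_length_aa)  # compare with "Protein Length"
```

If any of the three comparisons were to print `False`, that would point to a different coordinate convention or to a partial or frameshifted CDS rather than to an error in the arithmetic.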
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.646851 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCTA TATTTCGGGC TATTATTCTA GCCTTCCTGG CATTAACTTT AACAAGTTGT TCATTACCAC CAGATCAAAT TGCCTCTCGA CTAGAACCTA GTCTTGTTAA AGTATTTTAT AAAAATCAAC CCGGACATGG AACTGGTTTT TTTGTACCTG GAGAAACAGG AGTTTGTAAG GTACTGACGG CAGCTCATGT TGTGAACAAA GAAGGGGAAA AATTATTACA AACTAAAGAT GGTAATGTCT GGGATGCTGC ATCTGTTGAA ATGTTTTCAG ATGATATAGA CTTGGCTTTA GTAACTTTTG AGCCAGAGAA AGAAAAATGT GATTATCCTA CTCTCAAAAT AGGTAATTCA GAGGATATCA AACAAGGTAG TTCTATATAT GTTTCTGGCC TTTCTAGTCG GGATGGGAAG ATGCTATCTC AATTTGTTAA AGGAAATGTT ACGGCTTTGA ATGTTTTTCC ACAGGGTTAT AGGGTTTCTT ATCAAGCTTT GACTGTCGCT GGAATGAGTG GAGCTCCTGT TATAGATGAG AGGGGTAAGG TGGTGGCAGT TCATGGGATG AGTGATGTGG AAACAGTTAA AGGTTTTAGT TCTTTGAAAA CAAGTTGGCC TGAGTTAGAG TTACAGACTA CCTGGCAAGC TGAAGAAGTT GTGAATACTG CTATTAAACA TTTGACTTTT TCTTGGGGTA TACCTATTAG TTTCTTTAGG GAGTCTCCGT TTTACTATGA CTCTGGGGAT ATCTATGGGT TAAGCTGGTG GATATTTTTG TCTGGTGCAG GAATAGTTGC TGTTAGTTTT ATTTATGTTG GTTTCAGGTA TTTAAATGTT TCACCATTTA TTGCTGAAGT TAATAATTTG AAAACACAAC TCCAGGATGA GGAAGACAAA GGGGAAGAGG TTCAGAAAGA GTTAAAGTCA CTAAAAAATA GTTATAAGGG GTTGGAAAGA AAACTGGAAG CGGAAATATC GGAGAGATCT GAGGCTGAGG AGCAAATTCA AACTCTACAA GTTGTTGTGG AAAAACAAAA AGTGTTGGAA GTGCAACTAG AAATTGAGAT GTCAACAAGA TATCAGGTTG AGGAGCAAAT TCAAACTCTA CAAGTTGCTG CACAAAAGGA AAGGGAGTTG GAAAGGCAGT TGGAGTTTGA GTCTCAAAAT ACTGAAGATA AGTCTGGTGT TGATTTATTT CTGGTTTCTG AGGTGGTCGG TGACTATACT AAGTTGCGTG ATCTATTGGC GGCTAAACAG TGGCGCGAAG CAGACTTAGA AACATATAAA AGAATGTTAG AGGTGGCGGG CAGAAACTTG AAGGGATCTT TGAGGGTTAA GGATGTGTAT AATTTTCCTT GCAAAGATTT AGTGACTATT GACGAACTGT GGATAAAATA TAGTGATGGT AAGTTTGGCT TGTCTGTTCA GAAGCAAATT TATGAGAGCA TGGGTGGTAC GAAAGACTAT GACTATAAGG TAATAGAAGA TTTTGGAAAT AGAGTTGGGT GGCGTCAAGA TGGAAAATGG TTGAGGTATC ATTATTTGAC TTTTAGTGAG AAGTATGAAA TGGGGTGTTT ACCAGTAAGT TTTTATCTTG AGAGGGTTGC TTCTCTTCTG TTTGCAGGAA TGTGCAAGTT TATAGACTGT GATCTTTAG
|
Protein sequence | MKPIFRAIIL AFLALTLTSC SLPPDQIASR LEPSLVKVFY KNQPGHGTGF FVPGETGVCK VLTAAHVVNK EGEKLLQTKD GNVWDAASVE MFSDDIDLAL VTFEPEKEKC DYPTLKIGNS EDIKQGSSIY VSGLSSRDGK MLSQFVKGNV TALNVFPQGY RVSYQALTVA GMSGAPVIDE RGKVVAVHGM SDVETVKGFS SLKTSWPELE LQTTWQAEEV VNTAIKHLTF SWGIPISFFR ESPFYYDSGD IYGLSWWIFL SGAGIVAVSF IYVGFRYLNV SPFIAEVNNL KTQLQDEEDK GEEVQKELKS LKNSYKGLER KLEAEISERS EAEEQIQTLQ VVVEKQKVLE VQLEIEMSTR YQVEEQIQTL QVAAQKEREL ERQLEFESQN TEDKSGVDLF LVSEVVGDYT KLRDLLAAKQ WREADLETYK RMLEVAGRNL KGSLRVKDVY NFPCKDLVTI DELWIKYSDG KFGLSVQKQI YESMGGTKDY DYKVIEDFGN RVGWRQDGKW LRYHYLTFSE KYEMGCLPVS FYLERVASLL FAGMCKFIDC DL
|
| |
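The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before use. The following minimal sketch strips the spacing, computes a GC fraction, and translates codons using the amino acid assignments shared by the standard code and translation table 11. It runs on the first 30 bases of the gene sequence only; the helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and alternative start codons permitted by table 11 are not handled because this gene begins with ATG.

```python
# First 30 bases of the gene sequence, copied with the record's spacing.
raw_prefix = "ATGAAACCTA TATTTCGGGC TATTATTCTA"

# Codon table built in the conventional T, C, A, G ordering.
# Table 11 assigns the same amino acids as the standard code;
# it differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


prefix = clean_sequence(raw_prefix)
print(len(prefix))
print(gc_fraction(prefix))
print(translate(prefix))
```

Applying `clean_sequence` and `gc_fraction` to the full gene sequence is one way to compare against the GC content field, and `translate` on the full sequence can be compared against the listed protein sequence after the same whitespace cleanup.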