Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4979 |
Symbol | |
ID | 4246634 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 7604596 |
End bp | 7606332 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 638109790 |
Product | WD-40 repeat-containing protein |
Protein accession | YP_724366 |
Protein GI | 113478305 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [R] General function prediction only |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.363391 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTAA AGAAAACAAT TGTCATAATT ATTATTAGCA ACTTACTCAA TATAAACACA ACCAATATCA GGAATTTTAT TGTTGCTATT ACCATCACTA ATTCACTAAT AACACAAAAA ATTGCTCTGG GATCAACAGC ATCAGAAATT AATAAAATTG CAGCCGAAAT TACAGTTAGA ATTGACGGAA GAAACAGAAG TGGCTCAGGA GTAATTGTTG AAAAACAAGG AGACACATAT TATGTACTCA CCAACTGGCA CGTTGTTAAT ACAGTAGGAG ATTACCAAGT ACGTACACCT GATGGAAAAA TGCACCCTGT TTACTATACC TTAATCAAGC AATTGCCAAA TATAGACTTA GCCATCATAC CATTTAGCAG TTCTCAAAAC TACCCCATTG CCGAAACAGG AGACTCCACT AAATTAGTTT CTGGAACTAA GGTATTTGTG GGGGGATGGC CTCGTTCTGG TAGCAATCTC AAGCAACGAA TTTTTCTGAG CACCCCAGGA GTGGTGAAAG GTCGTCAGCA ACCTGTTGCT GGTTATAGTT TACTATATGA TAATTTAGTC AGAGCGGGGA TGAGTGGTGG ACCTGTACTT AATGAAGAAG GTCGCTTAGT AGCTATTAAT GGTATTGTCA AATTGCAAGA AAACTCTGAT GTCATTGTCT CAGGAGGGAT TGAAATTAAT ACTTTTTGGG ACTGGCGACA AGGAGTTTCA TTGCCCATTG TTTCTCAAAT ACCTGAAAAT ATCCAAGCTC CAGCAAGTCC AACAGAAACA ACAAATCCTG TTGCTACAAA TACTGCTACC AGAGAGGACG ATACTGAAAC TGTCCTGGAC TTTACTCTCG CAAAAGCAAT TACTGATGAA ATTAGTGGAA TAGTTAATTC AATAGTTGTT CTAAATGCTT ACATCGTTAT GGGAAGCAGT AATGGTATGA TTTCAGTTTG GGATATCGAG AATAGAGAAA TTATAGCTAT CTGGAAAGCA CATCCAGAGT CAGTAAACTC AGTTGCAGTA ACTCCTGATG AACAATTTGT TATTAGTGGA AGTGATGACA AAACTATTAA AATATGGAAA CTACCTAAAA ATAAAAATAT CAATGATATT AGCTTGGTGC AAACTCTCAC AGGACATACT GATGTGGTAG ATGGAGTTGC GATCGCTCCC AATAGTAAGA TTTTTGCTAG TGGGAGTTGG GATGGGACTA TTAAAATTTG GAATTTGGCT AGCGGGGAGT TGCTGCAAAC AATAGCCGGA CATTCTGAGA TAGTCAATGG GATCGCCATT AGTCCAGATG GGCAATTTTT AGCTAGTGGT AGCAAGGATA ATCAGATTAA ATTGTGGAAT TTGCAGACAG GACAACTTGT TCGTACTATT AATACTAATT CTGTTTCAAT TTTGTCTGTA GTTTTTAGTC CTGATAGTCA AATCTTAGCT AGTAGTAGTA GTAATGGCAC AATTAATATT TGGAACTTAC AGACAGGTAA ATTAATCCAT AATTTAAAAG AACATTTAGA TGGAGTTTGG TCAATAGTTA TTACTCCTGA TGGAAAAACT TTAATTAGTG GTAGTTGGGA TAAAACAATC AAATTTTGGG AACTGAGTAC AGGTAAATTA AAAGGAAGTT TAAGGGGACA CAACAGTTAT ATTAGTGTCG TAGCAATTAG TCCTAATGGG CAAATCATCG TTAGTGGTGG TTGGGACCGG AAGATTAATA TTTGGAAAGC TCCCTAA
|
Protein sequence | MNLKKTIVII IISNLLNINT TNIRNFIVAI TITNSLITQK IALGSTASEI NKIAAEITVR IDGRNRSGSG VIVEKQGDTY YVLTNWHVVN TVGDYQVRTP DGKMHPVYYT LIKQLPNIDL AIIPFSSSQN YPIAETGDST KLVSGTKVFV GGWPRSGSNL KQRIFLSTPG VVKGRQQPVA GYSLLYDNLV RAGMSGGPVL NEEGRLVAIN GIVKLQENSD VIVSGGIEIN TFWDWRQGVS LPIVSQIPEN IQAPASPTET TNPVATNTAT REDDTETVLD FTLAKAITDE ISGIVNSIVV LNAYIVMGSS NGMISVWDIE NREIIAIWKA HPESVNSVAV TPDEQFVISG SDDKTIKIWK LPKNKNINDI SLVQTLTGHT DVVDGVAIAP NSKIFASGSW DGTIKIWNLA SGELLQTIAG HSEIVNGIAI SPDGQFLASG SKDNQIKLWN LQTGQLVRTI NTNSVSILSV VFSPDSQILA SSSSNGTINI WNLQTGKLIH NLKEHLDGVW SIVITPDGKT LISGSWDKTI KFWELSTGKL KGSLRGHNSY ISVVAISPNG QIIVSGGWDR KINIWKAP
|
| |