Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1950 |
Symbol | |
ID | 4244374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 3023400 |
End bp | 3025337 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638107069 |
Product | protein serine/threonine phosphatase |
Protein accession | YP_721676 |
Protein GI | 113475615 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0631] Serine/threonine protein phosphatase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.996513 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0303353 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTAATT CTGCGCCAAT AATTCATTGT CCAAATATTA ATTGTTCTAG TGTTTCTCAC TCTTGGGATC AGGAAGAGTG TGAAGCTTGC CAAACTAAAT TAATCTATCG CTATTTATGG GCAGTTGCTC CAAAAGTCGA GATTTCTGTA GGGGAGTTAT TGGGCGATCG CTACTATGTT GTAGCTCCAC AAGTTTGGCT AGATACGAAG CCTGGTGAAC CTCCATTTAT GCACGAGGAA TTTCCAGATT ATATTATGCC TTATCTACGC TTATATCCTG AGAGGCTTCA TACACCAGAT GTATATGGTT TTTATCAGTA TGGAGAAGAA ACATATCCTA CAGATATTCT ATTATTAGAT AATGTTCCTC TGGACTCTAA AGGTAATTTG CTTCCTAGTC TTGTAGAAGC TTGGCCCACA GCAAGTCCTG TACGTCAGGT CTACTGGTTG TGGCAAATGA TGCAATTATG GAAACCTCTA TCAGAAGAGC GTGTAGCTTT TAGTTTATTG ACCAATAATT TACGGGTGGA AGATTGGCGA TTACGATTGT TAGAGTTAAA CCTAGAAACC CGTAAAACTA AACTTAGAGA TTTTAAGATT TTTTGGTCAA ATTTGCTTCC GACAGCTCAT CCTAACGTTC AACAACCATT GACAGAGATT TGTCATATGA TTAGTGATAT TGGGGAATCT TTAGAGGCGA TCGCTCCGAA ATTAAACCAA CTACTATTAG AACAGGCAGC CAAATTACCA TTAAAATCCG ATATCATGGG GGCGACGGAT ACAGGTCCCG TGCGTATGCA TAATGAAGAC TGTTGTTATC CTACAGAAGA GGACTTAAAT AGTAATCAAC TAGTGCCCAA TTTAGCAATA ATTTGTGATG GAGTGGGTGG CCATGATGGC GGAGAGGTAG CTAGTCAACT TGCAGTACAA TCAATAAAAA AATTAGTTCA AAATTTATTA ATAGAGGTAG AGCAACAAGA GGAGTTAACA TCCCCAAACT TAGTCAATAA ACAACTAGAG GAAATTATTC GGGTAGTCAA TAATATGATT GCTACGGAGA ATGATGAACA AGGAAGAGAG TCTCGACAGC GCATGGGTAC AACACTAGCC ATGGCATTAC AGTTACCCCA ACAGGTGAAA CCTTCCCCAG AATCTAGTCC AAATAATGCC CATGAGTTAT ATATAGTTCA TGTGGGAGAT AGTCGTGCTT ATTGGTTAAG TAGAAATACT TGTCAGTTGT TGACTGTGGA TGATAATGTT GCTTCTAGAG AAGTTGGTTT GGGTCATTGT TTGTATTGGC AGGGTTTAAA ACGTCGTGAT GGTGGTGCCT TGACTCAGGC TTTGGGTACA AGAGATGGGG AGTTTATTTA TCCTACTATT CAAAGATTTT TGATCGAACA AGAGGGGATA TTACTTTTAT GTTCCGATGG CCTGAGTGAT AATAATTGGG TTGAGAATTC TTGGGCTGAA TATGCTTCTA AAGTTTTCCA GGGTGAAATT TCTCTAAAAC AGGGGGTAGA GTCTTTAATT GAGCTGGCGA ATCATAAAAA TGGTCATGAT AATACTTCGG TGGTGCTTGT CCACTATCGT CTTTCTCCAG AGAAGTTAAT CTTATCTAAT ATTCCTACAG TAACAACTCC AGAAAAGGAA ATTTTAATGA CAGAATTGTC TGAGGCTTCT AAAGAGCTTC TTTACAATAA TGATGAAGAA GTATCAGAAG CTGAGTCTGA ATTTCAGTTG GAACCTGTCC AAAATTTTTT CTGGTTAAAA CCCTGGATGT GTATGATGGG TGTCTTATTT ATCTTGTTGG TGGGGGCAAT TATATCGGTA TGGCGCAATT GGGAAAATTT TCAGTCAACT CCTAGGGATA GTCCTGCACC TTGGGAGAGT CCTATGAGAC CTGAGCCTTC AGAAACTAGA ATTTCTCCTA GTTCTTAA
|
Protein sequence | MINSAPIIHC PNINCSSVSH SWDQEECEAC QTKLIYRYLW AVAPKVEISV GELLGDRYYV VAPQVWLDTK PGEPPFMHEE FPDYIMPYLR LYPERLHTPD VYGFYQYGEE TYPTDILLLD NVPLDSKGNL LPSLVEAWPT ASPVRQVYWL WQMMQLWKPL SEERVAFSLL TNNLRVEDWR LRLLELNLET RKTKLRDFKI FWSNLLPTAH PNVQQPLTEI CHMISDIGES LEAIAPKLNQ LLLEQAAKLP LKSDIMGATD TGPVRMHNED CCYPTEEDLN SNQLVPNLAI ICDGVGGHDG GEVASQLAVQ SIKKLVQNLL IEVEQQEELT SPNLVNKQLE EIIRVVNNMI ATENDEQGRE SRQRMGTTLA MALQLPQQVK PSPESSPNNA HELYIVHVGD SRAYWLSRNT CQLLTVDDNV ASREVGLGHC LYWQGLKRRD GGALTQALGT RDGEFIYPTI QRFLIEQEGI LLLCSDGLSD NNWVENSWAE YASKVFQGEI SLKQGVESLI ELANHKNGHD NTSVVLVHYR LSPEKLILSN IPTVTTPEKE ILMTELSEAS KELLYNNDEE VSEAESEFQL EPVQNFFWLK PWMCMMGVLF ILLVGAIISV WRNWENFQST PRDSPAPWES PMRPEPSETR ISPSS
|
| |