Gene Information |
Locus tag | PHATRDRAFT_42426 |
Symbol | |
ID | 7196637 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 22835 |
End bp | 24639 |
Gene Length | 1805 bp |
Protein Length | 473 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176505 |
Protein GI | 219109501 |
COG category | |
COG ID | |
TIGRFAM ID | |
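The Gene Length row can be checked against the Start bp and End bp rows. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state, and that the coding sequence ends in a single stop codon. The function names `span_length` and `coding_length` are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int) -> int:
    """Bases needed to encode the protein, assuming one trailing stop codon."""
    return protein_length_aa * 3 + 3


# Values copied from the Gene Information rows above.
gene_bp = span_length(22835, 24639)
cds_bp = coding_length(473)

print(gene_bp)           # 1805, matches the Gene Length row
print(cds_bp)            # 1422
print(gene_bp - cds_bp)  # 383 bases not accounted for by the protein
```

The 383 bp difference is consistent with the gene being marked as spliced and with the listed gene sequence extending beyond the coding region, although the record itself does not give intron or UTR boundaries.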
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | CATCAACCAT GGACAAGGAA CAGGAACTTG GAGATAGTAG CTTTCATCCA CAACCCTTTC CGTCCGTGGA ATACAACACT TGCACTCTAT TTTAACAACG GACACCATGA TGCGCATGTC CATCATCTTC ATCTTCCTAG CCCTACTGGT GCAGTCAGCG ACGCCCAAAG TGCACCGCAG CGTTAATCGC AGCATTCAAA TCGTAAACGA GTCCGCCTCG AAAATTGAGA TATTCTGGGT TCATCCAGAG ACGAGAGAAC CCTCGCTCAT GTCGAATCCG TTCATTGTCC CAGGGGCCGA CTTTTCGCTG AATAGCTTTG TGGGTCATGA ATTTCTAGTT AAAGAAATGC CGGGCAAGAA TGGATGCCAA GTAGACTCGT GCAAAACTGA AAACTTCAAG GTTTCGCCCA ATGATGAGCA AGTGATTCGG GTCAGTCCAG AGATCACGGT AACCTTTGTG GACAACAAAA TCCGAGCTCG AAAAGAAGCT GACGAGCTCA TCAAAGCCTG TCAGGTGGAC GCTCGCAAGC GCGTAGAGCT AGCAGGGCAA GACAAGGCTG CCGCTCTGGA CGCTATGGAC GATCTTGTCA ACTGCGTTCA AGGAGGAGTT TCCTCTCGTC TCGAGACAGT CAACGAAGAA ATTGCCTTCC AAGCTTCGGT GCGGACGGAC ATTGCCGCTT TGTTGGAAAA TTACACTTGC ACGGACGACT CCCTAAATTC TTCGAAAGAC ATTACGACTC AGCAGTGGAA GCAAGCTGAC CTCACACGCA CGGTGCATAT CAAACACGAA CGACCCACCT CGAGAATTCA CGTGATTGAG AATTTTATTT CCGATGATGA GTGTGACGCT ATGGAAGCTG CCGCGCAAAA ATCTCTACAC CGGGCCACTG TCGCCGATGG GAAAGGAGGC TCTCGCCTCA GTGACAATCG CAAAGCCATG CAGGCTGGTA TCAAAGTTCC TTGGAAAGAC GAAGCCTCGG GTAATGCCAT AGCTCGCTTA AGCCGCCGTG TCTATGACTA CACAAATCAT GTCCTTGGAT TAGGAATCGA GGAGCACGGC CAGGAAGATC TCATGTCGAT CCAGTATTTT GGAAGGGGCA AGAACGATAC TGAACCTGAT CGCTACACTC CTCACTGTGA TGGCGACTGC ACTGGCCTCC CTCACAAACA CGGTACACGA ATGGCTACCA TGGTGATGTA TTGCGATGTG GCGGACCTTG GTGGGCGTAA GTTGCAGTGG TTCCAAAAAG TAGGCGACAC GTTCTATGAG TGTGTTGATT AACCCTGTGG TTTCTCTTTG TCTGCAGATA CGAATTTTCG CAACGCCGGT GTCCACGTCA AACCAGAACG AGGCTCGGGC ATCTTTTTTA GTTACATCGA TCCCGAAAAT CGTGTCATGG ATACTGGATT TACGGAACAT TCAGGTTGCC CGGTGTTCGA GGGCGAAAAG AAGATCGTTA CGCAGTGGAT TAGGCTAGGT GTTGACACTG AGAATCCTTG GGACAGTTTT AACACTCTCG GAATCAAGAA GTCCGAAATG GAAGATTTCG AATCGGACGG CGAGGAAGAA ATTGATGAGA CCGAAGATAC TTCTTCGGAT GAGTTGTGAA GTGCTTTCAT CTCGCAAAGT CTTTATTGAC ACCTGCACAT TTGCGACCAA GCACAGTTTA CCATATGTCT GTTACTCCTA CGGTTTTGTT AATGCAAGCG AAATTACGAC TGCACCCTTT TGCAGGGACC TGCACTTGCC TGGACAGACC ATAACATCAT ATTAGTTTAG CATGAGTCAT CAATCGCGCT 
TTTCA
Protein sequence | MMRMSIIFIF LALLVQSATP KVHRSVNRSI QIVNESASKI EIFWVHPETR EPSLMSNPFI VPGADFSLNS FVGHEFLVKE MPGKNGCQVD SCKTENFKVS PNDEQVIRVS PEITVTFVDN KIRARKEADE LIKACQVDAR KRVELAGQDK AAALDAMDDL VNCVQGGVSS RLETVNEEIA FQASVRTDIA ALLENYTCTD DSLNSSKDIT TQQWKQADLT RTVHIKHERP TSRIHVIENF ISDDECDAME AAAQKSLHRA TVADGKGGSR LSDNRKAMQA GIKVPWKDEA SGNAIARLSR RVYDYTNHVL GLGIEEHGQE DLMSIQYFGR GKNDTEPDRY TPHCDGDCTG LPHKHGTRMA TMVMYCDVAD LGGHTNFRNA GVHVKPERGS GIFFSYIDPE NRVMDTGFTE HSGCPVFEGE KKIVTQWIRL GVDTENPWDS FNTLGIKKSE MEDFESDGEE EIDETEDTSS DEL
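Both sequences are displayed in space-separated blocks of ten characters, so the whitespace has to be removed before lengths or base composition are computed. The following is a minimal sketch of that cleanup. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only short prefixes copied from the record are used here rather than the full fields.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, line breaks) from a block-formatted sequence."""
    return "".join(raw.split())


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string (0.0 for empty input)."""
    dna = dna.upper()
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First blocks of the Protein sequence and Gene sequence fields above.
protein_prefix = "MMRMSIIFIF LALLVQSATP KVHRSVNRSI"
dna_prefix = "CATCAACCAT GGACAAGGAA"

print(len(clean_sequence(protein_prefix)))      # 30
print(gc_fraction(clean_sequence(dna_prefix)))  # 0.45
```

Applied to the complete fields, the same helpers should reproduce the Gene Length, Protein Length, and GC content rows if the record is internally consistent.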