Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_31973 |
Symbol | |
ID | 7196453 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1378558 |
End bp | 1380522 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177281 |
Protein GI | 219111059 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCCGG ATGAGAACTA TAAACCAAAC TTGGTGGAAG AGGAAAAGAC GTCGATTATG CAGGTCATCG CGCCGGACCC AAACTATCAG AAAAGGCTGC AAGAAGAACA AAATTCGAAG AGAATTCCTC CTTCCGAAGC TTTTCAACAA CGTGTGCAAT GCATACATTG TCATAAGTAC ACACCCGGAA TGGTAGTCCA GCAGCCTGCA AAGGCTTCAT CGGAGGCTGC GAGCACCCAG TCGAACGCTC GCCGTCTTAC ACCGACAACA CCTGGCTTTC ATACTCCTTC TCCTACGACG CCACTCACTC AATCATCCGA TCATGTTAAT GGTTCGGAAG TTCATTTAGT GGAGCATCGT GTCCTGCTGG CCGTCCGAGA TACAGCGTTT GATTTAGCGT CACCGGAGTT TGCCAAGAAA TCATCCAAAG ATGTCTTTCG CTGGCTCGCT AGTGCACAAC CCATCAACGA CAAGCTTCAA AAGCGATTGG AACGCCGCAT TAGTGCAGAC CCATCGATAT GTAAAGCACG TTCTCATGGC ATGGAACCCC TCTGTCCCGA TGGACTTACT CCGTTTTTGC TGGCGGCCCA TTCCAACCAA GTTGCGGCAG CCAAAATTTT ACTCCTATTG GGACCCCGAA CCGAGCAATT GCAGACCGTT AATTTACAAG GCAAGTCAGC ATATCACCTT GCCGCAACTC GCGGCAATCT CGAATTTCTT GATTACATCA AGACCGTATA CGAAGATCCG CAAAATGGGA CTCTCTTCTC TTCCCCGACA CCCGTTGACT TGCTGGGACG TACACCGCTC GGGGCCGCCT TAACGAGTCC TGAACCCCAG GCCAAGCGAA ATAAACAAAC AATGATGGAC AGGTTGTTCT CACCCCAAGA CATGTCGATT TTGGGTAGTC CCGCCCCTGT AACGCAGAGA ACAGCGACAC TTTCTGAACT ACAGCTTGCG TACGGATCTT CCCACATGCC TGGCAAGCGT ATTATGAATG AAGACGCAAT TCTGACAACC AAGATCCTTC TCAGCGACGA TTCTACTGTA GGAGTCTTCG GCGTCTTTGA TGGCCATAGC GATGCTGGAA AGGTTTCGAA CTTTATTGCG TCTCAGATTC CACACGCTCT ACGCGATGCG ATGCAACAAG CAGGCGACTG GAACAGCTGG TGTCGACATG CATGTCTCGA AATTGATGCG AATTTGAAAA AAAGCAACAT TGCAGGTGGT TCAACCGCCG TCTTTGCCAT AATTACGCTG GATCAAATTG TTGTTGCCAA CGTGGGTGAC AGTCGCTGTA TTCTAGTACA ACACGATTCA GTTAGTGTGA GCAATGTCGC GGAGGGTGTG GAAAGGCTAT CCATTTCTGA AACGGTATAC CCGCAAACAG GGACCGAAAC GAATACCTTA AGTGGCGCGT TTTTAGTAAA GGCGTTGTCA GAGGATCACA AACCCGAGGC TTCGGCGGAA CATGCCCGTA TTCAAGCCGC AGGCATGACC ATCACGGAGG AACGGTTCGA AGAAGATGGC GAAGAAGTTG TCATTCACAA GGTCCGGTTG TCGGACGGCA ATCGCATGGC ATGTTCCCGC TCCTTTGGAG ATTTTGAATA CAAAGCCAAC GAGACTTTAG AGGCCGAATC GCAAGCTATA GTTGCTGTTC CTGACGTTGT CGTTCACGAA CGCAGTCATG CTGACTGCTA TCTTGTTTTG GCGTGTGACG GCATTTGGGA TGTCATGAGT AGCGATGAAG TGGGACAATT TGTAGTGGAG CACATCAAAT CATGTGGCGA AACAGAAGGC GTTTTGCCCG AGGTTGGTGA CCGGCTGCTG GCGGAATGTT TGCAGCGCGG CTCTGGAGAT AACCTCAGTG CCGTCGTGGT AGCCCTCTCA AACTCAGCCG AGCATTTGTC TTCTGGACAA GTGTTGAAAG GCAAGGCACT GGATTTTTCG GGAACGCCAC CGTGA
|
Protein sequence | MAPDENYKPN LVEEEKTSIM QVIAPDPNYQ KRLQEEQNSK RIPPSEAFQQ RVQCIHCHKY TPGMVVQQPA KASSEAASTQ SNARRLTPTT PGFHTPSPTT PLTQSSDHVN GSEVHLVEHR VLLAVRDTAF DLASPEFAKK SSKDVFRWLA SAQPINDKLQ KRLERRISAD PSICKARSHG MEPLCPDGLT PFLLAAHSNQ VAAAKILLLL GPRTEQLQTV NLQGKSAYHL AATRGNLEFL DYIKTVYEDP QNGTLFSSPT PVDLLGRTPL GAALTSPEPQ AKRNKQTMMD RLFSPQDMSI LGSPAPVTQR TATLSELQLA YGSSHMPGKR IMNEDAILTT KILLSDDSTV GVFGVFDGHS DAGKVSNFIA SQIPHALRDA MQQAGDWNSW CRHACLEIDA NLKKSNIAGG STAVFAIITL DQIVVANVGD SRCILVQHDS VSVSNVAEGV ERLSISETVY PQTGTETNTL SGAFLVKALS EDHKPEASAE HARIQAAGMT ITEERFEEDG EEVVIHKVRL SDGNRMACSR SFGDFEYKAN ETLEAESQAI VAVPDVVVHE RSHADCYLVL ACDGIWDVMS SDEVGQFVVE HIKSCGETEG VLPEVGDRLL AECLQRGSGD NLSAVVVALS NSAEHLSSGQ VLKGKALDFS GTPP
|
| |