Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45494 |
Symbol | |
ID | 7200710 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 348761 |
End bp | 351986 |
Gene Length | 3226 bp |
Protein Length | 348 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179834 |
Protein GI | 219118105 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTCCAATCG AATCAAACGG AAGCAGGCTG TTTCAGGCCT GGGTCACAAA GCGAACTTTC ATTTTTTGCA TTTGAGCACC TCAATACTCT TCATGACCAA AAATCTCGAA GTTGCAAAGA TTTTAACGAA ATGCTGGTAG ATCGAGTCGA AAACAACGAG AAACAACAGC AACAGATGGC GTCATCTTCC GACGCGATGT CGGACTCCTC TCTTTCGGAC GATGAGATTA TCGAGCACGT TGTCCATGGG AAAGAACCAA AGTCGACATA CGAACTTTCT TGGGTATCCA ATGCGATCGC TTGGAGCGGT GCTTTGGTAT GGCCTCTGAT GCTGACGGTC CCTTTGCTTC TTTCGTCGAT GTACTCCCCG ATTTCATATC GACAAGTTTT TCCCGAAAGC TGGTACGTCT ACGACACGTT ATCAAATTGC GCACCCAAGC CACTTGGTCT GGTGCTGGGC ATTCTAGCAG TCGCCGTGGG ACAGGTATTT GTATGGATTT TCTTTTATTT GTTCAAGTTT GGGTACTTGG GTACGGACCC TCGGTCGATT CAAAGCAAGG GAGCGCGAGA GTACATATTC CGCGAGGGTC TTCTTACCCA TATCGGTCAG CCAGAAGGCT TTGTACTTTT GATTGGTTAC TTAGCAATAA CCTGGATGCT GAAGCTTATG CCACAAAGCT ACTACTCGTT CGAAGGGACT ATTCAGTACA AAGAACTATT TATGTGTTTG GTTCTTCAAG ACGGTATCCA GTACACGATG CATGTTCTCG AGCACATTGT ATCACCAGCC TTTTATCAAA TGTCGCATAA ACCGCATCAC CGCTTCACCA ATCCTCGTCT GTTCGACGCC TTCAATGGAT CACTAATGGA CACATTCTGC ATGATTATTA TTCCACTCTT TGTTACCGCC AACCTTGTGC GACACTGCAA TGTTTGGACG TACATGGCGT TTGGTTCGTC CTACGCATGC TGGCTGACAT TGATTCATTC CGAATACGTC TTTCCTTGGG ACGGCATTTT TCGAAAGCTC GGATTGGGTA CTCCTGCTGA CCATCATGTT CATCACAAGT TTTTCAAGTT TAATTATGGA CATTTGTTCA TGTGGTTTGA TCAGCTTGGA GGCACCTATC GTGATCCTAG CGGATTCGCT CCCCGCGTGT TTCGAGAAAA CGTGTAGTGG TACCCTTTAA ATTTCCTTGT GCTTCTTGTA AGGATCGTAG CGGTGAAATA TAGTAAAATG TGTTTCTTTC AGTGTACTCC TCTTTACTGT CAGTCAACAT GGAAAAAAAG TTTTAAAGTA CAGACGTTTC ATACTTTGCT GAAAAGTATG TTCCGAGATG TTCAATCCTC ATTGATCCCA GAAAATGTTC CACATCGACC GGGTCGTTTG GCTGTACGGC TTTAAACTCG ACAGTCTTAG GCAGCAACGG ACTCAAATGA AACGAGTTGT CAGTAAACCT GCCATCTGCG GCACTAGACA AAACAACGTA AAGAGCAAGC CTATTGGTTT GTAGTTGCAT CAGAACACGG TGTGGGCTGG ACTCCAGAAT TTTCAGACCT ATACGTGTTT GAAGTAACAA TCCTGAAAGG TTGTGGGGAA CCTGCCACAA ATATGCACTC TCCTTCATCA AAACGCGCCC ATCACGCTTG TCTGTGACTC TGACAATAGC AACCTCTGCA TCTCGCAATA GTTCAGAATT CAGTGAGAAT CGTTGCGTTT GGGCAGCATT ACCTTCTAAA TATGTTTCCC ATCTACTACT TCTTATGGGC TTCCTGAGAG CGAAATTCCA TGCTTCAACA TTTACGACAA CGTTTACTCC ATACACCCCA TAGTTTTGAA CGTAGCAATT TCCTTCGGAC CCACAAGCCG CAAGCACGTC CCGATACAGG GCATCTCGAA GTATAAAAAA CACTGGCTTC CATCTCCCCC CGACAATTTG GAGTGGCAAG CCTTTTCCAG ATCCATATTC GAGCAGACCC CAGCCACCCG TTGGCCAGTT CTCGTTTAGT TGCCAAATCA ACGTTCCCAG AGAGTTCTTT GAACGTTGAG TCTCAATTTC TGATTTCAGC CAAAGTGTTT GGCTTATCAT GCACATGTAC AGTTGCTTTT GGAAAGCTTC GGAACCCACG TCGTCAAGAG ACACGTTTCC GAAATATATT TGGATTTTAT TATCACACGG ATAGTTTCTT TCGGCCATGG CATTTTCTCC GTAGCATCGA CTTTCGTCCC CAATAGCAGG CTCACAAATG TCCGGCAATT CTCCACCATG TAGGCCCCAA GCATGCACAG GAAGCAATGC CGACATGGAC TCGAAAGAGG AGAACGCCGA TGACCCAAAT TCGGAAACAA AAACGTTTGG ATGCGCTGGG CCTGTAGCTC CTTCGAAAGA AAACACAGGA GGAAGTTTGC TTTCAAACGT GTTGTTCGTG AGCATACCGT TGACGCCCGA ATGGGTTGGT GAAAATCCTC TATGGTAAGG TCCGTGTGTC TCCAATGCTG ATTTTCCCGT TGTGTTTCGG ATACACAAGG GTTTGCCGTT CGGCCTACCA TCGATTGTAC GGACGCCTGA CTCCCAACCG TAGTCCGAAG GACTGCTTGG CCAAATAGCT CGCGATTTGT CCAAGTTGGC AACGGCGGCA AGAACGAAAT TTTGATAGAC ATCCATGTTT CCTCCATCGT ACGTGCATTC ATTGCAGCCA CTCCAAAGAA CAATAGAAGG ATGTTCAGCT AACCTTTGAA TCGTTATCTC GATTTCGCGC TCCACGCCTT CGGTGACCGT AGCACCATGC TTTTGTTCCC CGACAAACAT GAGATCATGA TAAAGCAAAA TTCCGAGCTC ATCGCAGGCA TCATAAAATG AGTCGGGTAG TACAGCACCA CCACCCCAGA CGCGAAGCAT ATTCATTCCT GCATCCGCTA CAGACTGCAC CAATTCCCTG TGTCCCAGGT TTGTCCAGCG TCCTTCTAAC TGATCCATCG GGACCATGTT GGCACCGCGG CTATAAACCA GAGCACCATT AACTCGAAAG TACATACCAT GGTGTCCTGA TCCGTCCTGT CCGTCCTGCA TCCTCTGCAA TGTCGTGTCG TCAGTCTCGT TGTTCGTAAC CAGCGCAACA GTCCGGAATC CTATCCGGCT AGTCATCCAC TTGATCGGAT CATCGTGATA ACTTTGTAGC GAGACACGTA CGGTGTATAA AGGCTGTTCT CCCAAGCCAT TCGGCC
|
Protein sequence | MLVDRVENNE KQQQQMASSS DAMSDSSLSD DEIIEHVVHG KEPKSTYELS WVSNAIAWSG ALVWPLMLTV PLLLSSMYSP ISYRQVFPES WYVYDTLSNC APKPLGLVLG ILAVAVGQVF VWIFFYLFKF GYLGTDPRSI QSKGAREYIF REGLLTHIGQ PEGFVLLIGY LAITWMLKLM PQSYYSFEGT IQYKELFMCL VLQDGIQYTM HVLEHIVSPA FYQMSHKPHH RFTNPRLFDA FNGSLMDTFC MIIIPLFVTA NLVRHCNVWT YMAFGSSYAC WLTLIHSEYV FPWDGIFRKL GLGTPADHHV HHKFFKFNYG HLFMWFDQLG GTYRDPSGFA PRVFRENV
|
| |