Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45558 |
Symbol | |
ID | 7200740 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 544327 |
End bp | 546484 |
Gene Length | 2158 bp |
Protein Length | 534 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179669 |
Protein GI | 219117760 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.402088 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTGTCGATG TCTACTGCTT CACTTACTAC CAACTTGATA AAAGATGAGT GAGGAACGCC TCAACCAGAA AACTATAGAC GACATGGCCT CTGAAGTTGC ACACCTACAC AGTACAAACA CATCTCCACA CAGTCCATCG ATACACGCGG CTGGAACAGA TCAGGGCGAT CATGCCTATT TTGGCGTTGA TCAGATAGCG TGCATGGAAC CGATTGGCTC AGAGTTCCGG CGAGAATCTT TTTTGACTGA GTTTTCCCAG TCTCGAGGTC CACCCCAAAT TGTACTCTTG ATTATTCTAC TGGCTCTAGG ATTTGGTTCG ACTATCGGTG TGGTGGGTCT TTTTGAGATT TAGATTTAGG CCATTCCAAC CTGCTGATTC TCACCGTACA TCAATCATCA CCCTTGCAGG TACCTGCAGT AATGACGGAT CGATATGCGC GCCTTAATCA TGACTATTCC GACCAAACCT ACTGTGCTGA CTATTCGATG AACGATAAGC CAAAGGCATG TTTGCTTGGG TCTGCCGATG CGCAGAATGC CGTTGCGTTT GAGCAACTTA TTTCGAATAT CTTTACTTTC TTTACAAGCT CACTGATAGG ATCTCTTAGC GACGAATACG GCAGAAAAGG TTCGTGCGAC CTAGATCTAT GTTATTAAGT GGCCTTGCTT GAAAAGTTCT GATTCCTTCC ATTATCATAT GTAGGAATTC TGACGTTGGG AGTCCTCATG TCGACCATGT CTCCCCTTTG TCTGTTGCTC ATACAGCTCC GGCCAGAAAT GAGTCCCTTT TGGTACTACA CCGTTGGCGC TGTTCAGGGG TTGATCAGTT GGATCACCAT TGCACTTTCT GCACTTTCGG ATGTCATGCC TCCCAAATGG CGAGCGCCAA GTTTTGGCTT ACTGCTAGCA GGCTTTTCCT TGGGATTTGC CATGGCTCCC CAATTGGCCC TTATTCTTGG TCATTTTTAT GTCACTGTCG TTTCTCTCTT CATGGTTCTG TCGGGATTGC TTATTGTGGT GTTTTTCTTC CCCGAAACTT TGCGCCCCGA AACGGCGCGC GAAGCCCGTC GTGTGCGTGA AGCGCAAGTG GAGGATCTTT CTGCATCAAA ATTAGCACTA TCAAATATTT TGAGGCCGAT GAGGGAGCTG TCTATTCTAA ATCGAAACCG CCTTTTCCGT TTGTTGTCGC TTTTGGCATT TTTTAGTGGT CTGGTAACCG CAGGAGACCG AACACTTCTT ATTTACTACA TTGAAGAACG TTTAGGCTTC GGCGACAAAG ACATTGCGAC CATGTTCATG ATCATGGGCG TTCTGGGTAT ATTTGTCCAG GGCGTCGTCT TGAAATTACT GAACGAGGCT ATTGGAGAGC GAATGGTTGT CACATTATGC TTCTGCCTTG GATCATTTCA CAATTTGCTT TACGGCCTCG CCAAAGACAA AACCACTATC TTCCTTGCGG TTGCCATTTC TGCCTTCGGG GGGATGGCAT TTCCGACAAT TTCTGCGATC AAGGCCAACA ATGTTGTAAG TCGATGAAGA ACAAAATTGC TCTGCAGAAT CTTTGGTGGA ATCACTTACC TGATCACGTG AGTTGCACAG AATGAGTCCG AGCAAGGCAG AATTCAAGGA GCACTTTTTT CACTCCAAGC ACTCGCATCA GCCACTGGTC CCATGCTTCT TCGATTCATC TACCATCTCA CAAAAGACGG TGCCTTTCTC GGTCCTGGTT CAATGTTTGT AGTGGCTTCG GGAATATACC TGATTGCGGT ATATTGCGCT TACTCTCTGC CGGTCAGTTT AGTCGCAGTC CAAGAAGAAC GAATCTCTAC TTAGTCCATG ATTCTCACCC ATTGACGTGT CTCATTTCAG GATGAAGCAA ACTCGCTGCG CAAAGAGAGG CCGTTTGCCG ACGTTGATAT GGACGAACCC TCTTCAAATA TGCCTCTATT ATAGCTTCGT GGCGAGACTA TATACTTTGA ATTTCGTTCG TTAATTTTTA GAGGATTGCT CCCAGAAGAA GGTACTCAAG AAGGTTTAAA AGGTAATCGT CCAGAGTTTA TCGGATCCTA TCGAAGTCGT CAGTTCTGCA GCCAGAATGC CATATCTTGG AGCAAATACC CCTAAGCTGG TAAATGCTAC CTCTATGCTA TACCTTCG
|
Protein sequence | MSEERLNQKT IDDMASEVAH LHSTNTSPHS PSIHAAGTDQ GDHAYFGVDQ IACMEPIGSE FRRESFLTEF SQSRGPPQIV LLIILLALGF GSTIGVVPAV MTDRYARLNH DYSDQTYCAD YSMNDKPKAC LLGSADAQNA VAFEQLISNI FTFFTSSLIG SLSDEYGRKG ILTLGVLMST MSPLCLLLIQ LRPEMSPFWY YTVGAVQGLI SWITIALSAL SDVMPPKWRA PSFGLLLAGF SLGFAMAPQL ALILGHFYVT VVSLFMVLSG LLIVVFFFPE TLRPETAREA RRVREAQVED LSASKLALSN ILRPMRELSI LNRNRLFRLL SLLAFFSGLV TAGDRTLLIY YIEERLGFGD KDIATMFMIM GVLGIFVQGV VLKLLNEAIG ERMVVTLCFC LGSFHNLLYG LAKDKTTIFL AVAISAFGGM AFPTISAIKA NNVNESEQGR IQGALFSLQA LASATGPMLL RFIYHLTKDG AFLGPGSMFV VASGIYLIAV YCAYSLPDEA NSLRKERPFA DVDMDEPSSN MPLL
|
| |