Gene Information |
Locus tag | PHATRDRAFT_48565 |
Symbol | |
ID | 7194731 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 225749 |
End bp | 227771 |
Gene Length | 2023 bp |
Protein Length | 648 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183052 |
Protein GI | 219125575 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
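The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence includes one stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 225749
end_bp = 227771
protein_aa = 648

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is the protein length plus one stop codon, three bases each.
coding_bp = (protein_aa + 1) * 3

# Bases in the gene span that are not needed to encode the protein.
non_coding_bp = gene_length - coding_bp

print(gene_length, coding_bp, non_coding_bp)
```

Under these assumptions the span matches the listed Gene Length of 2023 bp, and the bases left over after subtracting the coding sequence are consistent with the record's "Is gene spliced | Yes" entry.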
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.767614 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGTGA CTCTGAAATG TACTCCGAAC ATTCCAATAA TCGCTAAATA TTTGGCGACC GTTCCCGGGA TAAAAAGGAC CACGAAAAAA ACTCTCTGCT ACCTCGGCCG ACTCATCAAA ATTCTCGTGG TGGCTTTGGA CAAAGATACA ATGAGATTTG CTTCGGCACT GGCATTACTA ACGGTGTCGT CGCTCTACCA AAGACTGGCG GTACGCCGAG GAGAATTTAC GGCTGGCGCT TTGGAGCAGC CGGGACGGAG GCGGTCTCAA AGAGTCGGAG CAGCTCACCT TAAACTGGAA CACTTCAGCG TGAATACCAT ATCACTTTCA TCAAGCGCGG CACTCGATGA GAAAGAATCG ATCGGGTATG AGGAGAATGT TCCATCTTCA ATAATCAACG CCCTAGGGGA GTTCGACCCT GGCGGTGGTT CCACTGATGT CCCGTCAGCC GTTCCCTCGA AAGTTTCCTC AGACTCACCT TCTGATACTC CAAGTGATAC TCCGCAGAGG ATCATACCCA CCTTTGCGTC AAGTAGTTAC CCTTCATTGG TTCCCACCAT GTCAACGTCG GATGTCCCGT CGTCGCTTCC GTCTGATGTT CCGTCAACAG TACCATCGGA TGTCCCGTCG TCGATTCCGT CGGATGTGCC GCCCACCACA CCATCGGATA TTCCTTCTTC AGTCCCCTCC GACGTGCCGT CTACCATACC GTCGGATGTC CCTTCGCTTT TTCCGTCTGA TGTACCGTCC ACTATACAAT CGGATTTTCC TTCTTCAGTC CCCTCGGACT TGCCTTCCAC CATACCGTCG GATGTCCATT CGTCGATTCC GTCTAGTGTT TTGTACTCTA TACTATCAGA TGTTCCTAAT TCTCTCTCAT CCGAAGTACC TTCAGAACTT CCATCTTATA TGCTAGCAAA CGTACCTTCA TTCAAGCCAT CGAATGTTCC GTCATTTTCA AGTATACCTT CAGCTATCCC GACAACAACC AGTGAAATTT TAACTTTGGC AATGTCCAAT CTTCCGTCAA GCCTTCCCTC CGGCATGATG TCTGTACCAT CTGATCTTCC TTCAATTACA CCGTCAAATG CGGTGTTGGT AACATCGAAA AGCCCATCAA AAATGCCATC AAATGCAACA TCAACGCCAC CCTCCCAAAA TCCCAGCAGA ACGCCAAATG GCTATCCCTC ATTCCGTCCA GATCCTTGGC CCACACAACT CCCGATTTCA AAAATGCCCA CAGCATCACC TTTATTTGCA CCCACTTTTA CCCAGAATGA AGGCACTTGC GATGGGGATG GCTATACTTT GAATCCCTCT CAGGCTACGT TTCTTGCTCA AAACGATCGT GAAGCGGACA AAAGGATTGT TTTCAGCTAC GAACTTCATT TATCAGAGAA TGCGACAGAA GCCCCAATTA TTATCGAAAG TGTCGAAAGG CGCATACTTC AGGCCGTCCT AGGGGGTTAC TGTATCACTG GAGACATGCG CAGGCTACAA GTTGCCCCCG TCTTCGACTC CAATCCTCCA GATTTTCCGG TTGGTAAGTC TGAGTTGACC AGGCTTTGCA AAACTCTACC AATCCATAGT CGCTTACCGT TCTCCTTTGT TTCGATAAGG CTCTTGTGGC GACAAATGTA CCTTGGTATA CGGCTGGATG ATGGCAACTG GGGAAGATCC CGGCCTCTTT TGCAAGGTGA AACAAATTAT TTATGACTCT ATCATCAGTG GCGCGTTGGA GGATGTTGAT GGTGTCAGCG AGCTTGTTTA TAAAGAGGTT CGCGTAGCGC CTTGCTATGT 
CTCGGTGGAT GGCCCACCGG GCGGTTCCAT TAGTGCAGTT GAAGGTAGTC GATCCCGTCA ATCAAGTAAC TCGGATTCCT CACCAGTGGG GTTTGTAGCC GGAATTTCCC TTTTCGCTGC TTTTATTGTT TTTGCAGCGG CCGTGCTGGT AATCAGAAAA CAACATCGCT CCAGTGAGGA AGCTAATGTC CAGAATCGCA GTGACATCTC CCCGGATATT TAG
|
Protein sequence | MFVTLKCTPN IPIIAKYLAT VPGIKRTTKK TLCYLGRLIK ILVVALDKDT MRFASALALL TVSSLYQRLA VRRGEFTAGA LEQPGRRRSQ RVGAAHLKLE HFSVNTISLS SSAALDEKES IGYEENVPSS IINALGEFDP GGGSTDVPSA VPSKVSSDSP SDTPSDTPQR IIPTFASSSY PSLVPTMSTS DVPSSLPSDV PSTVPSDVPS SIPSDVPPTT PSDIPSSVPS DVPSTIPSDV PSLFPSDVPS TIQSDFPSSV PSDLPSTIPS DVHSSIPSSV LYSILSDVPN SLSSEVPSEL PSYMLANVPS FKPSNVPSFS SIPSAIPTTT SEILTLAMSN LPSSLPSGMM SVPSDLPSIT PSNAVLVTSK SPSKMPSNAT STPPSQNPSR TPNGYPSFRP DPWPTQLPIS KMPTASPLFA PTFTQNEGTC DGDGYTLNPS QATFLAQNDR EADKRIVFSY ELHLSENATE APIIIESVER RILQAVLGGY CITGDMRRLQ VAPVFDSNPP DFPVGSCGDK CTLVYGWMMA TGEDPGLFCK VKQIIYDSII SGALEDVDGV SELVYKEVRV APCYVSVDGP PGGSISAVEG SRSRQSSNSD SSPVGFVAGI SLFAAFIVFA AAVLVIRKQH RSSEEANVQN RSDISPDI
|
| |
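The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below strips the whitespace, computes a GC fraction, and translates the first three blocks of the gene sequence. It assumes the standard genetic code, because the Translation table field in the record is empty; the helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Assumption: standard genetic code (the record's Translation table field is empty).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and uppercase the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0; trailing partial codons are ignored."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the Gene sequence field.
prefix = clean_sequence("ATGTTTGTGA CTCTGAAATG TACTCCGAAC")
print(len(prefix), gc_fraction(prefix), translate(prefix))
```

The translated prefix agrees with the first block of the Protein sequence field (MFVTLKCTPN). Because the gene is marked as spliced, translating the full genomic sequence in one frame would not be expected to reproduce the whole protein; intron bases would have to be removed first.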