Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49981 |
Symbol | |
ID | 7198768 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | + |
Start bp | 19020 |
End bp | 21820 |
Gene Length | 2801 bp |
Protein Length | 544 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184813 |
Protein GI | 219129265 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000710505 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGCAATCGAT GGTGACTGGA AGCTGAATCA GTTCTTTCAT TTCTATCTAC GAGTCCCAGG GCCGTTTCGT TACTTTGAAG TACCGGGTGA AAGAAAATAT ATTTGGAGGG TATCTGAGGA GCTGCAGAGC TAAAGGGGCT AGCAATTAGT ACTCGAATAG CGGGCCACCG AATTTCGAAG TTGCTCTTTC GCTTTATCTC GACAACGAAC ACGATCGACT CTAAAATGCT ACAGAAAAAC CATCGTATCT TATTCACAAT CAACTTTCGT CTCCGATATG CTATTCCTTG GTTTTTGGTT TTGGCGCTGT CGCCGTCCGG GGACATGGCT TACATTGGAA AGTTGACTAC ACGTCCTTTG AAAAAAAATT CCTCTCCGAC GTTTGTTACG GTAGCAAACC TCGTTGGGGA CTTCGGGCGA CAATCTGGAG GGAGTGGTGA AATCTTCATT CGAGAGTCGG CCAAGAGGAG TTTTGCTTTT GGACAGTTTG GGGTCAAAAT TGAAACTCGT GACTGTGAGT CGCGTGGTAC ACCTTGATAA TTCACAGGCA TTTTCTAGTC AAAATTTTGG AAACATATTT CGTGGGACTG AATTGACTGT AAACCGAAGA TGGCAGATCT GCACGTACGC ACGCAGTTTC TGTCAAGTCT CTACAGGCTT TGTTTCTAAC TGAAGACCTT TTTTGCTGAA CGTCATGAAG TTTACTGAGA GTTCGCTGTT CAAAGTGTTA CTACATCTAG AGTGGTAGTG TCGAATACTG TGCTTTCTTG AAAGCATTGG TAGTGAGAAC AGTAAGCTTT GATTACCGTC ACAGTCGGTC ATCGCTACAA ATGTCTGACG TTGTCCTCAT TCCATCCACG GAAAGGTAAA GAATGAGACA AAGCACCGCC GCAGCCTCCA AATTGAAAAG TAACGGTAAG TGATACGTGA CAGGTCTCAA CAAGATCAAC AACTAAGTTG CGTCACTAAC CGGATCGTTG CCTGGTATCA AGGGTCACCG AAGAAGACTG CGAACAGTAA GATTGTCCGG AGCAACGGCA TCGACCGATC CAAGGATGAG TCCGAATTGC CCGCGTCCGT AAGTGAGGCC GACGAGACTA TAATTTTCTT TCAGAGCCGT TGGCGCAAAA TATCGACCGG GAAATGGGGC CCTTGGCAAC GTACGGAGCT CACCAATCCT CGCACTCGCT CTGCCCTTAC CCAAAAAAGC CGACAAATCC ATCTGCCTCG CGGTGGTACC GTCACAATCT ATCCCGGACT ACTCTCCAAA ACCCAACGAG CGACGCTGAG CCAAGAGCTC CTCTCTAGCA ATTTATTTCG TCAGTATGAG ATTCAAGGTA TGCCGGAACC ACGGGTGCAC TTTTTGCTAC ACGAGCAGGC TACTGATGAG CCCGAAGCCC CGCAACCCGG ATACAGGTAT GGGAGTGTGC GCATGAAAGC GTGCCCTTTA CGAGGGCTCC CCAAGCTCTG TCGCTTATCG AAAATTGTGG GCAAAAGGGC TGGCGTAGAG AGCTGGAATA TTGGGGTCAA TCCAGTATTC TACCGAGACG GCAAAGATCG AATTGGAGCA CACGCCGACG ACGATCAGGG CGAATCCTGC ATCTTGAGTG TTATCGTGGC TTCCCCTATT CCGCTTCGAC GACTCTTGAT TCAACCTAAA CCAACCAAAA TTGTGGATAG GACTGTACTT GGGACAAAGC AGAAAGGAAA CAAACGGTTG AAACTAAAAA CGGGAAACGG CGATATTGTT GATGATGGTG TGGATGAACA GCATGAGCTT CGACTGGGTC CGGGGGATGC ATACTGCATG GATGGTACGT GAAATAGAAA CGACTTGTAC GTTTTGATGC GAAATAAAGC ATTCTTAAAT TTTTTGGAAC CCTCTGATTC ACACGCATTT ATTGGTCAGG AGAGATGCAG GAGCACTATG TTCATTCCCT CCCCCCCGAT GAACACAGTA AAGGTTCAGT AGAAGGGCGC CGTATTATTG TCGTCTTTCG TTCAGGCACA CAAAAAGTGT TCAAAAAGGA TTCCGGTAAA CCATGCAACT TGGCGGCTTT GGAGCCTCGA CCTGATCTGT GTTTTACGTT TGGAAACGAA ATAAAAGATT TGCGGGAGGG CGACATCTAC GGCGGACGTC AGCTGCGGGC AATGAATGCC CATCGGTACG TCGAGGTAGT CCAACCGGCT TACTGTTGTG CGAGTTCAAA AAGGTAACCC CTCACTCATA TTAATCTTGT ATTTTACAGA TCCACACAGC GCAGTGTCAG CGGGAATAAA GTCATGGGAT GTGATGCAAT TACAGTTTCT CGGGATCGCG AGGATGACAC TTTTGTGAGA TTTTCCTTTG CAGCAGAAAC ACGTGTTGGT GGGGGGAGCA TGCTCACAAG TTTGCAGAAG GGATATCCTG TTCGGGTCTT TCGCACGTCA GCTTTGCACA ATAAGTACAA AGCAGTCGCA AAGAAAAGTG GCCCAAAGTC AAAGTCAAAT GTGTACCGGT ACGATGGCCT TTACCATATA GAATCTGCGG TGGAAGAACT GGGAGACAAA GTAAACGATG TGTCCCTTGG CTTGAGCAAC ATGATCAACC GCAAGTCAGA AATTATTTTT CGGCTTTGTC GATCCTCAGA GAACAGTGTA TCGTCAATGC GCCTTTTGAG ACGCATTGTC ATTGAGCGCA TGACCCACAG CAAAGTGACC AGCAACAACG ATCTGCGCAA AAACTGTCCG AAGAAGATCA TTGTCAATTA ATGAATCTGT ATGTTATGCA CAAAATATTC TTATATCAAG CAGAACCAAA T
|
Protein sequence | MRQSTAAASK LKSNDQQLSC VTNRIVAWYQ GSPKKTANSK IVRSNGIDRS KDESELPASS RWRKISTGKW GPWQRTELTN PRTRSALTQK SRQIHLPRGG TVTIYPGLLS KTQRATLSQE LLSSNLFRQY EIQGMPEPRV HFLLHEQATD EPEAPQPGYR YGSVRMKACP LRGLPKLCRL SKIVGKRAGV ESWNIGVNPV FYRDGKDRIG AHADDDQGES CILSVIVASP IPLRRLLIQP KPTKIVDRTV LGTKQKGNKR LKLKTGNGDI VDDGVDEQHE LRLGPGDAYC MDGEMQEHYV HSLPPDEHSK GSVEGRRIIV VFRSGTQKVF KKDSGKPCNL AALEPRPDLC FTFGNEIKDL REGDIYGGRQ LRAMNAHRST QRSVSGNKVM GCDAITVSRD REDDTFVRFS FAAETRVGGG SMLTSLQKGY PVRVFRTSAL HNKYKAVAKK SGPKSKSNVY RYDGLYHIES AVEELGDKVN DVSLGLSNMI NRKSEIIFRL CRSSENSVSS MRLLRRIVIE RMTHSKVTSN NDLRKNCPKK IIVN
|
| |