Gene Information |
Locus tag | PHATRDRAFT_40254 |
Symbol | |
ID | 7195860 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | + |
Start bp | 526792 |
End bp | 529053 |
Gene Length | 2262 bp |
Protein Length | 753 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184151 |
Protein GI | 219127874 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
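The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is not part of the record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends with a single stop codon that is not counted in Protein Length. The function names `cds_span` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
def cds_span(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values copied from the Gene Information fields above.
start_bp, end_bp = 526792, 529053
gene_length_bp = 2262
protein_length_aa = 753

print(cds_span(start_bp, end_bp) == gene_length_bp)                  # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Under those assumptions the reported Gene Length (2262 bp) and Protein Length (753 aa) agree with the coordinates.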
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0361313 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGAAG GGGATTCCCG CCGAAAGATT GGTGCTCAGG TGACAGCGAA GGCCTGTCAT GTTGTCCATT TGAGTGAGTG TGCTCGGCGA TATGGTGCTT TGAGGACCAC CAAGGTCGTT GTGGGGACTT TTGTGGAGGT CAACAATACC AGAAAGGCGC CAAACAACCG TGTATCAACC TTCATTACTG CTGACTTTGA TATTGGTGGA GGATCAGTCA AGCGGAGCAC TCTGAACATC CGTAGCGTCA AACTCTTCAA ACCGGACCAG TCGACAGTAC CATCCAGTCC CGCAGCACCA ATACCGGCAG TAGACAACGC AGACACAGAT TTGGCCGTTC CAGAGCAAGA GGAAGGAGAA GCGGTCTTGC AGGAGACTTC TCCTGATGAA GAATTGGAAT TTCCAGCACA ACCGATGATG GAAATTGGAA TAGCTGCGGG GGAACAGGTA GCAGGACCTA CCGCACAAGT AGCCACGCAG GTTTGGGGTG TTGAAGACGC TTCCTTTGTA ATGGCTCATG AAACAAAGTG GTATGCTGAC GAGCAAGCTA CATTGATTGA TATAAATGGC AGTGTCCAAA GTAAGCAGTT TGGCATCAAT ACACCAATTG GTGACCTTCT TGGTCCAGAC TCTGACATTG ATGGAAAATA TTCGCGGCTG CAATATTTTC TTCTCATGTT TCCACCCGAC CAACTGAGCG CCATGTGTCA GCTAACAAAT GTGCAGCTTG CCCAACAGAA CAAGCACTGC ATGTCAACAG GAGAGCTGCT TCGATTCTTT GGCATTCTAA TTCTTGCGAC AAAATTTGAA TTTAGCAGTC GATCGCAATT GTGGTCCACA ACCGCGCCGT CAAAATACAT TCCTGCCCCT GCATTCGGAA AAACAGGAAT GTCGCGGCAG CGCTTTGATG ATCTTTGGCG AAATATCCGA TGGAGCAACC AGTGTCCTGA ACGGCCGGAA GGTATGAGCT CCCATACGTT TCGGTGGCAA CTTGTCGATG ATTTTGTTGA AAGATACAAC AATCATCGAG CCAATACTTT CAAACCATCT CATCTTATTT GTGTGGATGA ATCAATGTCG CGATGGTATG GACAAGGGGG GGAATGGATA AATCATGGAC TCCCCAATTA TGTGGCTATT GACCGAAAGC CCGAAAACGG TTGTGAGATT CAAAATGCCG CATGCGGCTG TTCGGGTATC ATGCTTCGAT TGAAGGTTGT AAAGGGTAAG ACAGCAACAG AAGATGATGG GGACTACAAT GAACAGTTGC TGCATGGAAC AAAGATCCTC AAAGAGCTTG TCCTTCCTTG GTGGTGGACG GATCGGATTG TTTGCGCTGA CTCGTATTTT TCATCTGTCG GTACAGCTAT GGAGTTGCAG CGACATGGTT TGAGATTTAT TGGAGTTGTA AAAACAGCAA CAAAACAATA TCCGATGAGA TACCTTTCGA CTTTAGAGTT GAACCAGAGA GGCGAACGGA GAGGGCTTGT GATGCGAGAT GTTGATACAA ATTATAGCAC TCTGTTGGCT TTTGTGTGGA TGGACAGGGA CCGCCGATAT TTTGTGTCGA GTGCTTCCAG TCTGGATGCA GGCAAGCCCT ACATACGCTA TCGTTGGAGA CAGATTGACC AATCTCCGGA TGCAGATCCA GAGAGGCTGG AAATTATCAT TCCACAGCCC AAAGCAGCGG AATTATACTA TTCTGCATGT GGGATGATTG ACAGGCACAA TCGAAGTCGT CAGGATACAC TGATGCTTGA ACGAAAGTTG GGTACAACAA ATTGGTCGAC AAGGGTTAAC 
CTCTCAATAT TTGGAATGAT TGTTGTTGAC ACTTGGTTGG CCTACAGTCT GTGTACAGGA ATAGGAAGAG CTAACGGGAG AGAAGAAAAG CAGAAAGACT TCTACACTGC CTTGGCTGAG GAGCTAGTGG ATAACCAATA CGACAATGTT GGAAGTCGCA GAGTTTTCGT GGAGGCAAAT TTGGACAATG ACAGCCCAGC ACTTTCAAGG ACTACGGGAG AACCAAGAAG TGGCCTGTAC GCACATCTAA CACCAACCAA AAAAAGAAGA AAGAACAAAG ACGGTAGTTT TAGCAGCAAT AGACTACAAG GACGATGCTT GGTGTGTTCC AAGAAGACAA CATATGTTTG CTCAGTGTGC AAAGATGAAG AAACACCTCA CTCCAGAGAA CCGTGGGTTT GTTATACCAC CAAGGGGAAG CTGTGCTATG CAAACCACAT GGCTACCTGT CACGGCGCCT AA
|
Protein sequence | MSEGDSRRKI GAQVTAKACH VVHLSECARR YGALRTTKVV VGTFVEVNNT RKAPNNRVST FITADFDIGG GSVKRSTLNI RSVKLFKPDQ STVPSSPAAP IPAVDNADTD LAVPEQEEGE AVLQETSPDE ELEFPAQPMM EIGIAAGEQV AGPTAQVATQ VWGVEDASFV MAHETKWYAD EQATLIDING SVQSKQFGIN TPIGDLLGPD SDIDGKYSRL QYFLLMFPPD QLSAMCQLTN VQLAQQNKHC MSTGELLRFF GILILATKFE FSSRSQLWST TAPSKYIPAP AFGKTGMSRQ RFDDLWRNIR WSNQCPERPE GMSSHTFRWQ LVDDFVERYN NHRANTFKPS HLICVDESMS RWYGQGGEWI NHGLPNYVAI DRKPENGCEI QNAACGCSGI MLRLKVVKGK TATEDDGDYN EQLLHGTKIL KELVLPWWWT DRIVCADSYF SSVGTAMELQ RHGLRFIGVV KTATKQYPMR YLSTLELNQR GERRGLVMRD VDTNYSTLLA FVWMDRDRRY FVSSASSLDA GKPYIRYRWR QIDQSPDADP ERLEIIIPQP KAAELYYSAC GMIDRHNRSR QDTLMLERKL GTTNWSTRVN LSIFGMIVVD TWLAYSLCTG IGRANGREEK QKDFYTALAE ELVDNQYDNV GSRRVFVEAN LDNDSPALSR TTGEPRSGLY AHLTPTKKRR KNKDGSFSSN RLQGRCLVCS KKTTYVCSVC KDEETPHSRE PWVCYTTKGK LCYANHMATC HGA
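The relationship between the two sequence fields can be spot-checked by translating a few codons. This is a minimal sketch, not the database's own method. The Translation table field in the record is blank, so the standard genetic code (NCBI table 1) is assumed here. `translate` is a hypothetical helper, and only the first three 10-base blocks and the final 12 bases of the gene sequence are used.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the record leaves "Translation table" blank, so table 1 is a guess.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; whitespace between blocks is ignored."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Copied from the start and end of the Gene sequence field above.
first_blocks = "ATGTCAGAAG GGGATTCCCG CCGAAAGATT"
last_blocks = "CACGGCGCCT AA"

print(translate(first_blocks))  # MSEGDSRRKI
print(translate(last_blocks))   # HGA*
```

The first ten residues match the opening block of the Protein sequence field, and the last three residues are followed by a stop codon, which is consistent with the 2262 bp and 753 aa lengths reported above.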