Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41639 |
Symbol | |
ID | 7199464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011701 |
Strand | + |
Start bp | 27642 |
End bp | 29678 |
Gene Length | 2037 bp |
Protein Length | 678 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185597 |
Protein GI | 219130913 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.627134 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGACG AATCAAATGT GCATGAGGAG GCAGCTTTTC AGGATCACGT ATCGTCTAGT GAACACTCTC AGAAAGCACG TGAACGTCGA CACCCCGTCG CAATCGCAAA CAACGCAGAG GGTAAATCTT CGGCTGAATT TTCCGATCAA ACGATTACAC AAACGACCGC AGCCGATTCA GTTTCAGACC TGATCGGCGG CGAAGACTAT GAGACTCCAG TTGTGCCTTA CGCCAATCTG GGCAATCCTA CGCTTTTTGA ACAAGCTGAG ACCAAGCATT TCTCTTCGCA AAAGAATAGT GACGATATTT CTGACACGGT TGAAACTCAT TCGAGCGCGG CCACCCAAGA CAATCCGGAT AGGAATAGGG AGAGTTTGAT CAAAATCAAC ACGTTGTCCA GCGATAGCGG TAGTCCACCC TCTGGTGACG AGGCACAAAT GAGCACGGAA AGTACATTCC AACAACCCAG ATCGCGAGGG GGCTACGTGG ATCAAGAGAG AAGTTCACCG ATGACAATCA TTCCAAGTGA CTCTCTCAAA ACCGACGTCA GATCAGAAAT CTTTGCGCTG GAGCACACGC AGTCCCACAG TCAATTGGAG TCGATCGACG CGGAGCTCAA AGCCGCAAAA GCGGCAACGG AGGTAGTGGA TGATTCTTTA GAAGAGGAAG TCCGAGCTAT GCGAGCTGAA GGGTCGACAT CGGGTCGGCA GGCCTGTAAT CTAGATTCTA AGCGAGTCTT CGTGACCGAT GACGCTGTCT TAGCAGAATC GGATGCCTCA AATGAACTGG AGCTGCTAGA ACACCCGCGT TTGGAGTTTT CGCAAATATG CGGCAGACCC GAATTCGAAA AATACGAATT TCCGGACGAG GAGACGCGAA AAGGCGACGC GGAGGAACAA GAGAACAAGT ACAAGAGAGA TGAAGTAGCA ATCAAACACA GTATACTTGA AACAGCGGGG AGTCACAGCG AGTGGACGGA GCAATTGAGG GACGAAACCG GCAAATCTCT GATCCCAAAA GAGACCGAAA AGCAGGACAA CCCACACAAA GAAAACTCTG TTGTGGGGGA AGAAGCCATG ATACCGACCA ATTCACGTAT GGAAAACGAA GATACACAGG AAAATTCCAA ACCATCGGAA CCTGCCGTTG CTTCAGAAGC TAAGCCATCC CTCAAAGCTG GTTCGTTCGC TAGCCATATC GACAAAGTAA CTTACGACGA AATGGAAAAT TCCTTTGCCG GTGCAAATGA GCACTCCGGC GCAGTGATAG CCCAATCAAT GAAGCAAGAC ACTGTTGCCC ACGAACTTCA GACTCCATTT TGTGGTGTTT GGGGGGCGGT ACATGGGGAA CGAAAGCTCG CCGATTTGTC GGTTCTCGTT TTATTGTTTG AAAACTTGCT TTCTGACAAG AACGACGACA ACGATGCTCG AAACGCGGCT GAATTGGCTT TACTTGAGCG AAAATTCGAA GATCTTATGG ACGCGGATTA TTTTTTGGCT GAAAGTGTGC CCTCAAGTAT TGAGAACATT GCTGAATCAC AGGTCCCCAT GGCGAAGTCT ATCAGGAATG AGTTTGTTGA TGGACTAGAT GACATAGATA AATTTTTCGA AGACATCGAT CCTCCGGACG AGTTGGATGT TGGAGCTGGT GGATCGTCTA TACAGGAAGT CCTGATGGGA CAAGGTAGTC GAATCATTCT TAAACGGCTA GTTATAGCAG CTAAAGTCGT TCGAGACACC GCGGTTGAGA TTAAGAGGAC GCTACTGACA AAAATCGCGG ACGACGACGG TTCGTTCAGC ATGGCGCGGA GGGAAAAACT TTACGGTATA TTGAGGACAA TACGGAGGCT AACCCGTAAG AGCATTGAAG CTTTTCGTCG TTTCATTGAA GGACTCTTGG AAGGTGATGT TTTCGATGGA GAGGATTTTG TCCTTGACTT CACAGTCAAT CCAAATCCGC CCAGCGAGGC GGACGCAGAC CCGGGAAAAA AAGCATTTCG TCAGCAATCA CAGTCAATAA ACGGTCGAGC TAACTGA
|
Protein sequence | MPDESNVHEE AAFQDHVSSS EHSQKARERR HPVAIANNAE GKSSAEFSDQ TITQTTAADS VSDLIGGEDY ETPVVPYANL GNPTLFEQAE TKHFSSQKNS DDISDTVETH SSAATQDNPD RNRESLIKIN TLSSDSGSPP SGDEAQMSTE STFQQPRSRG GYVDQERSSP MTIIPSDSLK TDVRSEIFAL EHTQSHSQLE SIDAELKAAK AATEVVDDSL EEEVRAMRAE GSTSGRQACN LDSKRVFVTD DAVLAESDAS NELELLEHPR LEFSQICGRP EFEKYEFPDE ETRKGDAEEQ ENKYKRDEVA IKHSILETAG SHSEWTEQLR DETGKSLIPK ETEKQDNPHK ENSVVGEEAM IPTNSRMENE DTQENSKPSE PAVASEAKPS LKAGSFASHI DKVTYDEMEN SFAGANEHSG AVIAQSMKQD TVAHELQTPF CGVWGAVHGE RKLADLSVLV LLFENLLSDK NDDNDARNAA ELALLERKFE DLMDADYFLA ESVPSSIENI AESQVPMAKS IRNEFVDGLD DIDKFFEDID PPDELDVGAG GSSIQEVLMG QGSRIILKRL VIAAKVVRDT AVEIKRTLLT KIADDDGSFS MARREKLYGI LRTIRRLTRK SIEAFRRFIE GLLEGDVFDG EDFVLDFTVN PNPPSEADAD PGKKAFRQQS QSINGRAN
|
| |