Gene Information |
Locus tag | PHATRDRAFT_50617 |
Symbol | |
ID | 7199451 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011700 |
Strand | - |
Start bp | 129222 |
End bp | 132392 |
Gene Length | 3171 bp |
Protein Length | 859 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185586 |
Protein GI | 219130889 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
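The length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the record. It assumes the start and end coordinates are 1-based and inclusive, and that the gene span runs from the start codon to the stop codon with no untranslated regions (consistent with the listed gene sequence beginning with ATG and ending with TAG). Under those assumptions, the difference between the gene length and the coding length implied by the protein would be the total intron length, which fits "Is gene spliced | Yes". The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 129222
end_bp = 132392
protein_length_aa = 859

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Coding bases implied by the protein: one codon per residue plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Assumption: the span covers start codon to stop codon only, so the
# remainder is attributed to introns.
non_coding_bp = gene_length_bp - coding_bp

print(f"gene length: {gene_length_bp} bp")
print(f"coding length: {coding_bp} bp")
print(f"implied intron total: {non_coding_bp} bp")
```

The computed gene length matches the reported 3171 bp; the intron figure is only as good as the assumptions stated above.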
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00889588 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGATTC CAACGATTCT CACCGTGGAA GAACCAGAAC GGGATCTGTA CGGCCGCGTG AAGGAAAAAG CCGCAATCTC CGTCGCTAAC GATCAGTGCA TGGATTTGGC CACAGCAATG CGAGGAAGCG AGAAGTCTCA CGTAATTCTC GTACATGGTC CATCAGGTTT GGGCAAAACT GCTTTGTTAC AGCACGTCTT CGAGGAACGG ATAAGGCAAG AGGGCGGCTT CTTCATATAC GGAAAGTTCG ACTCAGTCCC GTCGGCGAGC CCTGGCCAGC CATTGTGGAC GCTTTCAGCG ATTTGTGTGT ACAGTGTCTT GCTTCCGAAA GACGATCGAA AATACGGAAG GCGCTTTGCG AGGAACTAGA ATCAGACGTC AATCAACTTT CACATTTAAT TCCCAACTTC AAGGCGCTCG TTGACGGCTT TCTTGAATCG GATGACGACA CTGAGGGAAA TGGAGATTGG AATGACATCC GAGCATCATT CAGGCGAAAC GAATTTGGGT TTACCCGCAT AAAACTCCTC TTTCAACGAT TCCTCAGAGC AATTTGTCAC CCCATCTCCT CCCGAATAGT TTTGTTGCTT GACGACGTGT AATGGGCAGA CCAATTGAGT TTGCTTCTGA TAGAGGCGCT CTTATCCGAA GGCGAGCCTG TAAGCGGGTT TCTGTTAGCT ATCACATCTG ATGAACCAGG ACTTTTTCGA CGAGCTCAGG TCAATAGAGA TCTCGTAAAT GCTACCAAAA TTCGTATTCG CAACTTTTCC AAGAATGATT GGATCGAGAT GACAAAGCAA ACCTTGAAGA AAGCGAAAAG CTACATTGCA ACCGAAGAAT TGGATATTCT CTTCAAAAAG ACTATGGGCA ACCCTTTCTT GACCGTTCTT TGTGTAAAGG CAATAGACGA GGGAGGCTTG CTTAAGGAAT TAGAAATTGA AGCTGACAAC GGTGTAAAAG CTGGAATTTT GTCAGTGATC AAGGGCCGCC TGAGCCGATT GGCAAAGCCA GCTCAAGATG TTTTGTTTTT GGGAGCCTGC TTTGGACTGA GGTTTTGCAT GGATTGGGTG GCTCCTTTTG TCACGACATA TGGATCAACA CGAAACGCTT CTCTGCAACC GGACGTCATG CCTTTTAATT GGGGATCGGG ACGTTTGGAC GACTCAGAGC ATTTGTCTGA AAGTGAACTC AAGGAGACAT TGAACGAAGC TATCGTTAAC GGCCTGGTCA CTAAACGATG CGGTATCCCT TGGTTCGAAT TCACTCATCA TTCGATCCGC GATGCGGCCT ATGACCTTTT CAGGAATAGT TCGTGTAAGA GAGAACGAAT ACATTTGCCA ATAGGCGAAC ACACGCTTCG AAAGGTGAGC TTTTCCTCTA GCTCCGAGGA CGATACTATT CTATGGACAG CAGTGTCTGA ATCTAGGTTC TTCTCAGCTG CATGAGGAGC GCAAGTTGAT TGAACTTGCT CGTCTCAATG GAAGGGCAGC AGAAAAATCA ATGCTGAAGT TAGCGTTCTT CTCGGCTGCC CAGTATGCGT CAGCTGGTTT GGAAAAGATA GGACGAGTTG GAGGATGGCA AACAGACTTT GCTGTGACCT GGGAGTTGAG CACGCATTTA TGTCGAATGT ACAGTTGCCT CGGTGAGCAT GAAGCTTGTA AGAGAGTCGC CAACGATGTG GTGTCCCGCA GTTCGTCCAT TTTTGAAAAG CTTGGTGCTT TCGAAGCTGT CATGGAGTCT TCTCGAGTTG AAGGGAATGT AGAGGAAGCA TTCAACATAG GATTTGACGT CCTCAGAAGC 
TTGAATGAAC CTTTGCCCAA CAGAGTTAGC AAAGCGCTTT TAATTTGGGA AATTGTCAAG ACAAAACGCA TCTTGACCCG GAACCCTTTA AGGATTTGTC AGGCCTACCA AAAATGGCTG ACGAAAGAGC ATGTGCTACG GTTCGGTTTC TAGGATTACT ATCCCTCAGC TTCTTTGCCA TGGGAAACTA TTTCAGCTAC TTTGTGGCAT CGTTGAGAAT CGTCAGACTG AGCACAAAGT ACGGTGTTGC TCGGGAATCT CCAAAAGCAT TTGTGGTCTT CGGGAATATT CTATCCCAAA CAAGTCGACA CTTTAACGAA GCTAGCCAAT ACTTGTCCGT TGCGCTGTCT TTAGGTGAAA AGGCGGGCAA ATCTGGAAGA GCGCAATCTC TTGCCGTTGG AAGTTGGATC TTGACGCCTT TGCAAGGCAC CGTCTCGGAA GCAGCTGGTC AAGCTCTGTA CGGATATCGG CTCGCAATGG AGTGTGGCGA GGTTGTGTGC GCGTGTACCT CAGTTCTTTC TTATTGTGGC CTCTATTTTT GGAGTGGACT TCCTATACCA CCCCTAATGA AAGATCTGCC GACTTTCCTG ACCATGCTAA GCGAATATAA GCAGGTACGG CATTTCAAAT TCTAGCTCTG CATGGGCATC CGACCATTTT CTCACCCTTT TCTTTCAGAC TGTGCACGAA GTAGGACTCT CCTCACTGAT GTATTTCATT CAGACATCAA CAGCGGAGGC CAATCCGACT GATTTATATG ACGATACCGT ATGGATGGTC AGATGTCAGA ATGCAGGAGC CGCTATTCAA GTCAATACCA TTTGTCTATA CCGCATAATA TATGCTTATT ACATGCAAGA TCTCAGTTCG ATACGAGCTT CTCAACTAGA TGCGCACCGA GCTGTAAAGG TACAATTGTC AAGAACTTTG CAAGTTGTCG TGTGCTGGCT TTTTGTGGGG CTCTCCGATT TCTTCTTAGC ACAGTCTGGG TGTGGCATTG AGTTTCAACG AAGTGGCCAA AAGATTTTAC GCATGATGCG TCGGTTGGTC GTCAAAGGGG ACAGCAAATG CGAGCATATG TTCATGTTCC TCAGGGCAGA AAAGTACAAA CTTGAATCAA AACAGAGCAA TGAAGTTCTA AAATCCTACG ACGAGGCCAT TTCTGGAGCC GTACAGGCTG GATTTTTCAA CCATGCGGCC TTGGCGAACG AGCGCGCCGC ACTATACTGT TTAGCGCGTG GAAAGGAAAA GAAAGCGGCT CAATATTTCC AAGAAGCCTG GCAAGGGTAC CTGAACTGGG GAGCCCATTC TAAAGTTGAC CAGCTTGGGG GGTGCTACTC AGCATACATA CAACAAAGTT CTAGAAAATA G
|
Protein sequence | MAIPTILTVE EPERDLYGRV KEKAAISVAN DQCMDLATAM RGSEKSHVIL VHGPSGLGKT ALLQHVFEER IRQEGGFFIY GKFDSVPSAS PGQPLWTLSA IYQLSLLLIE ALLSEGEPVS GFLLAITSDE PGLFRRAQVN RDLVNATKIR IRNFSKNDWI EMTKQTLKKA KSYIATEELD ILFKKTMGNP FLTVLCVKAI DEGGLLKELE IEADNGVKAG ILSVIKGRLS RLAKPAQDVL FLGACFGLRF CMDWVAPFVT TYGSTRNASL QPDVMPFNWG SGRLDDSEHL SESELKETLN EAIVNGLVTK RCGIPWFEFT HHSIRDAAYD LFRNSSCKRE RIHLPIGEHT LRKQCLNLGS SQLHEERKLI ELARLNGRAA EKSMLKLAFF SAAQYASAGL EKIGRVGGWQ TDFAVTWELS THLCRMYSCL GEHEACKRVA NDVVSRSSSI FEKLGAFEAV MESSRVEGNV EEAFNIGFDV LRSLNEPLPN RVSKALLIWE IVKTKRILTR NPLRICQAYQ KWLTKEHVLR FASQYLSVAL SLGEKAGKSG RAQSLAVGSW ILTPLQGTVS EAAGQALYGY RLAMECGEVV CACTSVLSYC GLYFWSGLPI PPLMKDLPTF LTMLSEYKQT VHEVGLSSLM YFIQTSTAEA NPTDLYDDTV WMVRCQNAGA AIQVNTICLY RIIYAYYMQD LSSIRASQLD AHRAVKVQLS RTLQVVVCWL FVGLSDFFLA QSGCGIEFQR SGQKILRMMR RLVVKGDSKC EHMFMFLRAE KYKLESKQSN EVLKSYDEAI SGAVQAGFFN HAALANERAA LYCLARGKEK KAAQYFQEAW QGYLNWGAHS KVDQLGGCYS AYIQQSSRK
|
| |