Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43772 |
Symbol | |
ID | 7197282 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 1431877 |
End bp | 1435168 |
Gene Length | 3292 bp |
Protein Length | 1013 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177831 |
Protein GI | 219112159 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.400632 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCTCTATCGA GAACGTTTCG ATCGCTTCCT AGCTCTAGAT TAAAGAGCTT AACATGGATA CTCGCCACCT TAAGTACAGC TTTTGGACCG GAGGAAATTT CAGCAATAGC AACAATAGCA CTGACTTTGA TTCGAATTTG GAACGCCCTC AAGATGTTTC CGTGGATTCC GTCTTGACAT CGTTGTACTT CAATAGCATT GTATTTGTCA TGCTCATGGC CAGTTATGAA ATCTTACGTC GTGTGTTCCC TGCTGTATAC TCATCCCGTA AACGAATTTC ACACGCTCGT CCCGATACAC AGAATGGTCA TCGACCGGAA GCTCCCTTGC ACGAGGATGC CACAGTGCCC GACCCGAATG GCACCGACTA TCCCAAAATT CATCACGAGC GTCATGCTTC GTTGACTTCT CTGCCGGACG ATCGACCACT CGACTGGCTT GGGCCCGTAT TTGGCGTGCC CTGGAGCAAG GTTCGTCGCA TTGCGGGCTT GGATGGGTAC TTCTTTCTAC GCTATATACG TATGAATGTG AGAATTACGG CCGTTTCCAC GTTTTGGTTC TTTCTGATAC TGGTCCCTAT TTACGCGACG GGTAGTTCAA AGGAACACTC GGCGGAGGGA TGGTATCATC TGTCGGCGGC GAATATTCCA AGAGACGGTT GGCGTATGTG GATACCTTGC CTTTTCGCGT ACTTGTTCAG CGCCTTTGTT TGCTTTGTTG TCAAGCAGGA GTATCGGCAC TTTCTGGACC TCAGACAAGA CTTTTTGGCG AGAGGAAACA TGCACGTCGA TCCGCAACAC CATCATTCCT TGGAAATTGA AAATATTCCC TACGAATTAC GATCGGATCG TGCCTTGAAA GAATATTTCG AAAAGATGTT TCCCGGACGC GTTCATTCAG CCAGTGTCGT TCTCAATCTA CCGGAACTGG AGGACGCCTC CGTCAGATGT ATGAGAACAT GTCGTCGTCT CGAAAAGAGT ATTGCTTTTT TGCACGCAAC GGGTAGCCGG CCCACTCACG TTGTTGGCCG TGGCCGAATA TCTTGCTTGG GAATCGAGCT ACAACCGTTG GACTGCAATT GCACAGCGAG TCAGGAAACC TTGTTCGTCG AGAACGATAT GCGAGCAGAA CGACCGAAGA GAGGAACTCG TGTCGACTCA ATTTCGTACT ACGCACAAGA ACTGGCAGCC GATAGTCGAT CGTTATTTCA GATGCAAAAA CGCAAATCAC GGATTGCGGA ATCTGGAAAT CAGTTAAAAC AGGTGGACAA CTGGTTGGAC AAGGCGGTCC GCCAAGCATC AGAGGTTGCG AATACAATTT TGGAAGACTC AATAAAGGAC AACCATTTGA CTTCGCCGTA CGAGAGCTTT GACGAAGGTG GAACAGTACC TCCAGCCGAG AGCATGACTT CCAGGTATGG GTCCTTCAGT CAGGCAATCA GCCATCGAGC AATATACGGA CGGAGCAAAT TTGACAAAAT TCCTGAGGAA AGAAAGGAAC CTCTTGTTTG TGATGATGAA ATGGTACGCA GAGATCGCAA CCCCTCGACT TTATCGCATA TTTTGACTTA CCTGAATGAT CTGCGTCAAT TTACAGTCAG ACTCGACCGA AGACTCTGAT TTTCCAATCC CCTTTAGTAG AGATAGCTAT CAAAACAGGT GGAGACGTTG GGCAGGTCGG TTAGGTTTAG ACTTTGCCAT TGCCGGTCTT AAACTTGTTA ATAAGCAGCT CGACGTTGCA CTTGAAGAAG TCGTCGGAGC TACAATGTCT TCTACTGGGT ATGTCACGTT CTTGGATCTT TCCTCGACAA CATGCGCGGC GAGTGCACCA CTGACGGTGA AGGCCAATGT TCTCGATGTA TCTGTTGCTC CCGAACCTAG GGATATCATT TGGAAAAATG CTCATATTTC CAAGAGATCA CAGTTGAGAC GTGGCAATTT CACGAACTTC TTTCTATTTC TTGGCGTTAT TCTATGGAGT TTCCCTCTGG CTGCTATTCA AGCTTTCGCA AAAGCTGAGT TTTTGGCACA AATTCCTGGA ATGGAATGGA TTTTAACTTT TCATGGGGGA ACTTTTACAA ACTTTATGAA CGGCTACCTT CCAGTGGTGG CCCTTTTGTG TCTGATCCTT ATACTTCCGT TGATTTTTGA GTATGTGGCT GTGAGTTACG AGCATCGCAA GACTTATTCG GATGTTCAAT CATCAATGCT GAGCCGTTAC TTCTATTATC AGCTTGCCAA CATCTATGTG TCTGTGACTG CAGGATCAAT TCTGAAGTCT CTTTCGGACA TTCTTGACCA TCCATCGAAC ATTTTGCAAC TTTTAGGGGA CTCCTTGCCT ACCATGGTCG GCTACTTTGA TGCTCTATTA GTCACAAAGA TTATGGCCGG TCTACCAATG ATTTTCTTAA GGTTTGGTGC ATTGTCCCGT ATGCTTTTTT TGAAAACACT GTCAAACGAA AAGAAAATGA CACAGCGTGA ACTCGATGCC GTGTATAGGC TGGAAAATGT CCAGTACGGG TGGGAGTTTC CAACACAGCT TCTTGTGGTT GTGATAGTTT TTACGTATGC CATTATTTGC CCCGTCATCC TCCCGTTTGG CTTGCTTTAC TTCCTCGGAG CACTTTTGGT GTACAAAAAG CAAGTACTAT ACGTCTACAG TCCGGTATAC GAAAGCGGAG GTGCTATGTT TCCCGTTGTA GTCCAGCGAA CGCTTTTCGG ATTGGTGTGC GGCCAGATGA CATTTATTGG ATATGTGGTA ACACGAGGTT GTTACTATCA GCCCATTTGC TTATTCCCTT TACCTATTGG CACAATTTGG GCAATGAACT TTTTCCGACA AAATTATGCA GATCCTAGCA CTCGGCTAAG TCTGGAACGG GCCCGCGAAT GCGATCGGTT GTCCTCGTCT AAAGCGGCAA CGGAAGAGGA TGGATTGGAC AGCAACATTG ACCGTGGCGT AGAATTGCGA AGAACGAAAT TCGATCGCAA GTCATACCGG CAACCTGTCC TCACAGAGCT CGCCACGGAA CCAGAGTTCT ACCGCTCAGG CTTTCAAGAC GACGAAACCT TCGCTGTAAG GAAACAGCTT CAACGAATTA ATCGATACAT CAAGGAAGCG ACTTTGGAAC ACAATGATGG TCTCAAAGAT GCTTTGTTTC CAATATAGAA AAAAATTATC ATTTCCGTGG GCCTCAAATA CTTTTTACAC CATTCGTTTA TGAGAGCGGA GACAGTGAGA GTTAGCCTTT TCGGCCAAAG CGCTGTCTAA TCTATACTGT GGTCCGTCGA AT
|
Protein sequence | MDTRHLKYSF WTGGNFSNSN NSTDFDSNLE RPQDVSVDSV LTSLYFNSIV FVMLMASYEI LRRVFPAVYS SRKRISHARP DTQNGHRPEA PLHEDATVPD PNGTDYPKIH HERHASLTSL PDDRPLDWLG PVFGVPWSKV RRIAGLDGYF FLRYIRMNVR ITAVSTFWFF LILVPIYATG SSKEHSAEGW YHLSAANIPR DGWRMWIPCL FAYLFSAFVC FVVKQEYRHF LDLRQDFLAR GNMHVDPQHH HSLEIENIPY ELRSDRALKE YFEKMFPGRV HSASVVLNLP ELEDASVRCM RTCRRLEKSI AFLHATGSRP THVVGRGRIS CLGIELQPLD CNCTASQETL FVENDMRAER PKRGTRVDSI SYYAQELAAD SRSLFQMQKR KSRIAESGNQ LKQVDNWLDK AVRQASEVAN TILEDSIKDN HLTSPYESFD EGGTVPPAES MTSRYGSFSQ AISHRAIYGR SKFDKIPEER KEPLVCDDEM SDSTEDSDFP IPFSRDSYQN RWRRWAGRLG LDFAIAGLKL VNKQLDVALE EVVGATMSST GYVTFLDLSS TTCAASAPLT VKANVLDVSV APEPRDIIWK NAHISKRSQL RRGNFTNFFL FLGVILWSFP LAAIQAFAKA EFLAQIPGME WILTFHGGTF TNFMNGYLPV VALLCLILIL PLIFEYVAVS YEHRKTYSDV QSSMLSRYFY YQLANIYVSV TAGSILKSLS DILDHPSNIL QLLGDSLPTM VGYFDALLVT KIMAGLPMIF LRFGALSRML FLKTLSNEKK MTQRELDAVY RLENVQYGWE FPTQLLVVVI VFTYAIICPV ILPFGLLYFL GALLVYKKQV LYVYSPVYES GGAMFPVVVQ RTLFGLVCGQ MTFIGYVVTR GCYYQPICLF PLPIGTIWAM NFFRQNYADP STRLSLERAR ECDRLSSSKA ATEEDGLDSN IDRGVELRRT KFDRKSYRQP VLTELATEPE FYRSGFQDDE TFAVRKQLQR INRYIKEATL EHNDGLKDAL FPI
|
| |