Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35852 |
Symbol | |
ID | 7201057 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 897427 |
End bp | 899026 |
Gene Length | 1600 bp |
Protein Length | 422 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180342 |
Protein GI | 219119151 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.318034 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGGGG AATGTAATAC ATCCCGAAAT CATAAGACCC AAATCCTTTC ATGGGACTCC GGAAACGTCG TTGAGAAAGC GTTGTTTTAC CCGTTCTTTT TTCGAATTTC TTCGAGGTCA AGCAGGCAAA GAACAATTTC TGGCTGGTGT TGCTTACTTT TCTAATAATC TCACCGTAGT GGATACGTAG GTCCATATTT ACAGTACATT CCATACAGGA CGTGATGATA CGCAACATAA GATTAAATTA TACGGGCTGT GACATACTAC CTATGTCACA TCACCAGATC CACTACCACT GGATTTGCTC ACTTTTGTAC GTGGAATGAA GAGTTTGGCT CAAGTGCTTT TGTGCACAAT TTGCATTCTA CCAGGGTGTC CAAGCTTGCA GAATTTGAAA CGGAAGCTGC TGATTCAGTG AATAAAGCTA TGCTAGAAGC ATTGGAGAAG ATACAAGTCA ATATTGCAAA AAGGGCTGGT AAGATAAATT GGTCTAAAAT TGAAGTAAAA AAAAAAGACG ACTCCCAACC AGTCGTATTG CTCCCCATGC CAATTGTTTG GCCCAAAACA TCACCAGCCT TGAACACAAA CCTGACTTTC GACACGCCGT GGCTCTCCGC CGCTCAAGTG GCCACCAAGA AAAGTGACCC CGTCGTGAAA AGGAATAAGG ACAGCGAACA GATTTCCGCT AGCACACTTG TCGCCTCGCG ACCTAGCGTG GTCGATCCAC GGTTGGCTAC GTTCTTGGAA CGCTTTGACA ACGCCGAATG GCGATTGCAA GCCCTGCAGG CCAAAAACGC CGAACTCCAA GCCAAAGTGC AGGAAGCCGA ACGGCAAAAA CGCGCCATGC AACAGTTGGC CAGGTAATGC AAGTGCCTCG ATAGAAATTC TCATAGATAG AATCGGCGCG GCGATCTATA GTGTCGTAGC TATGCACACG GGGCCCGGAG TAACATAAGT CAGAGTTGAA CGGTCTCGCT CGGTAGCTCA ACATTGCATC GTTCGGACTA CTCCGACGAT CAGACGCCGT TTGTCCCAAA GTGTTGTGAA TGTGGCAGGA GTAGCGATCG TAGGGAAGCG TCCCAAAAAT ATGGCGTGGG TGACTTTGCG AGCTCGTCGG CATGCAGGGA CGACGTTGCA CCACCACTGC AACAGTAGCC GATTCCTCTG TACCGATCGT GTTGTAATCG AGGAAAGGAC AATGACTCTG AATCCAAGGA ATCGTCCGAA CCGTCATAGG TTCCTTTGCG AATACATATC TGATGTGGAT GCGTGCGGTC GTCGTCGCTG GAACCGGACA AATCCACGGT CCATACGGGC CGTCGCGAAG TCAAGACGCA CTGTGTGTGC GACTGCTGTC GTTGCTCCTC TCCTTCGGAT ACGTGAGCAG GATCTCGCAC ATTCTCTCCT TTCGAAAGGA TTCCTATCCC GGGCAGTCTC CAGGCGCATA CCCCACGAGC GAGACACAAT TCGTCCAGCG ATGGTAAAGT TGGCAAGCCT TCCGCACGGC ATGGGTACAA CATGGAGTCG AGGAAAGGTT GTTGAAGTTG TTCCAAAACA CCGATCCTGT GGAGAGCTTG AACGGTGTTC ACGTATCTAG
|
Protein sequence | MMGECNTSRN HKTQILSWDS GNVVEKALFY PFFFRISSRS TTTGFAHFCT WNEEFGSSAF VHNLHSTRVS KLAEFETEAA DSVNKAMLEA LEKIQVNIAK RAGKINWSKI EVKKKDDSQP VVLLPMPIVW PKTSPALNTN LTFDTPWLSA AQVATKKSDP VVKRNKDSEQ ISASTLVASR PSVVDPRLAT FLERFDNAEW RLQALQAKNA ELQAKVQEAE RQKRAMQQLA RRRLSQSVVN VAGVAIVGKR PKNMAWVTLR ARRHAGTTLH HHCNSSRFLC TDRVVIEERT MTLNPRNRPN RHRFLCEYIS DVDACGRRRW NRTNPRSIRA VAKSRRTVCA TAVVAPLLRI REQDLAHSLL SKGFLSRAVS RRIPHERDTI RPAMVKLASL PHGMGTTWSR GKVVEVVPKH RSCGELERCS RI
|
| |