Gene Information |
Locus tag | PHATRDRAFT_26975 |
Symbol | |
ID | 7200053 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 1007381 |
End bp | 1009687 |
Gene Length | 2307 bp |
Protein Length | 701 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179558 |
Protein GI | 219117527 |
COG category | |
COG ID | |
TIGRFAM ID | |
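
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which this record does not state explicitly. The function names `gene_length` and `coding_bp` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_bp(protein_aa: int) -> int:
    """Bases needed to encode a protein, counting one stop codon."""
    return 3 * (protein_aa + 1)


# Values copied from the record above.
length = gene_length(1007381, 1009687)
print(length)  # 2307, matching the Gene Length field

# Bases in the gene span that are not needed to encode the 701 aa product.
noncoding = length - coding_bp(701)
print(noncoding)  # 201
```

Under these assumptions the span is 201 bp longer than the coding sequence alone would need, which is consistent with the record listing the gene as spliced, although the split between introns and flanking sequence cannot be read off from these fields.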
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0792483 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGACACGTTG CCGGAAAGAA CCAAACAACG AGCGCTCTTT ACTGTCTTAC AAGAGCCTGC ACGATGGCGA CTGAACAAAG CAGAGAAGAT GAATTAGCGG CCGAAGCCGA AGAATTGGAC GGCTACTTTA GCGATACGCC ACTCGATCCG CAACCGGATT ACCCACAGTT ACGGGAATCC TTCGAATCGG CCGTGATCAT CACCAATTTG CCCAAAGTCG GAGAGTCCAA GATGGATAAA CTGACCAAAG TCGTCATGAA GCTCGTATCC CGCATTGGCA CGCTGGTAGA AAATCCGGAG ACGGGCTTCA CCGGCGTTTT GATGCCGCAC GACAAAGACA ACGACACTAC CATGGGGTTT TGCTTCGCTG AGTATACTAC CAAGGAAGAA GCCAAAAACG CAGTGGAAGT CTTGCAGGGT TACAAGTTTG ACAAAAATCA TTCCTTGTCC GTCACGTTGT ACGCTCGCGC GGAACAATTG AAAGATTTGG ACGTGAAAGA GTTCCAGGAA CCGCAACCTA CTCCTTTTCA AGAAAAACCC AACGCCATGT CCTGGCTCGA AGATCCTAAC CAACGTGACT CTTTCGTTAT TCGACACGGC AAGGAGACCG TTGTCAACTG GTTTGATGGT AGGAACGATC CCATACTCGA CTACGATGGT GCTCGCGAGA AGGAAGCCGG AGTGCAGTGG TGCGAGTTTT ACTGTCACTG GAGTCCATGC GGATCTTACC TAGCAACACT TGTCCCCGAC CGTGGTGTCA TTCTTTGGAG CGGGGAAAAC TACGAAAAGT CGGGACGATT TGTCGCTCCG GGCGTCAAGC ACATTGTCTT TTCGCCGGAA GAAAATTACA TCTTGACGAA TAACGATGAT CCAACCGATC CGGGGGCCAT TAAAGTTTAC CATATTCCGA CGGGCAAGTT GCTACGGACC TTTCCGCTCT TCCCCGACGG TGTAGCAACC GATATACCGC CACCTCCCTT TCTGTGGAGC CACGACGATC AATATCTTGC TCGAATGGGC GACGGTCTCA TTTCCATCTT TGAAACGCCA GGCATGCGAT TGCTCGACAA ACGGAGTCTG GCAGCCGAAG GAATTTGTGA ATTCCAGTGG AGTCCTAAAG CCAACGTTTT GGTCTACTGG GTACGTGTGA TAACATCATC TGCGCTGCTT GAACTAGATT GATGTTCGCT AACGGCTTTT GCTTCTACAC AGGCCCCCGA AGCTGAGAAT TCTCCGGCAC ACGTGGATGT TATCGAAATT CCGAGCCGAA AGAAACTTCG TCAAAAGAAT CTCTTCAATG TTTCACGCTG CAACATGGTC TGGCAGGAAC AGGGCGATTA CTTAGCTGTC AAGGTTACTC GGCATACTAA GAGTAAGAAA ACTCTGTACA ACAATATTGA GCTGTTTCGC TTGAACGAAC CCGGCGTGCC AGTTGAAATG CTCGATACCA AAGACGCCGT CATGGCCTTA TCCTTTGAAC CGCGAGGTTC TCGTTTTGCC ATGATTCATG CCGAGAATCC GAGTGCATCC AAGGTAAACG TTTCGTTTTA CGACATGATG AAACGAGAAA GCGTATTATC CACGAACAAG AAGGGTGGGC AAACAAAGCA ATTTACGTAC GTTCCGGAAC TCAACAAGAT CGAAACTTTG GAGGGGAAGC AATGCAACGT CATCTTTTGG AGCCCGGCTG GTTCGAATAT TATTTTGGCG AGTCTCGGTG ATACTGCATC TGGTACTCTT GAGTTCTACG ACGTTGATAG CAAGTCGCTA GTAGTGAAGG AACATTATCG ATCCAACCAG 
GTTATTTGGG ATCCCGACGG TCGAAGTGTT GCTACTGTTG TGTCACAGCC AATCGAAGGT GGACATTTCA AATTTGCTAT GGATAACGGG TACATTATCT GGAGTTTTCA AGGGAAGCAG TTGTATCAAC AATCGTTCGA GACATTTTAC CAATTTCTAT GGCGTCCTCG CGAACGTCTT TTGTCCAAGC CAGAGATTCG CAAAGTCAAG AAAAATCTCA AAAAGTACGA AGAACAGTTT GATAAGGCGG ATCGGGAACG ACAGCGTGCT CTGTACTTGG AAGAGACCAA GGGAAAGCGT TCCGAACGCG CAAAGATCCG TGAACTTCTC GCTCGAAACC GGGCTATCCG ACGAAAGCAG CGTGCCGAGT ACATTGCCCT GTTGGGAGGG TATGACTCGG AGGACGACTC TCACTATGTT ATCCGAGATC TTACTATTGA GACTGTGCTT AGTACGAAGG AGGAAGTCGT TACGTAGTAT ATTCTAGCTG TTCAATTATA AAAAGGC
|
Protein sequence | MATEQSREDE LAAEAEELDG YFSDTPLDPQ PDYPQLRESF ESAVIITNLP KVGESKMDKL TKVVMKLVSR IGTLVENPET GFTGVLMPHD KDNDTTMGFC FAEYTTKEEA KNAVEVLQGY KFDKNHSLSV TLYARAEQLK DLDVKEFQEP QPTPFQEKPN AMSWLEDPNQ RDSFVIRHGK ETVVNWFDGR NDPILDYDGA REKEAGVQWC EFYCHWSPCG SYLATLVPDR GVILWSGENY EKSGRFVAPG VKHIVFSPEE NYILTNNDDP TDPGAIKVYH IPTGKLLRTF PLFPDGVATD IPPPPFLWSH DDQYLARMGD GLISIFETPG MRLLDKRSLA AEGICEFQWS PKANVLVYWA PEAENSPAHV DVIEIPSRKK LRQKNLFNVS RCNMVWQEQG DYLAVKVTRH TKSKKTLYNN IELFRLNEPG VPVEMLDTKD AVMALSFEPR GSRFAMIHAE NPSASKVNVS FYDMMKRESQ FTYVPELNKI ETLEGKQCNV IFWSPAGSNI ILASLGDTAS GTLEFYDVDS KSLVVKEHYR SNQVIWDPDG RSVATVVSQP IEGGHFKFAM DNGYIIWSFQ GKQLYQQSFE TFYQFLWRPR ERLLSKPEIR KVKKNLKKYE EQFDKADRER QRALYLEETK GKRSERAKIR ELLARNRAIR RKQRAEYIAL LGGYDSEDDS HYVIRDLTIE TVLSTKEEVV T