Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40581 |
Symbol | |
ID | 7198462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | + |
Start bp | 337223 |
End bp | 338776 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | |
GC content | 59% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184527 |
Protein GI | 219128663 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.365172 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATCGTG CCCGGTGGAT CGTCCTTTGT TGGGCCCAAC CGAACGTACG GGGGACTCCC TCACCGTCCC GTTGGGGGGT CCGACGCTTC CCCCAAGCCG CAACCCTTCG CCATGGTACC AAAACGACGG ATGCCGGACG AGAATTCCCG GTACGGCACG GGGAACGTCG TGGTACCGAC GATACAATCA CGGACAGTGA CAACGCTACC AGCAGTCTCT TGCCGTTCGC TAACGACGAA TGCGACGCCG TTCCAAACGT CGCCAACAAT CCCGCAGGGC CGGACGGTAC TCCTCCATCG GTCACCGCCG CGGTCACCAC GGATGCTTCC GACGAGACCT TCCGGTGCCG TCCTACTAGA ACACCCGTTA TTGTCACCAC AAACAATATC ACGCAGCCTT TGCCCACCGC ATCTACCGTT GGTCCGGCAC CGGTCCATCA CGGACACGGG GACCCGGCCC TGCATCAGGT CGAACACCGC GCCTTGGAAA AGACGAGCGT TGCTCTGGCC GAAAAGTTTC TGGAAAAGAC GGTCGAACGC ATTGCACCGG GACAAAAGTT GGCCGAAACC ATCGTCCACT CGGCGGCCGG GAGTTTGCTA CAACGACGTG CTGGCGAACG AGTGGCGGAG CGCACGGGTG AACGGTTAGC GGAACGAGCA GGTGAACGGT TAACGGAGCG CACGGGTGAG CGGTTAGCGG AACGCACGGT TGAGAGGCTA GCGGAACGCA CCGGTGAGCG GTTAGCGGAA CGCACGGGTG AGAGGCTAGC GGAACGAGCA GGTGAACGGA TAGCGGAACG CACCAGTGAA CGATTAGCGG AACGCACGGG TGAGAGAATC GCAGCACGCG CTAGCGAACG TTTGGTAGAA AAGACTGGAG CACGCGCGGC ATCGCGTCTG CGAAAGGGGA TTGGTGAGCG CCTTTCGGAA TACGCCGCCA AAATTCCGAC CCGTTGGAAT CGCATCTGGG AATCTGCCCT GGGCAGGGGA GTGGAACGTA CAACCGAGCG AGGATTGGAG CGATCCGCCG AACGATTCGG AGAACGGGGA CTGGAACGTG CGGTCCAGCG CAGCGGGGAA CGGGCTGCCG AGCGGACGCT GGAGCGAGCT GGTGAACGCG CCGCGGAACA CACGCTGACC ACAGTAGGCC GAGGAGCAAC TTCGGCAGTG GAGCGCATCG CGGGAGTCTC TTCGGAGCGG GTTGCCGTGC GGGCCGGGAG AGGCTTACTC ATCACTCTCC CTGCATTGGG CGGATTCTTT GCATTGTTGC TTCTCAAATC AGACATTCAA CGAATGCGTG AAGAATGGAC GAATAAATCC AAATCGTCCA CCTTATGTTT TGTTGGCGCA GGGCTTGCTG ACCTTTTCGA CTCGTTCTTG CACTTCTACA TCGCCTATGC TCTTTTTGCA CATATTGGGC ATCAAGCCTT GGTTGTTCCC GAACAGCTCA GTATGGGCTG CGCTATAACC TCTACCATTT GCGCCGTTGC AGGTGAGGTA ATAAGTCTAC ATCTGCAGCG ACAAAAGAAA AAGAGAGCTT TGACCAAATC ATGA
|
Protein sequence | MHRARWIVLC WAQPNVRGTP SPSRWGVRRF PQAATLRHGT KTTDAGREFP VRHGERRGTD DTITDSDNAT SSLLPFANDE CDAVPNVANN PAGPDGTPPS VTAAVTTDAS DETFRCRPTR TPVIVTTNNI TQPLPTASTV GPAPVHHGHG DPALHQVEHR ALEKTSVALA EKFLEKTVER IAPGQKLAET IVHSAAGSLL QRRAGERVAE RTGERLAERA GERLTERTGE RLAERTVERL AERTGERLAE RTGERLAERA GERIAERTSE RLAERTGERI AARASERLVE KTGARAASRL RKGIGERLSE YAAKIPTRWN RIWESALGRG VERTTERGLE RSAERFGERG LERAVQRSGE RAAERTLERA GERAAEHTLT TVGRGATSAV ERIAGVSSER VAVRAGRGLL ITLPALGGFF ALLLLKSDIQ RMREEWTNKS KSSTLCFVGA GLADLFDSFL HFYIAYALFA HIGHQALVVP EQLSMGCAIT STICAVAGEV ISLHLQRQKK KRALTKS
|
| |