Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46043 |
Symbol | |
ID | 7201527 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 71778 |
End bp | 73347 |
Gene Length | 1570 bp |
Protein Length | 334 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180563 |
Protein GI | 219119614 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGAGGCGGAC GACAGAACCG CAGCTCTCCT CCGGCCATAA TCGGTAGAAA CGATCGAAAG AAGCTGGAAG ATGGGAGAGG ATCAGACGAG CGTTTTGTTG ACGCACCGGC TTCGAACAAG CGACAACGAC GATCGGCAAT CTCGTTTTAA CGGAACTGCT GTGATTCTCA TCGTGTTTAT CATAATATCC TGCGCCATTT GGTGAGACTC ATTACGTATA ATTCGTATAA AACGATCGAT CGATTCAGGG TCTCCGCAGT GAATAGGGAG GAGTGTGGGA ATTAAACTTT TGCGTTGCTA ATGTTGATTC GTGTTTTTTT AGGTTGCCGG CGTGTATCAG GAAATGCCGT CGGTCTGCTT TGCACAGAAA CCAATTTCAA GGACAAGGCG CCACCCAGAC AACCCGTAAC GTCCCAAATA GTAATGTACA GCGTATGGGC GGAGTCATGG CGGAGCAGAT TATCGAATCA CTGCGGCGTC TCATGGAGCA AAGCAGAGAA AATGAAGGTG TCGAAGGAGC TGCGCTGGCC GATCGCAAAG AGTACATCGA GAATGTCTTA ATGACCAAGG TATGTACGGC CAGCCTAGAA TTTCGCTGTT GCGCATGAGC CAAACCACGT TTTCACTGTA TGGTCGCTTA CTGCAGAATG TCGAGCTCGG AACGAATATA CGAAAGTTAG AAGAGTCCTT TCATAGGAGC ACAAACGACG GAGCTGAACC ATCATTAAAT GTGGTCGGAT CAGAGGGAAG CGCAGAACTT CAACCATGGC AGAGCGACAA CGATAACGGA CATGAATGCT GTGCCATCTG TTTGAGCGAT TACCAGGATG GTGATGTGAT CGGTTGGTCA CACAATAAAA ACTGCAAACA CATATTTCAC CGAGAATGCA TAAGCGAATG GCTACTGACA CATGAAGAGT GTCCCTGTTG CCGACACTAT TACTTGTTTT TCATGGAGGA TGGTGACCTG AACGGTCAAT CTCAATCTCC TTTACCACCA CCCCTACCTG CGCCTCTCTC TCGTGAGGAA GAGCAAAGTT GGACGCTGGA AAGAGGCCTA CGGATATTCT ATAATTTGAC AGGGAGTCCA TCACAAAACT ACTCAGACAC AGGAAATACA GATACACGAG GGAGCGATGT TGAGCTTGCA CGTCCAGCAA TAAGAAATGC GAATGAAGAC TCTTTTTCAA TCGGACCTGA TGTTTCCTCA ACCGACAATC AAAGTGGTGT GATCGTTGTC GACCGCCCGA TCCCAGGTAG AAGATCACCT TAGGCACACC CGGAAATGTA CAGTCAGAGA CGCACTGGCA AACGCCGAAG CTGATGAAGA AGTCAAATTG GTCGTTGAAA GGTACTACGG GACGTAGTAT TTGCAGCTGA GAGATGAACC AAGTAGGTAG TCGAATGCGA AGCTTCTCCA AAACTTTTTT TATTATACAA CTGCGCAGCT CTCAAGAGCA ATGCGGCATT TTTGGGAAAC ATCTGCAGCT ATCTTACGTT GGGCACTTGT CTGATCCTGC TTTTAAATAT CGAGTGACTG TAATATAGTG CTATAGAGCA ATCTTTCTGC
|
Protein sequence | MGEDQTSVLL THRLRTSDND DRQSRFNGTA VILIVFIIIS CAIWLPACIR KCRRSALHRN QFQGQGATQT TRNVPNSNVQ RMGGVMAEQI IESLRRLMEQ SRENEGVEGA ALADRKEYIE NVLMTKNVEL GTNIRKLEES FHRSTNDGAE PSLNVVGSEG SAELQPWQSD NDNGHECCAI CLSDYQDGDV IGWSHNKNCK HIFHRECISE WLLTHEECPC CRHYYLFFME DGDLNGQSQS PLPPPLPAPL SREEEQSWTL ERGLRIFYNL TGSPSQNYSD TGNTDTRGSD VELARPAIRN ANEDSFSIGP DVSSTDNQSG VIVVDRPIPG RRSP
|
| |