Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35370 |
Symbol | |
ID | 7200665 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 756208 |
End bp | 757818 |
Gene Length | 1611 bp |
Protein Length | 487 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179907 |
Protein GI | 219118257 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGCTGA TTGCGATTTC GCTTTGTTGT TTGATGCCGT GCACGGTGCG ATGCCGATCT TGGCGCAATA TCGAGCCTCT TCACGGCTGG AATGAGAATG ACACTTCCGG TACTATTTGG AGAATGGAGT TCAATCCGTT ATTCACCTCG GCACCCACCT CGATGCCAAC TACAGCGACG CCATCCGACA TTCCCTCATC GAGGCCGTCA TCCTTTCCTT CGGCTCCGCC ATCGGCCTCA CCGTCCGTAG CGCCATCCCC TTCACCTTCG ACCGCACCAA GCGAATCTGA TCCATACAGA CCAAATGATC CACCTAAAAA TCCAGAGCAG TGGTATTTCA ACTACGATAC CTCTGCAAAT GCTTTGTATG GGCCTGGGCA TGCTGGCATT ATTCAACAGC AGAACAACCA GTTCAATGTC GGATACAAAA ACAATCGGTG GGGCTCGGTA GGAAACCCTC CTAACAATTA CTGGACTGAG TTTATGGACA ATGGATTCGG TCCATGGAGA GGGATTTTGG CAAACCGAAA TCCTACTCGA AACATGTGTG ATCGAGTCGG GATGCAGAGT CCGATTGACC TTCGGCCGAG TGGGGCAGTT TGTGATGAAC ACCATGAGGT TCGATCTCGT GTAAGGAATA TATGCTGTTC GTCTGAAACT CAGTGAATGT AGCTTCCTTT TAATCAATTG TCTTTATCAT TTTTACAGAG AGGAGATTTT CAAATTTTCG AAGACGAGGT AACCAAAGAA ATTCAGCCGA ACAAACTACG ATTGCGATAC AAACGACGTC CATGTCGCAA TCTGAATGAA CTGGCATGCC AAGAGGTAAG CATGTCTACT GTGCAATTTT TGATTCCAAA AACTTGAACT GACAAGTTAG ATTGCATCCT CTAGCCGGAT CCACCGAATG CGGACTTTCC TAACAATTGG GGTGGCTATG CAGACGTGAC CCATATTGAC TTCAAAGTTC CAGGAGAGCA TCTGATTCGA GGCGAAAAAT TTGATGGGGA AATGCAGATT TTTCACATTC ATCGAGGAAG GCGACGTATG GTGGTACAAA GCGTTACAAT TCGAGCCACA AGTACAGGCT TCAACAGCTA CTTCCAGGAG GCGATTGATG TCTTTCGAGC GGTATATGAC ATCAATATAG CTCGATGTTC GGCCCTTCGA AGAAAGGAAC GTCGTCTTGT CTCGAATGCC CATATTATAT TGGGAAAGAA CATGACTAGC AAATTCCATG ATTATTCATC TTGGGGTGAT TTCTCGACGG GACTTGAGGA TGTTGAATTG GAAAGCAAGC GCTCGCTTCG AAAATCAAAT TGGGATCCTT ACCACGAGCT GCTCATTCCT TCCATACATT TTTATCGGTA CGATGGATCT TTGACAGAGC CACCGTGTGG CGAATTTGTC TCTTGGTTTG TTTCTGACAC TCCCATGAGA ATCAGTTTGA GCCAGTTGGA AGAGGTCAAA ACAATATTAT TCAAGAATGT TGACGAAAAT TGCCAACCTA CCAGTGTACA ATTTGGCCAT AGCGTGGCGC GCCCGATTCA GGAAACAGCG GGTCGTCCAG TCTGGCAATG TACTCCCCGA GAGTTTGGCC CCGACCCGTA A
|
Protein sequence | MRLIAISLCC LMPCTVRCRS WRNIEPLHGW NENDTSGTIW RMEFNPLFTS APTSMPTTAT PSDIPSSRPS SFPSAPPSAS PSVAPSPSPS TAPSESDPYR PNDPPKNPEQ WYFNYDTSAN ALYGPGHAGI IQQQNNQFNV GYKNNRWGSV GNPPNNYWTE FMDNGFGPWR GILANRNPTR NMCDRVGMQS PIDLRPSGAV CDEHHEVRSR RGDFQIFEDE VTKEIQPNKL RLRYKRRPCR NLNELACQEP DPPNADFPNN WGGYADVTHI DFKVPGEHLI RGEKFDGEMQ IFHIHRGRRR MVVQSVTIRA TSTGFNSYFQ EAIDVFRAVY DINIARCSAL RRKERRLVSN AHIILGKNMT SKFHDYSSWG DFSTGLEDVE LESKRSLRKS NWDPYHELLI PSIHFYRYDG SLTEPPCGEF VSWFVSDTPM RISLSQLEEV KTILFKNVDE NCQPTSVQFG HSVARPIQET AGRPVWQCTP REFGPDP
|
| |