Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35250 |
Symbol | |
ID | 7200604 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 454891 |
End bp | 456691 |
Gene Length | 1801 bp |
Protein Length | 527 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179848 |
Protein GI | 219118134 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGTCGA CCAAGGCGGC CAAAGCGGCT GGGTCCTTGA GTAGCAAGAT TCCGTATGAA CTTCTCGCGC CCCAAGCTTT GATTGGACTG GTAGACGCTT TGTCATATAT GGTGACGGCC CCATCCCTTG TCTTCTACGT TCTTCAATCC GGTGGAACGT ACGGTCAGTA TGGAATGATT TTATCTATTT ACAGCTTTGC TTCTTTTGCT TTCAAGCCTG TCCTTGGCTA TTGGTCCGAC AAGTCCGGTG GAAAATTTCG AATGCCATAT CTGTCATCCA TTTTCGTCGC AGCTTTCGGA GGTTGGTTGT ACTTCTTGGC CAGTGCCTTT TCGGGAAATG CCGCGATTCT CGTCATCTTT TTTGGACGTC TGCTTGGAGG CATAGGAGCT GGTACGTTTT CAGGTGCCCC GATTTGGAAT TTGTAATCGC TGGATGAACT GTACCGTTCT CACAAAAAAT GAACCTCGTT TCTACCAGCC AACAATACGC TTGGCTTTAC CTATATAGCT CAGGTTATTC CAAAAGAAAA TTTGACCCAA GCTAGCGCCG TTTTGAGTAT GACACGGATT GTTGGAATGG TAATCGCCCC CGGTTTGAAT GTCTTCTTAG CTTGGATTGA CAAGGACATC TATTTGGGAT CATTCTCGCT AAAGTTGGAT CCTTTGAACA GCGTTGGTTT GTGCATCATG TTTGGCAACG TAGCCGCTTT TTTCATCGTT TATTTTTTGT TGGAAGAACC ACAAGAATCC ACTAGACCTC GACGAGCAAG TATTGACAGC GTTGACGGAC GCAGTTGGAA ATTTTGGAAG AATATCTTTA CTATGGACAT CATGGTCCCC ATGCTTTCCA TTTTCTGCAT GAACGCCAAT TTCCAGCTTC TAGAGACGAG CATGGCACCG GCTGCTAGTG ACGCGCTTGG TTGGGGACCG ATTGGAGTCT CCTCCATTTT TGGGACGAAC GCATTCTTCA TATTCTTTGC CGTTATTGTT ACTTTAAAGC TGTCAGAATG GGGCGTGTCG GATGAGGGCT TGCTCAAAAT TGGTCTTTGT TTCTCCATTG CAGGATATTC ATTGGTGTAC ATTCTATGGG AAAATCCTAC CAATGTCATT CAGTTTTTCG TACCGATTGC TCTTTCGACT TGTGCTTTTC CGTTCATGGG TGCCCCCACC CGCAGCCTCT TTACTCGATT TGTCGACAAG AATCATTATC TTCGGAATCA CCAAGGTACC CTGCTATCAC TACCGTTTTG CGAACATATG TCCTGACTGT GCACTTACAC TTGGTCTCCT ATCTTTTGAC CGCCATTGTC AACCGTTGGT ATCAGGATCC ATGCAAGCAC TCTTGTCGAT GGTCGCGTCT GCGGCTGGAT TTTTGGCACC TGGATTTATT TCAGCATATG TTCTACGCAA ACCGGAGGAA GTAGCGGCCA GTTCCAATCA CCGCGAGCTC GCTCCGCTCG CTCTGTTCGC ACCGATACTG AGCGCCTTGA CTTTGATTGG CTTTCTTTGC TTGGAACGTA AAACACGTCT AGGAAAACCC CTGGACGAAT CTTCAAAAGT AGACGAAGCT ACATCATTAG TGGAACAGGG GGAAGTTTTG AAGGAGGGCG ATGAATGGAT ACGAGAGTTC CATCCCCGAG TCGAAGCTTA TCGTCGAAAC TCGGTGACCC TCATGGGCAT TCCGCAGATC TGTCTCGATC AAACGCCCAA CGTCGCGAGC CGTCGGCATT CCACCGCGGT CTCCTTGGGA ACTCATCCGG CTGCCTATTC CGAGGCCCGA CGGGCAACGA CGCTTTTCTA A
|
Protein sequence | MVSTKAAKAA GSLSSKIPYE LLAPQALIGL VDALSYMVTA PSLVFYVLQS GGTYGQYGMI LSIYSFASFA FKPVLGYWSD KSGGKFRMPY LSSIFVAAFG GWLYFLASAF SGNAAILVIF FGRLLGGIGA AQVIPKENLT QASAVLSMTR IVGMVIAPGL NVFLAWIDKD IYLGSFSLKL DPLNSVGLCI MFGNVAAFFI VYFLLEEPQE STRPRRASID SVDGRSWKFW KNIFTMDIMV PMLSIFCMNA NFQLLETSMA PAASDALGWG PIGVSSIFGT NAFFIFFAVI VTLKLSEWGV SDEGLLKIGL CFSIAGYSLV YILWENPTNV IQFFVPIALS TCAFPFMGAP TRSLFTRFVD KNHYLRNHQG SMQALLSMVA SAAGFLAPGF ISAYVLRKPE EVAASSNHRE LAPLALFAPI LSALTLIGFL CLERKTRLGK PLDESSKVDE ATSLVEQGEV LKEGDEWIRE FHPRVEAYRR NSVTLMGIPQ ICLDQTPNVA SRRHSTAVSL GTHPAAYSEA RRATTLF
|
| |