Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41243 |
Symbol | |
ID | 7199064 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 286714 |
End bp | 288431 |
Gene Length | 1718 bp |
Protein Length | 549 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185167 |
Protein GI | 219130008 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCGTC CGAGTCTACG GTCCGATCTT TCCGTAAGTT TGCTTCGCAA ATGCTGCGCC TCCAATCGGC GGATACCGGG CAACCTCATT TCAGTGTTCC AGTCTTCCCT GACGCCTTCG AGACTCACGC GCGGAAGCAG GGCTTTCTCA ATCCGGCCTG CGATTGGATC AGCTCTCGCC ATTTGCCAGT CTTCCGCACC GCGAGTTTCC TTTTCGGCCG GGAAACTTTT TGAGTATGCG AGTCACAAAG GCATCGGACC CAAAAGTGGC TCCTCACGTT CCGACACGGT ACGCCTTTTT TCCTTGGAAA CGCGCAGGAC GGTCTTGATC AACGAGGACG GATTTAAAGT TCACGTTTCT TATCGAGGCG AACAGTCGAG TTTTGACGCT TCTTGGTTGT GGCACAACGA TCCTAGCTAC GTGCACCCAT CGTCGGGTCA ACGACTGCGG TCGTCTCCGT GGACGGGGTT GGGTCGTGCA ATCGAATCGG CATGTATCGT TACAGAGAAA AGGGCACAGT CGGAAGCATC GGAAGGGATT GTCATCCCCA GTGCGGCTCC ACTGGGAAGT CTGCATCCGA TCGGTAACTT ATACCAAGGC CCCATTGTCC AAATGGATGC CTCGTCACGG CTATTGCTAC GAATTCTATG GAAAAAGACC CCGATCGATG ATCCTTCTAC TTTGGCACCC ATATCTTTCT TTGACATGAA TTGGCTTTGG CGGTGCCGAT ACGATGCTAC TGCTTCGGAG CGTTGCCTGC AAGAGACACA AATACGGAAA GAACACGCTT TGAGTCGAAC TTCCACTCTA CGGAACGTAT TATTCGATGG CCTCCTACTG ACCAATCAAC CCGCCGATAT GGATAATGCT CGTTATGACA TTCTTGACGC TGTTGTCCAT GACGGTGCGG TTATTGTCGC TCAAACTCCC GACACCCTGG ACCAACACGA AACAACAGTA GGCTACGTAG GTCACAGCTT GTCGGGAGGG GGTTTGTCCC ACGGGCAACT GTACGGCGAT ATCTTTCACG TTGAGACGGC ACACAATGCA CACAATTTGG CGTACACTTC AGTGGCTCTA CCACCGCACC AAGATCTAGC CTATTATGAT TCCAAGCCAG GCCTACAGCT TTTACATTGC GTTGCCAACA CCGCCGACGT GTTGGGCGGT GAGTCGGTAC TGGTCGACGC TGTTGCAGCG GCACAAGAGC TACGCAACCT TGCACCCGAC CACTTCGAAG CATTGACAAA ATGCCCGGCG ACTTTTTTGA AGCAACGGGA AGGAGCCGAT ATGGTCTACC GACGAACGCA CGTGGAGCAA GATTCCACCG GATCTGTTGT CGCTGTCCAT TGGAGTCCAC CATTTCAAGG ACCGCTGTGC ATACGTCCCG ACTTGGTCGA CAACTACTTT GTCGCGTATG CGGTCTTGGA ACGTATGTTA GATAATTCAC TGCCGCGCGA TCGATTCATC CTACCCATCG CACCGGAACT GGAACAGTCG TTAATCGACT ACGCACATGA GTATACGTGG CAACACCGAC TGGAAGAGGG CCATCTCTTG ATATTCAACA ATCAGCGTAT GCTGCATGGG AGGCGAGGAT TCCAATTGTT GAGTGCCACG GCGGCGCGTC GTTTGGTAGG ATGCTACACC GATATGGACG ATACAATGAA TCAATATCGT CTACTACGAC GACAGAGAAT GCTTGTGGGA GAGGATGA
|
Protein sequence | MVRPSLRSDL SVSLLRKCCA SNRRIPGNLI SVFQSSLTPS RLTRGSRAFS IRPAIGSALA ICQSSAPRVS FSAGKLFEYA SHKGIGPKSG SSRSDTVRLF SLETRRTVLI NEDGFKVHVS YRGEQSSFDA SWLWHNDPSY VHPSSGQRLR SSPWTGLGRA IESACIVTEK RAQSEASEGI VIPSAAPLGS LHPIGNLYQG PIVQMDASSR LLLRILWKKT PIDDPSTLAP ISFFDMNWLW RCRYDATASE RCLQETQIRK EHALSRTSTL RNVLFDGLLL TNQPADMDNA RYDILDAVVH DGAVIVAQTP DTLDQHETTV GYVGHSLSGG GLSHGQLYGD IFHVETAHNA HNLAYTSVAL PPHQDLAYYD SKPGLQLLHC VANTADVLGG ESVLVDAVAA AQELRNLAPD HFEALTKCPA TFLKQREGAD MVYRRTHVEQ DSTGSVVAVH WSPPFQGPLC IRPDLVDNYF VAYAVLERML DNSLPRDRFI LPIAPELEQS LIDYAHEYTW QHRLEEGHLL IFNNQRMLHG RRGFQLLSAT AAQNACGRG
|
| |