Gene Information |
Locus tag | PHATRDRAFT_39311 |
Symbol | |
ID | 7195040 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | + |
Start bp | 188356 |
End bp | 189843 |
Gene Length | 1488 bp |
Protein Length | 414 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183309 |
Protein GI | 219126114 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTCAC GACTGTCGCT AGGCGGCAAG GCTACGGCAT TGCGCGATGT TTCGTCACAT CAAAATTCCC GATCGTCTTC TTTCTCCCGC CGCATGTCCT TCAGCGGTGA CGCACTCAAC GCGTTACGGG CTGAAACTCC CAAAAAAGAT CGCCATGCCA TGCTGGACGA ATGGCGTCGT CAGTCTCGAA CCCGTGACTT CTCCAGTAGC GTGGGTATGC CCACGGATCA TGATTCGTCT ACGATCACAG ATGGAGCAGG ACCGACAGTC CCCACCGGCA AACGGTTGCG TACACGTCAT CCCGACGGAG TGCCCCCTCT ACCACCGTCA AACATCTCCG CAGCCGACTC AGGACTTTCG GCTCTGGAGC GCATTCGTCA ACGTAAGGCT GAACGGGAAA AACAGCAGAG TATCGCATTC GATGCAATGA ACGCCCATAA TTCGTCCATC TGCTACGATG ACAGCAACGA TGCACCACGA ATCGCAAGTA GCGTTGGCCG CACTTTATTG CCCGGAACAC CCAGCGGTGG AAACCTTTTC CGTGGTGGGG CAAGTCGTCG GCGCAGTATG TCGGTTCAGC CTATACATCG ACGGACTTCG ATACTCTCCA CCGACTCTGA ATTTTCATTC TCACAAGGAA GTACTGGCAG TAATGGTTTC AGCCAACGAT CGACAGGGAT GAGTCAAGAC ACGATAATGT CGGACCCGAT GCCTTCTTCT TCGTCCAAGG GGATGGATAC GGTGGAGGAC GCTGTTGTAC AGCAGCTGCA GCAGCGGGTC AGAGACTTGG AACGAATCAA AATGGATCTT AGCATGGAAA TTGCTCCACT CAAGGCCCGA CTACGTCAGA AAGAGGACAT TTACTTAAAA GAGCAGAAAA AACTATTGCA AGAGATTGAA GACTTACAGG AAGCGAACCA AGGAGCGAAT GAGCGCAACC GAGACCTTCA AATGCAATTT GAAAATCTGA AGGAAGAATG CAAAAAACTC CAAACCGAAG TCCGCAAAGC ATCGAGTACA CTGCATAACA ATAATCATAT TGATGGTGCT TCAAGTGGTT GGAATCGCCA GTTACAGAAT GATCGGGATG TAGCAGAACT CAAGGAGAGA TTGAGGAGTG CCGAAGATGA GATCGATTCT ATACGACTAA CAAAAGTTTC GGTCGAAAAA GAGTTACACG GCACGAAGAT TGAGCTTGAT TCACTCTATC GTAGTTTTGA TGAACTTCAG ACCGAGTACG AGTTAGTATC TCAGTCGACA TCCGATAATC GAGAAGCTGA AATGAAGTTA GAGCATTTGA CGACGGAGCA CATTGCGACT TCTGCACAGC TGAACGCAGT CTGCGCAGAC CTAGCAGCTA CGAAAGCTCG CGCCGCTGCA ACAATCACAG CCAAGGAGGA AGAGCACAAA AACGAAGTTG AACAGTTGCA TTTTGAAATG AGTGTATTAA AAACACGTGC TGGAAACACG GGCGGCGCTG GCACCGAG
|
Protein sequence | MSSRLSLGGK ATALRDVSSH QNSRSSSFSR RMSFSGDALN ALRAETPKKD RHAMLDEWRR QSRTRDFSSS VGMPTDHDSS TITDGAGPTV PTGKRLRTRH PDGVPPLPPS NISAADSGLS ALERIRQPTM HHESQVALAA LYCPEHPAVE TFSVVGQVVG AGMDTVEDAV VQQLQQRVRD LERIKMDLSM EIAPLKARLR QKEDIYLKEQ KKLLQEIEDL QEANQGANER NRDLQMQFEN LKEECKKLQT EVRKASSTLH NNNHIDGASS GWNRQLQNDR DVAELKERLR SAEDEIDSIR LTKVSVEKEL HGTKIELDSL YRSFDELQTE YELVSQSTSD NREAEMKLEH LTTEHIATSA QLNAVCADLA ATKARAAATI TAKEEEHKNE VEQLHFEMSV LKTRAGNTGG AGTE
|
| |
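The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the source database: it assumes Start bp and End bp are 1-based inclusive coordinates, and the function names are hypothetical, chosen for illustration only. The GC helper ignores the spaces that separate the 10-base groups in the Gene sequence field.

```python
# Values copied from the record above; the function names are illustrative only.
START_BP = 188356
END_BP = 189843
PROTEIN_LENGTH_AA = 414


def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int) -> int:
    """Bases needed to encode the protein: three per residue, stop codon excluded."""
    return 3 * protein_length_aa


def gc_percent(sequence: str) -> float:
    """GC percentage over A/C/G/T only; spaces and other symbols are ignored."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


gene_len = feature_length(START_BP, END_BP)
non_coding = gene_len - coding_bases(PROTEIN_LENGTH_AA)
print(gene_len, non_coding)
```

Under the inclusive-coordinate assumption the computed length is 1488 bp, which agrees with the Gene Length field. The 414 aa protein accounts for 1242 coding bases, so the remaining 246 bp of the gene span would be non-coding, which is consistent with the record marking the gene as spliced. Passing the Gene sequence field to gc_percent gives a value to compare against the GC content field.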