Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47423 |
Symbol | |
ID | 7202456 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 591498 |
End bp | 593202 |
Gene Length | 1705 bp |
Protein Length | 542 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181760 |
Protein GI | 219122870 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGATCTGAAG AGACGACATT GTCCCATGAA AGCGTATTTA GGGTCTTCCG CGATTTGGAT TCTTCTGCCC CCAACGCTTT GTGTTTCTGC CTTGGTACTG ACTAACAGCA AGAATGTGAA TCCGATCACC CTTCAGAAAA GATCTTGGAT TACCCGACTC GAATCGAAGA GAGTGGCGCG CGACGATATA GCTCCATCAA AGTTTGCCAA GGTTGACCTG GAGTCTCTAG CATCGCTGAA TACTTGCGAG AGCGGCACAA AAGCACGGCG CCTACTTCGG AAGATTTTAC TGGAAGATAG AGACTCCCAA AGCCAAGTTC TCTATGGATC CGTTAAAATA CCTCCCGGGG CTTCGTCTTG GGGCATCTCC GACGGAGATT TAGCCATTCA GACTCGTCTA TTGAATTCCA AATATTCAAT TATGGATGTT ATCGAGCTTT CGGGAAATCG TGATGCTGAT CGTGCCTCCT TGTCACTTTT GTGTCTAATG GTAGCGAGCA CGATCAGTGC CATTATCGCG AACCAACATC TACCAGGGCC CGAGATTTTA CGATTTATGG TGGTATGGTT GTTCTGCTTT TCCCCTTTTA TTTTCGTGGG TTACGGAATA GCGACACCTG AGAAATTGCA AGCTTTCCTT GTATCCGTTC AGCGCGAACT TTTTCCGGCA TATCGACAGC GAATGCTGCA ACATGAGGCT GGACATTTTT TGATGGGACA TCTACTGGGT TTACCAGTAG CAGGTTACCA AGCCAACGCA GTCAAAAATG CCGTCAATTT TTATCCTCTA GCCGACACCG ATCGAAGTTT CGACTTGGCA AGTCAGTTAG GGTTTGACAA ATCCCTCTCG GCACGGAGAG TTGAACAAAA TTCAGAAGCA TTATCCGCGG ATCAGCTGTC AGCATTTGAT GCCCCATTCT TTTCGGAAAC AGGTCGAGGT GCATGGCTTG TCGAAGAAAG GTCTGTTTTC CGAAATGCTC AGAACTACAC AAACAATCCT TTCCTCAAAC TCGCGTCACG GAACGAGCCG TCCAATTCCT GGCCTTTTCG AGGGTTTGAT GAAGCAACAT TAGACCAGCT ATCAGCAATC TCTGTGGCTG GTGTGTGCTC TGAGATTTTG GCGTTCGGTA TGTATACAAA ATGTTCTTTT TGTCTGAGAG TATTGAATTC TGATACAGTG TGTATACGCC TTTTCCTTTG GGCAATAGGA AACGCCGAGG GCGGAGTAGC TGATCTGAAT CAACTACGAC AGATTTACCG TTCGGCCGAG CCGAGTATCA GCGAACGAGA TGAAGAAAAC AGGATTCGGT TTGCGCTGGG GTACACAATG TCGCAGCTAA GACGACATCT AGGGGCCCTC GATGCTTTGT CGAAAATCAT GGAACGTGAC GGCACTATTG CTGAGTGTGT TGCGGCGATC GAAAATTGCG AAAACATGAG TGGAGTCGTC TCTATTGCTG GCGAAAATTA CGAAGTCCAG AGACGGAAGG ATTTCTTGTC AGACCAGAGG AGTCCTTTAG AGCGCTTGTT CCTTGGTGGA GGAAGAACCA TTGACGAATT GGAAGATAGA CTGGTAGAAG GAAAAGGTGG TGGATATCGA AAAGAAACCT TTCGACTGAC AGGCGATGAT CCTGTTTATG CTGCTATTGC AGTAGCCCTG CTCTTTGCAC TGTGGGCGAG CGCTGGCGGC CTATCACTTC ACTAA
|
Protein sequence | MKAYLGSSAI WILLPPTLCV SALVLTNSKN VNPITLQKRS WITRLESKRV ARDDIAPSKF AKVDLESLAS LNTCESGTKA RRLLRKILLE DRDSQSQVLY GSVKIPPGAS SWGISDGDLA IQTRLLNSKY SIMDVIELSG NRDADRASLS LLCLMVASTI SAIIANQHLP GPEILRFMVV WLFCFSPFIF VGYGIATPEK LQAFLVSVQR ELFPAYRQRM LQHEAGHFLM GHLLGLPVAG YQANAVKNAV NFYPLADTDR SFDLASQLGF DKSLSARRVE QNSEALSADQ LSAFDAPFFS ETGRGAWLVE ERSVFRNAQN YTNNPFLKLA SRNEPSNSWP FRGFDEATLD QLSAISVAGV CSEILAFVCI RLFLWAIGNA EGGVADLNQL RQIYRSAEPS ISERDEENRI RFALGYTMSQ LRRHLGALDA LSKIMERDGT IAECVAAIEN CENMSGVVSI AGENYEVQRR KDFLSDQRSP LERLFLGGGR TIDELEDRLV EGKGGGYRKE TFRLTGDDPV YAAIAVALLF ALWASAGGLS LH
|
| |