Gene Information |
Locus tag | PHATRDRAFT_50112 |
Symbol | |
ID | 7198825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | + |
Start bp | 69136 |
End bp | 70490 |
Gene Length | 1355 bp |
Protein Length | 414 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184962 |
Protein GI | 219129579 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
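The coordinate and length fields in this block can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the coding region is the protein's codons plus a single stop codon. The record states neither assumption, and the variable names are hypothetical.

```python
# Values copied from the Gene Information fields.
start_bp = 69136
end_bp = 70490
protein_length_aa = 414

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the coding region is one codon per residue plus one stop codon.
cds_length_bp = (protein_length_aa + 1) * 3

# Bases inside the gene span that fall outside the coding region.
noncoding_bp = gene_length_bp - cds_length_bp

print(gene_length_bp)  # 1355
print(cds_length_bp)   # 1245
print(noncoding_bp)    # 110
```

Under these assumptions the span reproduces the listed Gene Length of 1355 bp. The remaining 110 bp would be sequence outside the coding region, which fits a gene that is marked as not spliced.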
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACGTGGTTCG AGTCGGAGGC ACTGCTACGC TCGACCAGCC GCACTTCTTT CGCATCGAAC ACCGGTAGTA AGCGGTCTTT GGGAAAGATC GTAAAGCGGT GGGACGCAGC ATGTCGCACT CGGAAGCAGC GAACGACCTT GACGGCAGAA TCCGTGCGGA AAGCAAAGGC ACAACGCTGT CGTTGAAACG AAAAGCTCCG CCGGAGCGTC GCGTCGGTTC TACAGCTCCA GACAAACATG ATCGACGTAT CTATGCAACA GCGGCTTCAC CGGGTCTGCT AGCTGTCTTC TCCGCCGACG ACTCGATCCA CAATGATTAC CGTAGCAGGT CGTACTCCGA GCCGATCAAA GCAAAGTTGG AAGTAGACCA GAGCCTACTC TTCTCACCGT CTTCTGCCGC GAACTATGAC GAGCTCCGAG AACGCAAGCG TTCCGCAACA AACTCTCGCA AAGCTGCTGC TACGGCTCGC CAAACGCTAG ACAAAGGTAC CCAAGGTAGG GAAAACCTGT ACATGCGTCG GGACCCACCT TCTGTTGGAA CCGAGTCGGT ATCCGTACAC GGTAATTCTT CGACGGAGGA GGACGATGTC GACCGCCAGC ACATCCTCAA GCCAAAGCCA CGTTCGCCTG CGGACAAAGC TCGCTCGCAA CGACGTCGAT TGCGGAAAAC AGCTGGTGCA GAAGTGCGCC GACGAAAGAT TCGAACGCTG CCATCCAATC CACTAGAATT TGAGGGAGAC TACAAGACCC AATTTGCCCC GGAAGACGGA ACCGGCAAGC CTCTCACCGC CACTAGTCTT TTGTACGTCA TGAATGTCAT GGAAAAAGAG TCCTCGGAAG CAGTCAAAGC TGTCACCAAG GGATTCTCGC GGGACGAAAG TCGTCGGATT CGTCAGTCCA CAACTAAACA TTCACAAGCT ATTCTGAAGA GTACGCGACT ATTTTTACGA AGCGCCAAAG TCTCGACCGT CTACACAGGA GCTGAAGACC CTATGGGGAG CGGGCTTGCA CAGGCCAATA AAACACGCAA AGCGGACATT GAACGGTCGA ATCGAATTTT GCAACAGTTG GAATCGCAAC AAGATGTTTT AGAAGCGGAG TTGGCCGAAT ATGTTGAGAA GCAGTCGCAG CTGGAAGAGA CACTGCGGGC GTTGCAAGCT TCTCAGGAAC GTGCTCATCC TTTGTTGATA CCTGCGCTGG CCCCCGTCGC GGAAGAGAAC GCGTGGAATG AAGGCTTCCC AGCGCGACGA TTTCGTCCGG CGCCCCGAGG TTGCATGCCC CAGCTTCTGG AAAAGCTGGG TGACGAAGAA TGGAGAAGCA GAGGCAGTGC TTTGAATACA AACAATCCAC GGTAG
|
Protein sequence | MSHSEAANDL DGRIRAESKG TTLSLKRKAP PERRVGSTAP DKHDRRIYAT AASPGLLAVF SADDSIHNDY RSRSYSEPIK AKLEVDQSLL FSPSSAANYD ELRERKRSAT NSRKAAATAR QTLDKGTQGR ENLYMRRDPP SVGTESVSVH GNSSTEEDDV DRQHILKPKP RSPADKARSQ RRRLRKTAGA EVRRRKIRTL PSNPLEFEGD YKTQFAPEDG TGKPLTATSL LYVMNVMEKE SSEAVKAVTK GFSRDESRRI RQSTTKHSQA ILKSTRLFLR SAKVSTVYTG AEDPMGSGLA QANKTRKADI ERSNRILQQL ESQQDVLEAE LAEYVEKQSQ LEETLRALQA SQERAHPLLI PALAPVAEEN AWNEGFPARR FRPAPRGCMP QLLEKLGDEE WRSRGSALNT NNPR
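The two sequence fields can be compared by translating the gene sequence from its first ATG. The sketch below is a minimal illustration, not the pipeline that produced this record. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field is blank. The helper name `translate` is hypothetical, and only a short excerpt of the gene sequence is used.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
# Assumption: the record leaves "Translation table" blank, so the standard code is used.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna: str) -> str:
    """Translate from the first ATG until a stop codon or the end of the input."""
    seq = dna.replace(" ", "").upper()
    start = seq.find("ATG")
    if start == -1:
        return ""
    protein = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# The 11th to 14th 10-base groups of the gene sequence field (a short excerpt only).
excerpt = "GGGACGCAGC ATGTCGCACT CGGAAGCAGC GAACGACCTT"
print(translate(excerpt))  # MSHSEAANDL
```

The excerpt translates to the first ten residues of the protein sequence field. Passing the full gene sequence to the same helper would be a way to check the whole 414 aa product under the same assumption.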