Gene Information |
Locus tag | PHATRDRAFT_48044 |
Symbol | |
ID | 7203031 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 809463 |
End bp | 811515 |
Gene Length | 2053 bp |
Protein Length | 577 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182460 |
Protein GI | 219124331 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGGAAA CCCAGCTGAT TAAAAGATCC AACAGACGAT CTTTTTTCAA AGTTGGTGGG CTGCATTTTG GCATGCACGG GTTGCTAGGT TTTTCTTCAT TATCTCTGAC CATCCTCGCA TACTACAGCT ATCCAAGCGA ATACCCAATA TGGATTGGAT TGTCGCAAGT TGCAAATCTA GTAACGGTTA CCCATGCCCG AAATCTTCTC TCTCAAGTGC CGGCATCCAC ACAAATTTTT CCTGGTATAG TTGCCCCGCA CAAGGAAGCA TTCCAACGGA CAATCAGCGG AATGCAATAC CTCGTCACAC GTGTCACATG TCTGGTCTTT CGGGACCATT CCATGGATAT TGGATTCCGT AGTACCTTGG CACTCTTACT GTGGCGTGCT TGGCCTTTAA TTCCTTCGTA CCAAGCTGAG TGGCTCAATG GCAATACGTG GATTTTTGTT ATTCCAATGG CCCTCGGTGT CGCAGGAGAT TTGATCCAGT TCTGGAACGG TGACGTCTTT TCGTCGCGGC AAATTCTGTC AATTCAATTG CATGGGTTGC TTATGGCCTT TGGCTTCACA CTGGGTTTTC GAAACTATTT ACCCATGCCA CTTGGTAAGG TCGTTTGTTT TGTATGATCG AAAAGCATCA ACATCCTGTT ATGTTTAAAA GATAAGACCA ATCGCGTTCT GACTTTTTGT TGCGATTGTT GTTTTATACA GTTTATATGG GAGCTGCGTT CGGTGTGTGG AAGATCCTGC GTGAGGGAAT AATGACCTTT GAAAACGCGT CGCGCGAACG GCTCGCATCC CGGATGGAAC TGTATGCACT CCCGGAGTAA ACCGGCCAAC TTTTTTCGGG GGAGGTGCGA AACTGGTCCA TCCAACCGTA AACATTCGAA AAAACGTTGA TTTGGGATCG GTAGAAGACA CGACTCCTCT GAACGCCCCC ACACGTTTGG CAACAAACGT CCGTTTAATG AAGTCGTGGG CCCCGCACAC TGTCAGCAAT TTTGCAACAA GCCTGGTGTC CCGTGCCCCC CTTCCAGACA CTTTTTCCGA CTCAGTACAA ACGCAAAGCC ATTGCTTCAG AATATCAATC AGTGGTCACA GCGCCATGAG ATCCCAGATT TTGTTCATAT GCTCCATTGT GCTTATCAAT GCTGATGCTC TTTGTTTTTG CCAGCGTGAC CATTTTTTCA AACATGGAAA CGTCCTTTGC AGGAAAGAAA AAGCAGATTT CTTTCTCCCT GGGGAAGCAT TTTCGGTGTG CACAACAGTT TTGGATGCCA CGGTGGTTGG AATGGACACG TTCACCATCG TGTACAACGG CGGTAGCTCG ACCGCGACGC CTGTCGCGCA CGGTGGTCTA GTGCACGACA CCGACGTCAC CCGACTCGAA TACGCCAACG GGCAAGTTAT GATTACCACT ACAATCGTTG ACGAAAATTT TGCTTCGCAA CACTCTGAAA TTGAAATCGA CGAGATGGTC GCCGTTACCA TATTGCAAAA CATTGGTGAA ATCCGTCAAT CCTCACGACA CCTTATTCAC TATCACGGTT TACCGCGGAA GGGAGATGAC ATTCAAGCTC GACCAAATCT GAGACGGTTG CAGCCAGGCT TAATTTCCAA CGACTCATTG ACACCAGCAG AAAATTCGAC TCGGCTTGTG CCATCGCCTC CATCTCCTAC AACCAGGACC GTCTCTTACC CCGATTGGAA GATTCTGGAG TGGTTGGATG AGTTACCCAC AGGGATGGCC TATTTTATCT GGGTTGTCCT GGCTGCCATC GTACTGATCA CTGGCTTTTG 
CTGTGTACTA TTTTCGGTCC TTCCCCTTAT TTACCGCTTG GAATCTTTTC GAGCTCACCG TGCTGAACAG CTGGCGATTG CCCGAGCTTG CGCAAAGATG GATGTGTTTA CGAACGAAAA TCTTACACAA TGCTTTGGAA GCAACTGGTA CAACTTATAC ATTGACGGTA CGCTGCCACT GGAAGCGATC AAATTTGATG ACGGCCTACT GCGTGTGGAA CGAAACCGCA AGCGTTACGA GCGCATGATG GAG
|
Protein sequence | MGETQLIKRS NRRSFFKVGG LHFGMHGLLG FSSLSLTILA YYSYPSEYPI WIGLSQVANL VTVTHARNLL SQVPASTQIF PGIVAPHKEA FQRTISGMQY LVTRVTCLVF RDHSMDIGFR STLALLLWRA WPLIPSYQAE WLNGNTWIFV IPMALGVAGD LIQFWNGDVF SSRQILSIQL HGLLMAFGFT LGFRNYLPMP LVYMGAAFVN RPTFFGGGAK LVHPTVNIRK NVDLGSVEDT TPLNAPTRLA TNVRLMKSWA PHTVSNFATS LVSRAPLPDT FSDSRDHFFK HGNVLCRKEK ADFFLPGEAF SVCTTVLDAT VVGMDTFTIV YNGGSSTATP VAHGGLVHDT DVTRLEYANG QVMITTTIVD ENFASQHSEI EIDEMVAVTI LQNIGEIRQS SRHLIHYHGL PRKGDDIQAR PNLRRLQPGL ISNDSLTPAE NSTRLVPSPP SPTTRTVSYP DWKILEWLDE LPTGMAYFIW VVLAAIVLIT GFCCVLFSVL PLIYRLESFR AHRAEQLAIA RACAKMDVFT NENLTQCFGS NWYNLYIDGT LPLEAIKFDD GLLRVERNRK RYERMME