Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45348 |
Symbol | |
ID | 7200036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 915815 |
End bp | 917456 |
Gene Length | 1642 bp |
Protein Length | 481 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179535 |
Protein GI | 219117481 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAGAATGTGA GCTACTGGGA GATGTGGTGC GCCCAACAAG AACATGGATT CGACGAGCCA ATAGGGTCTG GCACCAAACA ACAATACCTT ATGGATAGTA GAGTCACCGT CGCAGAGTTT CAAACAACGA AGGTCTCCTC CCCAGAGGCC TCGCTGTACG GGACTCTTGC AACATGAACA GTGTCGTAAA GCCTGATGGA GGGGAATCGC ACCCCCTAGA GGAATCCCGC TCCCTCGAAG CCATGTTGAG CGGCACTACA AGCGTCGAGA CGTTCTTCCA AACGTTTTGG CAAAGAGCCT GCGGTTACTT TCCCAATACG TTTCTGGACA GTCCGCCGAA AGCAGAAAGC ATGTCACGCT GTGCTTGGAA CAAGGAACGA GTTGAGCAGA ATGCGTATCA CGAACTTGTG CGAAACGGGT GGAGTGTTTT GGTACAGCTT CTGGAAACGA GCCGCAATCG CCCGGAGCAC GACGCCGACC TTTCCCATCA ATCGATACCT CTCCTATTTC GAGACCAAAC GACGCTCACT CTTGAAGAGC AAGTCTTGTA CGACGACAGT CTTTTTGCTG CGTTTCTGGA CGGATGCTCG GTAGTGACGA ATCATGCGGA TCGACGATCC CCCTGGATCG CGGCGCTCTG CGAGGATTTG CAAGCATCTT TTCCCCACGT CTACGCCAAC ACATATTTGA CGCCACCTGG TTCCCAAACA GTTCCTGCCC ACGCGGACGA CCGAGATGTT TTCGTTATAC AACTCGTCGG TTGCAAAGCT TGGAAGATTT ACCGAAATAT CCCCGTGCCG TATCCGTATA GTCACGAGCA AGTCGGCAAA GGGGAACTGG AGGTGCCCGG CCAAGTTTTG GACGGACCAG TATTGACCGA TCGAGTACTC GCACCAGGAG ATGTCTTGTA TATGCCTCGA GGTTACGTGC ACGAGGCCCA TGCGGTCGAC GGCGGACCCA GTTTCCACGT CACGGTGGCG TTAGCGACGC AGGACTGGAC TTTGGCGGGT CTCGTCACCG CCGCTACCGA AGCTTCCTTA ACGCAGCAAC GCAGCTATCG CCAGGCCGTG CCTCGGTGTT TTGGGCGACG GTCGTTCGAA TCTATTGCAG TCGATGACAA ACAAAGTTTA CAGAAGCAAC TGGATGACGC GTTTCGGATA CTCCGGGAGA AGATTACGGT AGAGTCCGTA CACAATAATC TACGGACCAA GTTTGATCGT CACAATCAAC GCGCCCTGGA AACTCGTCGT CGACTCGTAT TGGAGGCGTC GCTAAATGCT CTTTGCGTGG AAAATACGTT ACCTGGAGGT GTGGTGGGAA GGCAGGCCGC GGATGGAGTC CTTTGGACAA CCGCGATACG ATGCAGCACT GTGGACGAAA GAGCGACCCT ACCTATTTCA GCAAAGCCAC GCGGTTTGAA CGTACGGGAA GAGTGCTGCG ATTTTATTAT GGACATTCTA CAACTTGTCA AGCGAAATTC CGGCAAAGCC TACAGGGTAA GCGAGCTCCG GGCGCTCCTG CCGGCAACCA GTAACGCAAT CGCTGTATGC GACTTGACAA TTTTGTGCTT CGCGAAACAA TGCGTTGAGT TGGGAGCGTT TTCCGTAGTT CGCATGTAAT AGCCAAATAT ATATTGTGCC CG
|
Protein sequence | MNSVVKPDGG ESHPLEESRS LEAMLSGTTS VETFFQTFWQ RACGYFPNTF LDSPPKAESM SRCAWNKERV EQNAYHELVR NGWSVLVQLL ETSRNRPEHD ADLSHQSIPL LFRDQTTLTL EEQVLYDDSL FAAFLDGCSV VTNHADRRSP WIAALCEDLQ ASFPHVYANT YLTPPGSQTV PAHADDRDVF VIQLVGCKAW KIYRNIPVPY PYSHEQVGKG ELEVPGQVLD GPVLTDRVLA PGDVLYMPRG YVHEAHAVDG GPSFHVTVAL ATQDWTLAGL VTAATEASLT QQRSYRQAVP RCFGRRSFES IAVDDKQSLQ KQLDDAFRIL REKITVESVH NNLRTKFDRH NQRALETRRR LVLEASLNAL CVENTLPGGV VGRQAADGVL WTTAIRCSTV DERATLPISA KPRGLNVREE CCDFIMDILQ LVKRNSGKAY RVSELRALLP ATSNAIAVCD LTILCFAKQC VELGAFSVVR M
|
| |