Gene Information |
Locus tag | PHATRDRAFT_15081 |
Symbol | |
ID | 7203799 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 147515 |
End bp | 148741 |
Gene Length | 1227 bp |
Protein Length | 368 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182780 |
Protein GI | 219125004 |
COG category | |
COG ID | |
TIGRFAM ID | |
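The Start, End and Gene Length fields above are consistent with inclusive, 1-based coordinates: 148741 - 147515 + 1 = 1227 bp. The following is a minimal Python sketch of that check, together with a GC-percentage calculation for a sequence written in space-separated blocks like the one in the Sequence section. The helper names `span_length` and `gc_percent` are hypothetical and are not part of the source database. How the database itself derives the reported 51% GC content is not stated in the record, so the second function is an assumption about the usual definition (G plus C over all A/C/G/T bases).

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T bases.

    Whitespace (such as the block separators in the record) is ignored.
    Assumption: ambiguous bases such as N are excluded from the total.
    """
    bases = "".join(sequence.split()).upper()
    counted = [b for b in bases if b in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in counted if b in "GC")
    return 100.0 * gc / len(counted)


# Coordinates taken from the record above.
print(span_length(147515, 148741))  # 1227

# Toy sequence for illustration only, not the gene sequence from the record.
print(gc_percent("ATGC GGCC"))  # 75.0
```

To compare against the reported GC content, pass the full Gene sequence string to `gc_percent` and round the result to the nearest integer.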
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCAGCAGC TCGAAAGTCT CGCTTCGCAG CGTCCGACAC CGCTGCGTCT CGCTGACATG TACGAGTATG GGCGCGGTAT AGATCCGGCG CAACGTCTCC GCAATTCGCA GTTCCTTCAT CGTGAATTGC CCATTCGCGT CGCCCAACGC GCCTACGACT TACTCACTTT GCCACACGGC TTGTCCAACG CCACACCCAT TCGACAAGTC GCCGCAACCT ATATACAATA TCTCCAGCAG TTTAAATCAC GACCCTGCCC GCAGAACAAG CCACAGGAAG AAGAGTTTAC GGATTTTGTG CAATCGCTGG TGCTGGATCG CGCGGCCGTT CCGATTTCGA TTTTTCGGGG TATTTTGGCT TGGATGGGAT CAGCCCCGCA TAGCGACGAT GACCGCCAGG ATCAAACTTC CTCATTGTCG TCCTTGGAAG AGCAACCCGA TCGGTTGCAG GAAATGGAAG ATGCCTTGTA CAGGTTTTTC ACCGCCCGAG TGGGGTTGCG GTTTTTAACC GTTCATCACG TATTGTCGTC CCGCCGTCCG TCGGCTAAGG CATTGAAAGA TGTCACGTTT TTGTTCCCAC CGGACCAAAG CGATGATTTC TTGGGATGTA TTCAAACTAA CTGCGATCTT GTTAAGGAAG TTAACAAGGT TGCAAAGTTA ATTCATGAGC AAACAATGGA ATATTACGGT ATCTGTCCAG AAATTGAAGT AGTGGACTGC ATCGAGGATA CGGATCAGGC AAAAGACGGC AAGAGCAAAA ACAAAACGCG AGACTTTACC TACGTTCCTC ATCATTTGCA CTACATGATT TGCGAACTCT TGAAAAACTC ATGCCGGGCT ACGGTCCAAC AATTTCGTGC GCAAGAAATG CACACCCAGG GTCCATACGG ACACGATAGC GCCAAAATTC CCTCGATCAA GGTCGTCATG GTCAAGGGTG AAGAAGATGT GACAATTAAA GTTGCCGACA AGGGTGGTGG CATTCCCCGC TCCAAAATGG AGCGCATTTG GAAATTTGCA CATTCGACGG CGGACCAGAA CGAAGCGGAA TCCGATTTTG GAACGGACGC GACTAGTGGT GCACGGATTC GCGGATTTGG CTTGCCACTC GCACGTATTT ACGCGCGCTA CTTTGGAGGC GAATTGACTC TGAAGTCGAC GGAAGGGTAC GGGTTGGATG CGTACTTGCA TTTACCGCGA CTTGGAGATG CCTGCGAGAA GCTACCC
Protein sequence | MQQLESLASQ RPTPLRLADM YEYGRGIDPA QRLRNSQFLH RELPIRVAQR AYDLLTLPHG LSNATPIRQV AATYIQYLQQ FKSRPCPQNK PQEEEFTDFV QSLVLDRAAV PISIFREQPD RLQEMEDALY RFFTARVGLR FLTVHHVLSS RRPSAKALKD VTFLFPPDQS DDFLGCIQTN CDLVKEVNKV AKLIHEQTME YYGICPEIEV SKNKTRDFTY VPHHLHYMIC ELLKNSCRAT VQQFRAQEMH TQGPYGHDSA KIPSIKVVMV KGEEDVTIKV ADKGGGIPRS KMERIWKFAH STADQNEAES DFGTDATSGA RIRGFGLPLA RIYARYFGGE LTLKSTEGYG LDAYLHLPRL GDACEKLP