Gene Information |
Locus tag | PHATRDRAFT_41064 |
Symbol | |
ID | 7198865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | - |
Start bp | 266805 |
End bp | 267815 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185077 |
Protein GI | 219129818 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGATTT TTCTCGCGAC TGCTCGATTG CGAATGTCGG GGACCGTCGC CGCCTTTTCA CGAGCCGTGT CCATCCCTCG ATGTGCGCCT TTCCTCGCTC GGCGACCAAC CTTTGGTCCC GGTACAGGAG TGCGTCTCAC TACACGCCGA CCTTTTTTCA AAGTGTTCAT GTCAGCCGTT GAGGATGACG AAACGGCCTG GAACGAAAAG ACGGTGGACA GTACCTGGAA CGTCACGGGC TTGAAAAAGG AAGTCCAACG TTTGATACTG CGCTGTCACA AAAAAATCGG CAAAAGCAGT CAACGCGTAT CGCAAGCTCA GATGCAATTA GACCAACTGA TGGCGGACAA AAACGCGAGC TTGTCGGATT TGGAAAAGTG TCCCGATGTG GATGCCCTGA CTGTCGAATT GGACGAATTA AGGGCGCGTC TGCACAAATT AAACGTGCTC GAACAAGCGC TGGCGGCGGA GAAGAAGGGC ACACATCGGA CTCTTTCGGC TGAAGTAGTG GAGCTGGTGC GAGAACTAGG CGTCAATGAC GAGCCGCCGG TACCACAACC ACGGCCTCTT AAAAAGCAAA AAGGTCCGCG AGTCACCGAG TCGTCGCGCA AACCCTATCG ACGCTACTTT TCCGCTCAAC AGGTTGAAAT TCGAGTTGGC AAGCAAGCCG AAGATAATGA CGAGCTAACA ATGTCGCCGG AACATCGTGA CGGTGCCGAT TGGTGGATGC ACGCGTCGGG GTGTCCCGGA AGCCACATTG TGATTCGATG CCACGACCAG AATCTTGACG AACAGGTCGT CCAGGACGCG GCAGCCTTGG CCGCTCGCCA ATCCAAATGC AACGGATCCG TCATTAAGGT ATCTCTGACG CGGTGTCGCG ATATTGTCAA GCCGCCGGGG GCCAAGGCGG GTCTTGTGCA GTTGGTGGGA AACGTCCGTA CCGTTTCGGT CAACATGAAG GAGGCACAAG CGCGGCTGCA ACGGTTAGAC GCTACCTGTA TCGTAAACTA G
|
Protein sequence | MSIFLATARL RMSGTVAAFS RAVSIPRCAP FLARRPTFGP GTGVRLTTRR PFFKVFMSAV EDDETAWNEK TVDSTWNVTG LKKEVQRLIL RCHKKIGKSS QRVSQAQMQL DQLMADKNAS LSDLEKCPDV DALTVELDEL RARLHKLNVL EQALAAEKKG THRTLSAEVV ELVRELGVND EPPVPQPRPL KKQKGPRVTE SSRKPYRRYF SAQQVEIRVG KQAEDNDELT MSPEHRDGAD WWMHASGCPG SHIVIRCHDQ NLDEQVVQDA AALAARQSKC NGSVIKVSLT RCRDIVKPPG AKAGLVQLVG NVRTVSVNMK EAQARLQRLD ATCIVN
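The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database entry: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the gene sequence (TAG) is a stop codon that is not counted in the protein length. The function names are hypothetical helpers.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def clean_sequence(blocked: str) -> str:
    """Remove the spaces between the 10-character blocks used in the record."""
    return "".join(blocked.split()).upper()


# Values copied from the Gene Information table.
gene_len = span_length(266805, 267815)
protein_len = expected_protein_length(gene_len)
print(gene_len, protein_len)  # prints "1011 336"
```

Both results agree with the Gene Length (1011 bp) and Protein Length (336 aa) fields. `clean_sequence` can be applied to the Gene sequence or Protein sequence field before passing it to other tools.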