Gene Information |
Locus tag | PHATRDRAFT_44687 |
Symbol | |
ID | 7197921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 1264259 |
End bp | 1265697 |
Gene Length | 1439 bp |
Protein Length | 426 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178674 |
Protein GI | 219115757 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGGAT CCCAGGCATC GCCTCCGTCC GAGTCCGCCC TGTTGCAGGG CTTGATCGAA ACGTCCCGCA AGGTGGAAGC CTCCCTTCTA CCGGCCATTC ACGCACACCA CACGACGCCG GATCAATTCC ATCCTTCGCA GGGCTTGGAC TTTCTCGACG CTCGCAACAC CGTCCTCCTC TCGTATTTGA TCGAACGGAC CTTGGCCTTG CGACACCGAC TCGTCGAATC GCCCACGATG TACGACGGTG ACAATGTGAA TGCCCTACAT CAACACCAGC ACCGACTGCG CTTGGTCACC ACCGTTCTCG ACAAGACGCG GGGACTCGAC CAGAAACTTC GTTACCAGAT TGACAAACTC TTGGCCAAGG CGGCCCAGGA TGACACTGTT GATCATCCCA ACGATTCAAA CAATGACAAC GCCATGGGTC CCGAAGATCC CTTGCAGTTT CGACCCCGCG TCGAAGACGA CAGTGACGAC AGCAACAGGG ACAGTCCTGA CGACGGTAGT ATTCACAGCG ATAACGACAA CCACAATGTC GACGACCGTA TCCACAGTGA CGACGATGAC GATTTGGCGG CGGCGCGAAG AACCCTTGCC GTGGCTCAAA CCAAGAGTCA CAAAAATTCT CGTACCACCG ACCCCGACCA AACGAGCAAC AACAACAACG ATAGCACAGG CACAGGCGTC TATCAGGCAC CGCGCGTCGC CGCGGTCCCT TACACGCACG ACCGCGTTGA CCGGGACGCA CAGCGTACCC AGCGGGATCG CCAGCGTCAA CGCGCTTCCG AATTGGCCCA GACACTCCGC GAACAATACG GCGACGCCCC GGAACAGCAG GACGTGCACG GCGGTACGGA ACTCGGCCGA CAGCGCGAGG CGGCGCGGAG ACACGCCTCC CGACAAGCCG AACAAACACG CTTCGAAGAA GATACCATGG TGCGACTCAC CACCACACGT AAACAAAAGA AGGAGCGACA GCGACTCATG CGCGACGAAA CCAGTAACCT GGGGGCTATT GCCGATCTCG GGAATCTCGT GCGCGACTCG ACTTCGGCGT GGGGACGCGA CCGAAATGTG GAAGAGCCCG TGGACGATAT TCTCGGATAC GAACGACACG CGAACGGCAA ACGGAGACGA AAAATGATTG ACCGCGACGG GCGGGCCGTG ACCGATCAGT CCGCCAAAAG CAAGATCGTG GACGCCAAGA ACTCGCTCCA GTCGGCGTTG TACGGTGGTG GACCCCGGAC CGGGAAGAAG GGAAAGAAGG GCAAGCGCTA GAAAGGTCGT GGTGGATGGG CGCTGTGCCG GGTGCCACCA ACCACCATGA CATGAACCTC TACCACCAAA CCGTAGTGAT CATTACAACC ATCACTACAC TATCTAGCAA GCCCACGCGT ACGAATAAAT ACGCACTCCT TACTACGAAT AAAGCCAAT
|
Protein sequence | MAGSQASPPS ESALLQGLIE TSRKVEASLL PAIHAHHTTP DQFHPSQGLD FLDARNTVLL SYLIERTLAL RHRLVESPTM YDGDNVNALH QHQHRLRLVT TVLDKTRGLD QKLRYQIDKL LAKAAQDDTV DHPNDSNNDN AMGPEDPLQF RPRVEDDSDD SNRDSPDDGS IHSDNDNHNV DDRIHSDDDD DLAAARRTLA VAQTKSHKNS RTTDPDQTSN NNNDSTGTGV YQAPRVAAVP YTHDRVDRDA QRTQRDRQRQ RASELAQTLR EQYGDAPEQQ DVHGGTELGR QREAARRHAS RQAEQTRFEE DTMVRLTTTR KQKKERQRLM RDETSNLGAI ADLGNLVRDS TSAWGRDRNV EEPVDDILGY ERHANGKRRR KMIDRDGRAV TDQSAKSKIV DAKNSLQSAL YGGGPRTGKK GKKGKR
|
| |
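The derived fields in this record (Gene Length, GC content) can be cross-checked from the coordinates and the gene sequence. The following is a minimal sketch, assuming 1-based inclusive coordinates, which is consistent with the Start bp, End bp and Gene Length values listed above. The function names `gene_length` and `gc_percent` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(seq):
    """Percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are ignored, so the sequence can be
    pasted in the space-separated 10-base blocks used in this record.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Coordinates taken from the Start bp / End bp fields of this record.
print(gene_length(1264259, 1265697))  # 1439
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared against the listed GC content of 58%.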