Gene Information |
Locus tag | PHATRDRAFT_37726 |
Symbol | |
ID | 7202603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 808849 |
End bp | 809877 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | |
GC content | 60% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181808 |
Protein GI | 219122970 |
COG category | |
COG ID | |
TIGRFAM ID | |
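The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length of an unspliced CDS includes the stop codon; the record itself does not state either convention. The function names are illustrative and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
gene_length = feature_length(808849, 809877)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1029 342
```

Under these assumptions the computed values agree with the listed Gene Length (1029 bp) and Protein Length (342 aa).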
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0000639291 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCTTGTGT TGTACGACTA TAACAGTAAT GCCATCCATG TTGAACTCAT GAAGAGCAAG TCCGGCGCCA AGATCCTTGC CGCTTACCAA CGCGCCCACT CACTCTTCAC CCAACAAGGC CTCCAGCCTC AAATCCAGCG TCTAGACAAC AAGGCGTCTA CCGCTCTCCA GTCCTTTATG ACGGCCAACC AGGTCGACTT CCAGTTGGCT CCTCCCCATC TACACCGTCG CAACGCCGCC GAACGCGCAA TCCGTACCTT CAAGAATCAC TTCATTGCCG GTCTATGCAG TACGAACCCG GATTTTCCGC TTCATCTTTG GGATTGCCAC ATTCCACACG CTATCCTTAC CCTCAATCTC CTCCAAGGCT CCCGCATCAA TCCTACCCTC TCGGCCCATA CCCAACTCCA TGGGGCTTTT GATTATAATT GCACCCCACT TGCCCCTCCC GGCACTCGCG TCCTTGTCCA CGAAAAGCCC GCCGTTCGGG AAACTTGGGC ACCCCATGCT GTTGAAGGCT GGTACCTTGG CCCTGCCATG AACCACTACC GCTGCCATCG CGTTTGGATC ACAGAGATGC GTGCCAAACG TGTTGCCAAC ATGCTTGCAT GGTTCCCCAG TAAGATTCCC ATGCCCACCG CCTCCTCCAC TGACCGCGCC CTGGCCGCCG CCCGTGACTT AGTGTGCGCC CTCCAGAATC CCTCTCCCGC CTTGCCGTTT ACGCCCCTCG ACGCCAACCA ACACCAGGCC CATACCCAAC TTGCAGAACT CTTTGCTTTG GTTGCTGCCC GGGGCCTCTT CCGCAGCCGC ACCCGCTCGA GCGCCCCCGG TCCCGCCCCC TGTCTCACGC CTACCCCTGC TCAGGTCCGC TTTGCCGTTC CAATTGTCAC GGCCGAGCAT GCCCCTGCCC TTCTGAGGGT GCCCACCCTT GCGCCCCCAT CTCCGAGGGT GCCCCCCACA GCCACCTATC ACTCTCGCAC AGACAATCCC GGCCGCCGCC GTCGCAAAGC ACGCCAGCCC CAACCCTAG
Protein sequence | MLVLYDYNSN AIHVELMKSK SGAKILAAYQ RAHSLFTQQG LQPQIQRLDN KASTALQSFM TANQVDFQLA PPHLHRRNAA ERAIRTFKNH FIAGLCSTNP DFPLHLWDCH IPHAILTLNL LQGSRINPTL SAHTQLHGAF DYNCTPLAPP GTRVLVHEKP AVRETWAPHA VEGWYLGPAM NHYRCHRVWI TEMRAKRVAN MLAWFPSKIP MPTASSTDRA LAAARDLVCA LQNPSPALPF TPLDANQHQA HTQLAELFAL VAARGLFRSR TRSSAPGPAP CLTPTPAQVR FAVPIVTAEH APALLRVPTL APPSPRVPPT ATYHSRTDNP GRRRRKARQP QP
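The sequences above are printed in blocks of ten characters, so the spaces have to be removed before any analysis. The following is a minimal sketch of how one might clean such a string, compute its GC fraction, and check that it looks like a complete coding sequence. The stop codon set assumes the standard genetic code, which is an assumption because the Translation table field in this record is empty; all function names are hypothetical.

```python
# Assumption: standard genetic code (the record leaves "Translation table" blank).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, it starts with ATG, and it ends with a stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# Only the first two blocks of the gene sequence, as a small demonstration.
first_blocks = clean_sequence("ATGCTTGTGT TGTACGACTA")
print(len(first_blocks), round(gc_fraction(first_blocks), 2))
```

Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field listed in the record, and `looks_like_complete_cds` is a quick sanity check on the start codon, the stop codon, and the reading frame.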