Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48536 |
Symbol | |
ID | 7194778 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | - |
Start bp | 144986 |
End bp | 146669 |
Gene Length | 1684 bp |
Protein Length | 486 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183163 |
Protein GI | 219125806 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0226991 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAAACCGATC GAACACTTTT CGTGAATGGA AAGATCTATT GTTTTTCATT TGCCATTCTC TCGTCCTTCA TTCACTCTAA AGGTTCACGA TTAGTTTCCA TTCGTAGACA TCTCGAAGCA GTGTTGCTGA CTTTGCGAAC ATGGTTGGAT GGTGGTGGGC CCCACAAGCG ACTGCAGTCG CGTCTTGTGA ACCTCCAGCT GGGAAGACCT ATTTGACCAT AGGCCAAGAC CTGGCTGCGA TTCAAGACTA CGTCACGGAT CAGTTCTCGT ACAGTTTGCA CCGAGCTCAA ACAAGGCAGA AGAAGGGTTT GAAATTGTGA GTCATCACAC CGTACAGTGC CGTATTTGCA GTTACTAGAA ATAGTTCAAT GCAGAAACGA TATGCGACTC GACTGGTATC GCGTTGCTCT TTCTCACGTC TTAACCTTTG CTCAATTTTT ACGCCCTTAC GTGCCTAGGA ATGAAATCGC CGTGGGACCG ATTCCGCCGC TGACGGTGGC AGACTTTTTG CCAGCCGCAA CCATGGTCTA CACGGACATC CAGACGCTCA GCGGTTTGAA AACACCGATC GACTACGGTA GCGGCGTCCA GGACGCCGTC GGCGTCACTC TGGAAGGACA GGTACCTGGT CTCCAAATTG GCTTGTGGTT AAACGGCACC ACCGGATGTG CGGATATTAT GGCGGGAAAG TTAAAGGCAG AGATTGACTC GTTGATGACC TTTCTGACAA ACGAAAGCAA CGCCACCAAG GTCTTTTTGC GCGTTGGATA CGAGTTTGAC AATCCAGGTT TTGGGTACAA TAGTGATCCG GCGCTCTACA CGAAAGCCTA CGTCAAGATT GTCAACAGTT GCCGTTTCTG GCCGGCTTGC CGCAACAAGG TCATCTTTGT TTGGCATTCG TGGGGAGCCG GTTTACCCGC GAATACCACA CTAGCCGACT TTTATCCTGG CGATGATGTC GTCGATTGGG TCGGGGTGAG CATTTTTTCC CAGTTCTACC AGCACTATCC TAGTCTCGGC AGTATCTCGA CATTGAACAA CGTCTTGGAC TTTGCCAACT TCCACAACAA ACCCACTATG ATCGCGGAAT CGACGCCGTT TGGAGGGGTT CATGTCTTGA AAGACCCCTG GAGAGATTGG TATCATCCAG TACTGAAAAT AATCAATGCG TACGATATTG GTATGTGGAG CTACATTGAT TGCGACTGGG ACTCTCTGTC TATGTGGAAT GGAACGGGTT TCGGAGACTC GCGCTTGGCG GCGAACCAAA CAATCACACG CCTTTGTCAA AAATATGTTT TGAAGAATCC ACGTTTCGTT CAGCACGCAG GCCTTTGTGC CGTCGGCCCC GCCAAAACAG GTACACGAGT CTCTAAAAAG GGTGGCAAAA AGCCATTTGA TTGGACAGAA TGGATGAGTA AGGGCAAAAA AGACAAGATG GATAATGTAG ACGAGGGCAA CACATTTGGA TGGGATAGCG ACGGCAGCAG TGGACGCTTA GATCTTTTGG CTGCGAGGGG GAAGGATGGT TTTCTTATAG CTAATGGTTC GTCCTTTTTC TTTGTGGGGG GCATCCTTAT TGCCTTGCTT GCCTCGCTTT GGACTCATCG ACGACATCGT CGCCGAGGAT ACGAAACCAT CAATGAACTG TTGGTGTAGA ACTAAATGGA TTTAATGGAA ACTT
|
Protein sequence | MVGWWWAPQA TAVASCEPPA GKTYLTIGQD LAAIQDYVTD QFSYSLHRAQ TRQKKGLKLN DMRLDWYRVA LSHVLTFAQF LRPYVPRNEI AVGPIPPLTV ADFLPAATMV YTDIQTLSGL KTPIDYGSGV QDAVGVTLEG QVPGLQIGLW LNGTTGCADI MAGKLKAEID SLMTFLTNES NATKVFLRVG YEFDNPGFGY NSDPALYTKA YVKIVNSCRF WPACRNKVIF VWHSWGAGLP ANTTLADFYP GDDVVDWVGV SIFSQFYQHY PSLGSISTLN NVLDFANFHN KPTMIAESTP FGGVHVLKDP WRDWYHPVLK IINAYDIGMW SYIDCDWDSL SMWNGTGFGD SRLAANQTIT RLCQKYVLKN PRFVQHAGLC AVGPAKTGTR VSKKGGKKPF DWTEWMSKGK KDKMDNVDEG NTFGWDSDGS SGRLDLLAAR GKDGFLIANG SSFFFVGGIL IALLASLWTH RRHRRRGYET INELLV
|
| |