Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35587 |
Symbol | |
ID | 7200805 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 293266 |
End bp | 294951 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | hypothetical protein |
Protein accession | XP_002180009 |
Protein GI | 219118475 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.713211 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGTTTC TGTTTGTGTG CGTCCTATCC GGACTTTGTG GGACGACAAA CGCCTTTGTG CCGAGCGTGC GATCGGCAGC TTCACTTCAC GGTCGCGACT CGCTGCAAAC CCAGGCCGAA AAGGTATCCC GCAGATGGTC TAATCTTAAC GATGTTGAAG ACGACTTATT CGATCCCTTG CGTGATCGTC TGTCGAACAT GAACGCTACT ATCGTTGCTC CTGAAGCGAC GGAAACGGAT GTGGTGGACA ACACTCTCTT GAATTTGGTC GTTGATTCTG ATCGGGAGCC GCACCCCTTG GGCTGGAAAC GACGTCTGAC GCAAGCCCTG TCCGGTCGGA AAGCCGGTAG GCAACTGGAC AAGCTGATTT TGAGTACATC TATCCCTTCT ATGATTAATC TGGCCGTAGT GCCACTCGTC AATTCTGTTG ATACCTTTTG GGTGGGTCGC ATGGGGAGCG CACTGGCCTT GGCCGGACAG GCGGCGGCAA ATCAGGCCTT TTTTACCATA TTCTTCCTAG TGAACTACTT GCCAACAATT ACCGCACCTT TGGTAGCGTC GGCCGTCGGA TCCGGCAACC AAGACGAAGC GCGGGCCAGG GTCTGCGAAA GTCTCTTCTT ATGCAACGTC TTGGGATTGA TGGGCACATT AAGTCTGACG CTCTTTCCGC AGTGGGGTTT GAGCATGGTG CTGCAGGATG GCGCTCCCGC TATGGAGTAC GCGGTTCCCT ACTTACGCCT TCGAGCTTTG AGTATGATGC CGGCTCTGTG GTCGTCGAGC GGCTTTGCAG CGTATCGCGG ACTGCTAAAC ACGGTGACGC CGCTCAAGGT CAGTCTGGCA ACCAATTTGG TGAATCTGGT GCTGGATCCA CTCTTTATCT TTCGGACACC ACTTGGTTTT GTCGGAGCAG CGCTAGCGAC GGCCATTTCA GAAACGTGCT CCGGAATAGT CTACTTACGA CTATTGATGA AACGTCAACT TGCCAGTATC AAACTTCTCT TGCGCCCGCC TTCGATGAAG GCATTGATGC CCCTGCTTCA AGGTGGCGCT TCCATGCTGG GACGCCAATT AGCCTTGAAC GTTGGATTTA TTTCGGCTGC CCGTCGTGCT CAGAGCATGG ACCCGTCGGG TGTGTCGGCC GCAGCTTATG GTATAGTAAT GCAAATGTAT TCGGTGGGAA TTGTGGTGCA CGTCGCAATG CAGGGAACGG CCGCCGCACT GGTACCATCT ACATTGGCAA GAGAAGGCAA GGATGCTGCC CGGAAAGTGG CCGATCGTGT CATGATTTGG GGTTCCATCG TTGGCGTTTT GTTAGGATTG ACCCAGTACT TGGCGCTTCC CTTTCTGGTC CCTCTCTTTT CAACTCTACC AGAGGTACAA GAAGCCGTCA AAGTTCCCGC GCTACTAGCG GCTCTCTTGC ACGTTATCAA CGGTGCAGTG TTTGCTGGAG AGGGGACTAT GCTCGGACTA GGATCCTATC GCGACTTGAT GCTTTTAACG GCAGGGAGCG TTGGGGCACT CGTGGGCTGC TTGTCCTCGC CGCTGGGTGC GAGATTGGAC GGAATTTTGC TGTCCATGAT GGTGTTTTGC GGAACTCAGG GCATAGCTGT TGTAACGCAC TACCTCAAAT TTGGCCCTTT AGCCGTTCGG CGGCAGAGTA AGCGAACGCT CATGCCTAAA CTATAG
|
Protein sequence | MRFLFVCVLS GLCGTTNAFV PSVRSAASLH GRDSLQTQAE KVSRRWSNLN DVEDDLFDPL RDRLSNMNAT IVAPEATETD VVDNTLLNLV VDSDREPHPL GWKRRLTQAL SGRKAGRQLD KLILSTSIPS MINLAVVPLV NSVDTFWVGR MGSALALAGQ AAANQAFFTI FFLVNYLPTI TAPLVASAVG SGNQDEARAR VCESLFLCNV LGLMGTLSLT LFPQWGLSMV LQDGAPAMEY AVPYLRLRAL SMMPALWSSS GFAAYRGLLN TVTPLKVSLA TNLVNLVLDP LFIFRTPLGF VGAALATAIS ETCSGIVYLR LLMKRQLASI KLLLRPPSMK ALMPLLQGGA SMLGRQLALN VGFISAARRA QSMDPSGVSA AAYGIVMQMY SVGIVVHVAM QGTAAALVPS TLAREGKDAA RKVADRVMIW GSIVGVLLGL TQYLALPFLV PLFSTLPEVQ EAVKVPALLA ALLHVINGAV FAGEGTMLGL GSYRDLMLLT AGSVGALVGC LSSPLGARLD GILLSMMVFC GTQGIAVVTH YLKFGPLAVR RQSKRTLMPK L
|
| |