Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39727 |
Symbol | |
ID | 7195445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 554516 |
End bp | 555751 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183631 |
Protein GI | 219126788 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGGAC CCCATTTCGA ACCATGGTTT ATCCTGGCTA GTCGTCCCTC GACAGCGACG GACGAGCAAA GTGCGGGACG AAAGGCTCGC TATGCGCTTG GTCTCACTTT CATCATGATG CAGTGCATCG TATGGATCTG TGCATCCGTC ACAACTCAGT ATTTATACGG AGGACAAGGC TTTCATTCGC CGTTTCTCAT GACCTTTGCT GGTGTTGGTA TGTTGGCCAT ATTTTTGCCG TTGCGACTTC TAGCTGTTCG AATTGGGATA GCTCCGAAGC TCCTCAAAAG TACAGAGGAT GCAGATCCTG CGGTCAACAA TGGTGTTGGC AATAGTCACG ATGAAAAACT CGCGCAAGCC ACATCATACC ACCAAGTTTT TGATGCTGTT GCCTCCGAAC GGCGTGAGCT GTCCCATCCT ACAACGTTCT GGAATCATCG CAAACATGCT TTAGCTGCGC TTCACATTGC ACCCGCCATG TTCTTTGCCG ACTGGTGTTT CAATCACGGG CTAGCATACA CATCTGTCGC TTCAAGTACG GTTCTAGTTT CCACTTCCTG CGTCTTCGTT TTCTTGTTCG CTGTTCTGGT GCGAGTCGAG GCCTTTCACT CTGTGAAACT TGCTGGCGTA CTGCTCGCAG TGGCGGGTAC CGTTTTAACA ACGATGGGCG ATATTTCCGT CAGCGAGGAA TCTAGCGGTG TGGATGCCGA AAGACATGTT TTGACAGGCG ATCTCTTCTC CCTCATGGCA GCCATTGGCT ACGCATTTTA CACTGTACAA GTCCGTGTTT TGTGTCCTCA AAACGAGGAT CTTTACAGCA TGCAGCTATT GCTCGGCTAT GTTGGTGTAG TTGCCACCAT ACCGCTTCTA CCCGTTGCGT GTTACGCTTT GACGCAAGTC ACATTCACGC CAAAAATAGC CGCCGTTTTG GTAGTCAAGG GACTGTTGGA TTTTGTTATT ACGGACTATC TGTTATTTCG CTCCGTAATT TTGACCAACG CAACGACGGC TTCCGTCGGC TTGGGATTGA CGATCCCCTT GGCTTTTTTG GTCGACTGGG TCTTGGGCAA GGGCAACGCA ACGACCATTC AATCCTTGCT TGGACCAGTA GCCATCGCTA TCGCCTTTTT GATAGTGAAC CTTACTGGCA ACTCGATAGA CGAGCGGGAA CAGAATATTC ACGACACAAA TACACCATCG ACTGAGAATC CGCAATCGGC AGGAGTTTTT GCATAG
|
Protein sequence | MQGPHFEPWF ILASRPSTAT DEQSAGRKAR YALGLTFIMM QCIVWICASV TTQYLYGGQG FHSPFLMTFA GVGMLAIFLP LRLLAVRIGI APKLLKSTED ADPAVNNGVG NSHDEKLAQA TSYHQVFDAV ASERRELSHP TTFWNHRKHA LAALHIAPAM FFADWCFNHG LAYTSVASST VLVSTSCVFV FLFAVLVRVE AFHSVKLAGV LLAVAGTVLT TMGDISVSEE SSGVDAERHV LTGDLFSLMA AIGYAFYTVQ VRVLCPQNED LYSMQLLLGY VGVVATIPLL PVACYALTQV TFTPKIAAVL VVKGLLDFVI TDYLLFRSVI LTNATTASVG LGLTIPLAFL VDWVLGKGNA TTIQSLLGPV AIAIAFLIVN LTGNSIDERE QNIHDTNTPS TENPQSAGVF A
|
| |