Gene Information |
Locus tag | PHATRDRAFT_47468 |
Symbol | |
ID | 7202584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 705787 |
End bp | 707403 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181789 |
Protein GI | 219122930 |
COG category | |
COG ID | |
TIGRFAM ID | |
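The length fields above are consistent with the coordinates. A minimal Python sketch of that arithmetic follows. It assumes 1-based inclusive coordinates and a single terminal stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(705787, 707403)
print(length, protein_length(length))  # prints "1617 538"
```

Both results match the Gene Length (1617 bp) and Protein Length (538 aa) fields, as expected for a gene that is not spliced.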
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.204515 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAGAGC CTACCGATTT GCCCTTGGTA AGCCAAATTC CACCATTACC AAAAGAATGG GCTCGATGCC ACGCCTACAT TGATAATAAG CGGCGATATT GCCGACAACA TCCTATCGAA TTCGAAGACG CGCCGGCAAC GGATCACAAA GCTGATACCG GCAGACCTCG CTACTGCGGC AACCACCAAC ATCTTCTGGC CAATCGCAAA CGAAAACGTA TACCCTGTCC AGCCGATGCA TCTCACTCCG TATACGAAGA TTGCGTAGCC AAGCACCTGT CCGTCTGTCC AATGCTGAAG AAACAGAGAG GGCAGGAGAA GCAATCTTTC TACCGGAAGA ATATAAATAC AGGTGGTTAC GGCCCCTTGG GCGAGATCGA AGAAATCGCT ACCAGCCAAA TACGGTCAAA GTTGGATCAT GAAATGAGTC AATTCGCTGA ATCCGTATTA AGGTTACATC AGAAATTGTT TGCCGGCGAC AATATTCAAG ACCCATCGGT CCTCTCCGCT GAGGATATTC ATCAAGCGAT ACCTACGGAG GATCTATCCT CTGCAGAATT TGACGCAGGC TTAGCCCAGG CAGTTTCCGA CTATCGGATT AAGTCGGGTG GGCAAAAGCA TTTGCACCAG CAAGCCAGCC TCGTGGGACA CTTGCGCCGT ATCGGGGCGC TGCCATCGCT TTCCAAAGGG TCACGCCATA CAGACAAAAT CAAGTCAGTC GGGAAACACT TGATCTTGGA AGTCGGTGCG GGTCGTGGGA TGACCGGCTT GGTCGCTGCC GGTGTTTCTG TAGTCCACGG TGACCCAACA GATCTAATCA TGATTGAACG CGCTGGATCC CGAGGCAAGG CCGATACTGT GCTGCGCAAT GCCCCAATAT GTGTCAAACA GACCACGTCG GATGTCCCAC CGTATTTGGA TCTCAAAGGG CCTCTATCCT GGTCGCGTGT GCAATGCGAT CTTTCCCATC TCAGCTTGTC ATCCATCCTT TTGAATGCAG ATTTAGAAGA GAACGACCGT GTTACGGTGT TGGCGAAGCA TTTATGCGGT GCCGGCACAG ACCTGGCTCT GAAGGCGTTG GAACCAGTAA AGACGTCGAT TTCCTCCTGT ATTTTGGCAA CATGCTGTCA TGGAGTTTGT AACTGGCAAG ATTACGTCGG AAGAAAGTTC TTAGTGGACG CTTTCCAGAA AAACTGCCCG TCACAACCGT TCGGAGCCTT CGAGTTTGAG CTACTTCGGC GATGGAGCAC CGGCACAGTC AAGGCAGGGG CGGCAGAGAA TCGCCTTGGC GAAAACCGCA TAGAAATCGC TGATTCCAGC CTTCAAGAGC ATGGTCTCGG CACCGTCTTG GTCGAGAATG CATCTGCTAA TATTTCCAGA ATTGTGGAAG CGAGCCGTCT ACGATGCGGA GCCCAGGGTC TGGGCCGCGC CTGTCAGCGT CTCATCGATT ACGGACGGCA GGAATATCTC CGTGGGGTTC TGTTTCCTTC CGCGAAACAT GCAAATGGAT CAGTCGAGAC GACAAACATT GACATGCTGC ACTATGTTGC TCCCGGTGTT ACTCCTCAAA ATGCTGCACT GGTAGCATTT TCCGAAAAGA CAGGCCATAT ACCATGA
|
Protein sequence | MGEPTDLPLV SQIPPLPKEW ARCHAYIDNK RRYCRQHPIE FEDAPATDHK ADTGRPRYCG NHQHLLANRK RKRIPCPADA SHSVYEDCVA KHLSVCPMLK KQRGQEKQSF YRKNINTGGY GPLGEIEEIA TSQIRSKLDH EMSQFAESVL RLHQKLFAGD NIQDPSVLSA EDIHQAIPTE DLSSAEFDAG LAQAVSDYRI KSGGQKHLHQ QASLVGHLRR IGALPSLSKG SRHTDKIKSV GKHLILEVGA GRGMTGLVAA GVSVVHGDPT DLIMIERAGS RGKADTVLRN APICVKQTTS DVPPYLDLKG PLSWSRVQCD LSHLSLSSIL LNADLEENDR VTVLAKHLCG AGTDLALKAL EPVKTSISSC ILATCCHGVC NWQDYVGRKF LVDAFQKNCP SQPFGAFEFE LLRRWSTGTV KAGAAENRLG ENRIEIADSS LQEHGLGTVL VENASANISR IVEASRLRCG AQGLGRACQR LIDYGRQEYL RGVLFPSAKH ANGSVETTNI DMLHYVAPGV TPQNAALVAF SEKTGHIP
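The sequences above are displayed in space-separated blocks of ten characters. Before any downstream use, the blocks usually need to be joined into one contiguous string. The sketch below shows one way to do that and to compute a GC fraction; the helper names are hypothetical and not part of this database.

```python
def join_blocks(display_sequence: str) -> str:
    """Remove the whitespace that separates the display blocks."""
    return "".join(display_sequence.split())


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (case-insensitive)."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# Only the first two blocks of the gene sequence, as a short example.
example = join_blocks("ATGGGAGAGC CTACCGATTT")
print(len(example), gc_fraction(example))
```

Applying the same two helpers to the full Gene sequence field gives a value that can be compared against the GC content field of the record.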