Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46618 |
Symbol | |
ID | 7201743 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 913099 |
End bp | 914924 |
Gene Length | 1826 bp |
Protein Length | 570 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181102 |
Protein GI | 219120740 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAATGT GAGTAGGTAC CTAACCATCC CGATCCGAAA CGAATCCAAT GCCCTGCGCC GTTCCAAACT GTACACTCTA TTGCCCCCGA TACTGCATGG CATTTCGTTC TCCCGTGATA GCTTGATAGC TATTGATGTT GAATGTAAGA GCGAAGCTAT GATGGAGACC CTTCCAAGTG GGGTCGCGGC AACACGAGCG AATTGTCTGG ACAAACGTTC GTTAGCTTTG TTTCGTATAT TGCTCGGGTT GTACTTGCTC TATGACATCT ACGCTCGCAC TTCGTTGGGA AAGTACGATC TCGCTTGGTA CACGTCTTTG CCGCCCAGTC GTTCCTTCCT ATCGGATCTA GATTTTCCGC ACCAGGCCCC ATTGCACAAG CTTTGGTTCT ACCGAGGAAC CTTGGCTTTC CAAGTTGCAA TGTTCGGTAT TTCTACGTTG TTGGCGGTCT TTTTCACGGT GGGTTTGTTT CAGCAAACCG CTAGGGTAGG CGGTCTCGTC AAGACCGTTC TATTCATTGT TCAAGTCGCG CAACAGAGTC GGAATATGCC TGGCACGGAC GGCAGCGACT CCTTTCTTCG CCACTTGTTG TTTTGGAGCT GTTTTCTGCC ACTCGCGGAC GTTTGGAGCG TGGACGAATG GCGCGCATCA CGTCGCAAAC ACAAGACGAA AAAGCCCAGT CACCGTCACT GGCAATGTAC CGGCTTGCCC TGCTTGGGAT TGATACTGCA AATTGCCTTC ATGTATCTCG GTACTGCTTT GAATCGCATT GACGTACACG GATGGAAGAA TTGGCACCAA TGCGAATGGT TGCCGCCGTC CTTATCTGCC GTCCATTACG CACTCAGCGG GAGTTTTGCC ACGCGGGACA ATTTTCTCGG CGATCTCATA CGTACCCAGC CTATTCTTTC CAAGACTATG ACTGCCATGG CCATGGTGGG TGAAATCGGA GCGCCAATTG GTTGCGTACT GGGAGGAAAG TATCGACATT GGTTTGCGCT TGTCCTTTTT CAAATGCACT TGGGGCTCTT TTTAACGCTT AATCTCCCCA ATTGGCAGCC AATTGGTATG CTGATACAAG TATTATTCAT TCCAACGGCA TATTGGGACA GATGGCTGGG CTTTAGTAAT ACGGACGAAG GGGACTACAA AAAGACAGAC GGTGACAGTG TGGAAAATAC GGAACACAAG GACGCTGTGG TCAAAAAGCG GTCCGCTAGC GCATTTTCTC GGACTTTGCA GATCTTTTTC CTCTCCTATA TGATCTACAA TTGGCTAGGA AACCGGGGTT GGATTGCAAA GCACGATAGG GGAGATATCG GTGAAGGGTT GCGACTAAGC CAATACTGGG TGATGTACGG CACGCTGGGC CACGTGTCCG ACAACATTTT TTTGACGGGA TACATCGATA CGACCAACGA AACTTCCAGA GATGCTCGGC AAATGGTGGA TTTATTACAC TACGTCAAAA CCAAGACATT TCGAGACCAG GACGACTTCA ACTTTGTTCC ACTGGATATG ACAAGCCGCT TTCCATCTCC TCGTTGGGAA CGAGCCTTGC ACCAATGGGC CGCTAAACGC AAAACCCAAT CTGCCCGACT GCTCTGCCAA GTATTGTGTT TGTTTGTCAA CGAAGACCGA ACTCTGAAGG GTCTACCACC CTTGGCGAGT GTGGAGATGC GCTGGCAACA TATGCGCATT CTTCCACCCG GTAGTAAAGA TCGATATCCT AGTCGCGACG TATCAAAAGT ATCCACTCCT GACACAATTA TTTCCGCACC GTGCATGGAA TACGATAGCT CGTTCGACTC TCTTCTGGTG GAATAA
|
Protein sequence | MSILIAIDVE CKSEAMMETL PSGVAATRAN CLDKRSLALF RILLGLYLLY DIYARTSLGK YDLAWYTSLP PSRSFLSDLD FPHQAPLHKL WFYRGTLAFQ VAMFGISTLL AVFFTVGLFQ QTARVGGLVK TVLFIVQVAQ QSRNMPGTDG SDSFLRHLLF WSCFLPLADV WSVDEWRASR RKHKTKKPSH RHWQCTGLPC LGLILQIAFM YLGTALNRID VHGWKNWHQC EWLPPSLSAV HYALSGSFAT RDNFLGDLIR TQPILSKTMT AMAMVGEIGA PIGCVLGGKY RHWFALVLFQ MHLGLFLTLN LPNWQPIGML IQVLFIPTAY WDRWLGFSNT DEGDYKKTDG DSVENTEHKD AVVKKRSASA FSRTLQIFFL SYMIYNWLGN RGWIAKHDRG DIGEGLRLSQ YWVMYGTLGH VSDNIFLTGY IDTTNETSRD ARQMVDLLHY VKTKTFRDQD DFNFVPLDMT SRFPSPRWER ALHQWAAKRK TQSARLLCQV LCLFVNEDRT LKGLPPLASV EMRWQHMRIL PPGSKDRYPS RDVSKVSTPD TIISAPCMEY DSSFDSLLVE
|
| |