Gene Information |
Locus tag | PHATRDRAFT_43112 |
Symbol | |
ID | 7196886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 2065496 |
End bp | 2067756 |
Gene Length | 2261 bp |
Protein Length | 708 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176900 |
Protein GI | 219110297 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
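The coordinate and length fields in the table above are tied together by simple arithmetic. The following is a minimal sketch of that check, not part of the original record: the variable names are illustrative, and the assumption that coordinates are 1-based and inclusive is inferred from the numbers, not stated by the record.

```python
# Values copied from the Gene Information table.
start_bp = 2065496
end_bp = 2067756
protein_length_aa = 708

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# An unspliced coding region for N residues needs 3 * (N + 1) bases;
# the extra codon is the stop codon.
coding_length_bp = 3 * (protein_length_aa + 1)

# Bases in the gene record that the coding region does not account for.
noncoding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, noncoding_bp)  # 2261 2127 134
```

Under these assumptions the computed gene length matches the listed 2261 bp, and 134 bp of the record lie outside the coding region.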
Plasmid Coverage information |
Number of covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Number of covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AACGATAAGG CTATGAAAAG AATCGAGAAA CCGATATGAA ATATATTAGT AGGACCAGGT CAAGCGAATG AAGCTCCTTC GAAAAGATCG AGGCCGTGAA AGGCACCCGC TTTAAATTCG GTTTGGGTTG TTCAATGATG TGCTCGTTTG GCATTGCTTC AGCGTCGATA CCGAAGCCGG CCTTGTCGCC TTGGACGACT ACCTCCAAGA AGTTCTTGAC ACAACCTGCA CCATGGAACA GTGGGTGTGT CTCGACGAAA TTCCCAAGCA AATTACACCG ACCCTTTATT CAATTTCAAA GATATCATAT AGCTGCATGC TACAAGCAAC ATATTTGTCG TCAGCGTTTG ACCTGCTTCC AACGCTCATT GATTTTTACA AACTGCGGGG GCTTTAAATA TTGGCATCTA CCTGGTTTCC TCGACGATAC CTCAATCGAA GAAAAGGAGG ATTGGCTGCA AAACCTACTT CAAGATAAAG AGGATCTGTC TGATACCGAG GCTTACCTAG TTGTTTTACG GTCGTTGGCA ACGTCCACCC AACCAGATGC ACCAATGAAA GCTGAGCGAT GGCTACGACG TTTGGAAGCT CGTTCAGTAT CGAATCCGGA TGCACCTCAG CCCACCACGG AGTGCTATCA AAGAGTAATC GAAGCTTGGG GTCAAGCCAC TAACGAAGAC CCAAACCTTT TGATTACCCG CACCCAGCGA TGGCTTATGA AGCACCTGCG AAATGACAAC ATTGATTTGC GACCCGACAC TGCCTGCTTC AACTCTTTCC TGGACTTATG TTCGAAAGGT CGCGCTTTGA AACGGGCAAA GGCAACAGAC GGAAACCTGG TGAGGGATCA CGCTTTAAAA GCAGAACAAA CGCTGCGTTT GATGATTTTC AAGAGGAGGA AAGAAGGGGA AGATTCTTCC ATGGCCCCAA ATGTTGATTC GTTCAATTTT GCTATTCGAG CATGGACGCG TTGTCGTAGA AGTCCTGACA TCGCGGACCG ATCAATCTCA GTATTGCATT TGCTGGAAAA TTATGAAAAA ACGTTGGACT CGTCGGTACG TCCCAATGTC AAATCGTATG CCATGGTGAT GGATTCCATT GCTGTGGTCG CTAGACTTAA AGTGAAACGA TGCCAAAGTA TGCCGAAAAC CGTGGAAAAC CCATCGACTA ACGGTTTGAA CGAGATCAAT TTGCTTCAAG AAGTAGTCTC ATATATGAGA AATCAAGCCA GCCTTGGAAA ACATCACTTG GCGCCGAACG GAGTCATTTT CAACACTCTT ATTTCTTGTT GGAGTTCTTT GGCTAAAATT CATTCTCATG CCCCAAACGA AAGCGAGAAA ATACTGCAAA GCATGATACG CATGAAAGAC ATGGGGGAGA ATCACACGGC TCCCGATGCT ACATCTTATC TGATGGTGAT GCGGACATGG CTTAATTCAC AACAGAGTAT TCGTGCCGAA CGCATATCAT GGTGGTTGTC AAAGCAATGG AAGGATTATG ACTTCGAGGG CGACGAGGGG CTTCGGCCAA ACACTACTAC ATACAACCTT GTCATGCGCG CCTGGGCGGA AAAGGGAGAG CCAAAGCGTA CGGAAGCGCT CCTCGCTGAG CTCATTGGTC ATTCAGAAAA AGACCGAGCT GGCAACCTGT TCCCTACATC CGAATCCTAC ACGCTGGTCA TTCGTGCGTG GCTCGTTTTG GCGAATAGGG GTGATAAATC AGGCTTTGAA ACAGCTGCTT ATTGGTTTTA TTGCTTGGAA GCACGCGAGA GAGACGAGAG CGGATTGGTG GCTCCTAGCG 
AATTTTATAC TTTGTTATTG GCTGCCGGTC GAAAGTGTGC CTCTCAGCAC CCTGACATTC TCGAAACTGC TGTAAAGATC TTTGATCTGT TACGAGAATC TCACCATCGT GTCGACTGTT TACACTACTC GAGTTTGCTA CAGATAGGAC TACTAGCCCT TTCGCGAGCA GAACAAAACA AAGTACGACA GGCGTTTATT GATGAAATTT TCAAAAATTG CTGTGAGGAC GGTCTCGTCA GTAGCCATTT TCTACAGGCT CTCGCGAACG GCCCCGTCTA CTACGATGGT TGGACGGTTG AGGAAAGCCA GCGCACTCTA AAGCGTATCA TTCCCTGTTG GCCTCTTCCA TATACATGGA CGAGAAATAT TAGACAAAAA GGCTTCTTCC CGCAGCGACA AGGATTGAGA AGAAGTAACT TTGTTTGCTC ACCGCACGGA AAGGACCCAT ACAAGACCTA A
|
Protein sequence | MMCSFGIASA SIPKPALSPW TTTSKKFLTQ PAPWNSGCVS TKFPSKLHRP FIQFQRYHIA ACYKQHICRQ RLTCFQRSLI FTNCGGFKYW HLPGFLDDTS IEEKEDWLQN LLQDKEDLSD TEAYLVVLRS LATSTQPDAP MKAERWLRRL EARSVSNPDA PQPTTECYQR VIEAWGQATN EDPNLLITRT QRWLMKHLRN DNIDLRPDTA CFNSFLDLCS KGRALKRAKA TDGNLVRDHA LKAEQTLRLM IFKRRKEGED SSMAPNVDSF NFAIRAWTRC RRSPDIADRS ISVLHLLENY EKTLDSSVRP NVKSYAMVMD SIAVVARLKV KRCQSMPKTV ENPSTNGLNE INLLQEVVSY MRNQASLGKH HLAPNGVIFN TLISCWSSLA KIHSHAPNES EKILQSMIRM KDMGENHTAP DATSYLMVMR TWLNSQQSIR AERISWWLSK QWKDYDFEGD EGLRPNTTTY NLVMRAWAEK GEPKRTEALL AELIGHSEKD RAGNLFPTSE SYTLVIRAWL VLANRGDKSG FETAAYWFYC LEARERDESG LVAPSEFYTL LLAAGRKCAS QHPDILETAV KIFDLLRESH HRVDCLHYSS LLQIGLLALS RAEQNKVRQA FIDEIFKNCC EDGLVSSHFL QALANGPVYY DGWTVEESQR TLKRIIPCWP LPYTWTRNIR QKGFFPQRQG LRRSNFVCSP HGKDPYKT
|
| |
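The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. Several things here are assumptions rather than facts from the record: the "Translation table" field is blank, so the standard genetic code (NCBI table 1) is assumed; the helper names `clean` and `translate` are hypothetical; and the coding start offset of 134 is inferred, since the gene is 2261 bp while 708 residues plus a stop codon need only 2127 bp. Only the first 170 bases of the gene sequence are copied in.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
# Assumption: the record does not name a translation table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group bases in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str, offset: int = 0) -> str:
    """Translate from a 0-based offset until a stop codon or the end."""
    residues = []
    for i in range(offset, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 170 bases of the gene sequence, copied from the record.
gene_prefix = clean("""
    AACGATAAGG CTATGAAAAG AATCGAGAAA CCGATATGAA ATATATTAGT
    AGGACCAGGT CAAGCGAATG AAGCTCCTTC GAAAAGATCG AGGCCGTGAA
    AGGCACCCGC TTTAAATTCG GTTTGGGTTG TTCAATGATG TGCTCGTTTG
    GCATTGCTTC AGCGTCGATA
""")

# Inferred 0-based start of the coding region (2261 - 2127 = 134).
cds_offset = 134

protein_prefix = translate(gene_prefix, cds_offset)
print(protein_prefix)  # MMCSFGIASASI
```

The translated fragment agrees with the first twelve residues of the listed protein sequence, which supports the inferred offset for this record.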