Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_43802 |
Symbol | |
ID | 7203944 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 47141 |
End bp | 50787 |
Gene Length | 3647 bp |
Protein Length | 1189 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186268 |
Protein GI | 219113369 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.327125 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAGCA ACCCCAACAA AACCTCCGCC AAGGAACGTG AGATTCTCGA TCGCCAAAGA CAACTAGCGG CAAAACTCAA GCCGCCAAAA CCGCCAGCTT CGTCCATTGG TAGTAGCACT TCTCCACCCG TGCCCCCTTG CGCAAAGGGT GAGAGCCACC GCACGTCGGC TCCTCCCCCG AACGTGATTG ATTTGACTGA ATCCACCGCG TCCCATCAAT TCGAATCCCG TTCCAATTCA TCACGCAATT TTACATCCCA ATCACGTCCC CTTCCGGCCA AGCGGAAGCG TGTAGAGCCC TCCAGGAAAG CCTCCTCAAC GAGACCGGCA ACCGACACCG GCTCTCAACA AGCCCCACCA ACCAAGCCTT CATCCTCCTC TAATCACCCA GTGACTGTGC AAAAGCGCCC CACACTCAAA CGCAAAGGCA ACATTCCTCC CGCGGCAGCG TTGATGGCCG CGGCAAGGGT CAAAGCTGGA GCGAACGATC CCAAACAGTC TGTGCACGCT ACGACCACCG GAAGCAAATC GTCTCCACGG CTTCAGCGCA AAACGGTATC CACTCGCAAC GCCAATGCGG CGAGTGGGTC CAAAACAGTC ACCAGCGGTA GTCTAGCACA ACTGGTACAA AACGTATCTT CAACGCCGCT GGATGCTGCG AACCTCAGCG GTGCCGCAAG TGGTGTTAAC GCAGTCCACG CCGACGACTT CTGGAAACAT TTGCGCGAAT GGGATTTCGT ATCCCAGTAT GCGTCTTACC AGAGATCCCA ACGGCAGCAA CTAGACACAA ACGATCAGAG TACCACAATG CAGAAAAAGC CGCTTCCCAA CGTCTTTTTG AACGCGCGTC ACTACATGGC AGCCTGGGCT CCACTCTGCC TGGCCGAATG CCGGGCCCAG CTACTGCAGG AGGCTGGACT CAACGCATCC GCACCTCTCG CTGTGCAGGT CCAAACCTCC ACCAACGGAC CCCGAAGGTT TCGGGGGACT GGTGATATGT TTAACGCCTC CAGTGGCTGG GATGAACACG ATACCGGAGG ATACGTGACG ATCCAACCGC AACAAAGGGG GACCGGGCGG GGTATGAAAT TCTTCCCACA CGATCTCGTC TTGTTACTGA TTCCTCCATA CGAACATATC TTGCGAGACT TGTCCCAAAG CCGCAAGACA CCACCAGCAC CACCTTTGGG ACAAGATCCC AACGATCCTG CTGCCTACAA AGATGTCGGA CTGATTGGTC ACGTTGAAAT GAGTCGGGGC GAAGTGGCTG GTTTAACTCT GAAAATTTCC AAGCGGTTGT GGGCCAAGCT GAGTACCAGG AACGGTTCCG CGCCGAGATC ATCCAATGCA TCTCCCACCA CCACCAACAT GTTTCTCGTC AAAATTGGGA GCAACGTTAC AGCGCTGCGT GAATTCACGG CGCTCTGTCA AGTTGACACG CTCCCGGTGC AACGGTACCT TTTGGCCGAA CATCTCGCCA ACGCGCAGAA TCGTCGCAAA TTGAGCCGGA ACCAAACAAC GGAACAGCTC CTGGAACGAA TGGGCGGCGC CAATGCGCTG GGCAAGGGCT TTTTGGACTA CGCCGAACAC AAGTTTAACG CATCGCAGCT TACAGCCATT GCTGCGTCGG CACACGAATA CGGAGAAGGT GGATTTACTC TCATCAAAGG ACCGCCAGGA ACTGGAAGTA AGTAGCCCGA AGGACGTATT CGTAGCAGCA CGCACTGTCT CCAGCTGACC TCCTCTTTTC TCGCTGTCGA GCAGAAACAA CCACGCTCGT GGCTGTTTTG AACTCCTTAC ATATTCGTCA GTACAACAAA TACTACGAAT CGGTCCGACG TATTGCGACG CAACCCACCG GCACGCGTCA GGCTGCTTTG GACATGGCCC GTCGCGCCAA ACCTCGATTG CTCGTTTGTG CTCCATCGAA TGCAGCCGTG GATAACATAA TATTGAAAAT TATGGAGGAT GGCTTTGTGG ATGGACGGGG TCAACGGTAC AATCCAAGCA TGATTCGTGT TGGCGTGGGT AAGGGTACTG CAGTAAAACC TGTCGCTCTG GAAACCAAAG TAGACGCTAT TCTGGCGGAG AATATGGACG CTGGCCGGCT GGAAACCTCG ATCGCGGGCT ATCGGATGGA ATTGACCAGA ATTTCGCAGG ACATTGCCCG ACTGCGACGT CGAGTGCACG CCATGACGAA CGCCAGTGCG TGGCCGCTTT CCAAGGATTG GGAAATCCGT ATCGATGAAG ATACCTTTGA CGAAACGGGA AAGGTGTATT TTGTTAATCA CCGCGCCCAC TTAACCACGT ACGAAGCTCC TCCGCCACCA GAACCGGGAG AGACGCACTT CCCAGCTACG GCAATGCCTG AGTATCGAGC ATTTATGAGT CGGATTGTGA AGCTTGTGGA GAACTACTTT TCGGTAAAAG CGGAATTAGA ACGATGCACA ATAGTCAAGG GATCGATGGA TAATGGTACC AATCATATTG AAGTTCGTCA AAACATGGAA ACACACGTCC TAAATTCTGT ACACATGGTG ATGACAACTT TAGGGACGGC TGGCAACCGT GTCATGGAAG CCGCCGACAA GTTTGAAGTT GTGGTCGTCG ATGAAGCTGC GCAAAGCGTG GAACCGGCAA CTCTATCTGC GTTCCAATTG GGATCGAGAC ATGCTGTGCT AGTTGGCGAC CCCCAACAGC TTCCAGCGAC CGTCTTTAAC ATTTCGGGAC GCCTTTCTAA ATACGATCGA TCCCTGTTTC AGCGTTTGGA AGAAGCTGGG CAACCCGTGT ACATGTTGAA CGAGCAATAC CGAATGCACC CCAGCATTTC TCACTTTCCT CGCCATATTT TTTATGGCGG CACTCTTTTG GATGGGCCAA ATGTACGAAA ATCAGATTAT GGCAACCCAC TGCTTGGTAT GGTCACTCGG ACTCTTCCAA GCTTCTCTCC CTTAATGATT CTCGACCTCG ATTCTAAGGA AGAACGTGGC GGCACAAGTT TGTCCAACTC TGGAGAAGCT CAGCTGGCCG TCTACTTGTA CATGCGATTG AAAGGAATAA GTCGAGGGTT GTCGGCCGAA ACCAAAGTTG CTGTTATTAC TCCCTATGCT CAACAAGCTC GTATGCTTCG CGAGTATTTC GGGGATGCTT TAGGGCCGAA CTACGAGAAA TTCGTGGAGG TGAATACGGT CGATGCCTTT CAGGGGCGAG AGGCCAACAT TGTAATCTTT TCGGCAGTCC GTGCGGCGGG TAGTCACGGC ATTGGCTTCC TTTCCGACGT GCGTCGAATG AATGTCGCTC TGACTCGCGC AAAGCATTTC TTATTTGTGA TTGCACGCTG CGATTCGATT GTGGTAAATC CATACTGGAG CGATTTGGTT ACTCACGCCC GGAAAACTCA CGCTGTGCTG AAGGTTCCGA TTTTTGGGGG CGGTCGGGCG CTGTCCTTTG GAGAGCTCAA CGAATGGCAG AAGGAAACTC CGAAAATTAT AGACAATGCT CCGACTGGAC TGACAGCGAC CGAGCCTCGT GAGAGCAAGC CGATCCCGCC TCCACCCTCT CGACCTCCAG ATCCTCGCAA AGCTCCGAAG GCACCGCCAC CACCTGCTAC ACCGGCAGCG AACAGAGTGG ATCCTCGAAA ACGCTAG
|
Protein sequence | MSSNPNKTSA KEREILDRQR QLAAKLKPPK PPASSIGSST SPPVPPCAKG ESHRTSAPPP NVIDLTESTA SHQFESRSNS SRNFTSQSRP LPAKRKRVEP SRKASSTRPA TDTGSQQAPP TKPSSSSNHP VTVQKRPTLK RKGNIPPAAA LMAAARVKAG ANDPKQSVHA TTTGSKSSPR LQRKTVSTRN ANAASGSKTV TSGSLAQLVQ NVSSTPLDAA NLSGAASGVN AVHADDFWKH LREWDFVSQY ASYQRSQRQQ LDTNDQSTTM QKKPLPNVFL NARHYMAAWA PLCLAECRAQ LLQEAGLNAS APLAVQVQTS TNGPRRFRGT GDMFNASSGW DEHDTGGYVT IQPQQRGTGR GMKFFPHDLV LLLIPPYEHI LRDLSQSRKT PPAPPLGQDP NDPAAYKDVG LIGHVEMSRG EVAGLTLKIS KRLWAKLSTR NGSAPRSSNA SPTTTNMFLV KIGSNVTALR EFTALCQVDT LPVQRYLLAE HLANAQNRRK LSRNQTTEQL LERMGGANAL GKGFLDYAEH KFNASQLTAI AASAHEYGEG GFTLIKGPPG TGKTTTLVAV LNSLHIRQYN KYYESVRRIA TQPTGTRQAA LDMARRAKPR LLVCAPSNAA VDNIILKIME DGFVDGRGQR YNPSMIRVGV GKGTAVKPVA LETKVDAILA ENMDAGRLET SIAGYRMELT RISQDIARLR RRVHAMTNAS AWPLSKDWEI RIDEDTFDET GKVYFVNHRA HLTTYEAPPP PEPGETHFPA TAMPEYRAFM SRIVKLVENY FSVKAELERC TIVKGSMDNG TNHIEVRQNM ETHVLNSVHM VMTTLGTAGN RVMEAADKFE VVVVDEAAQS VEPATLSAFQ LGSRHAVLVG DPQQLPATVF NISGRLSKYD RSLFQRLEEA GQPVYMLNEQ YRMHPSISHF PRHIFYGGTL LDGPNVRKSD YGNPLLGMVT RTLPSFSPLM ILDLDSKEER GGTSLSNSGE AQLAVYLYMR LKGISRGLSA ETKVAVITPY AQQARMLREY FGDALGPNYE KFVEVNTVDA FQGREANIVI FSAVRAAGSH GIGFLSDVRR MNVALTRAKH FLFVIARCDS IVVNPYWSDL VTHARKTHAV LKVPIFGGGR ALSFGELNEW QKETPKIIDN APTGLTATEP RESKPIPPPP SRPPDPRKAP KAPPPPATPA ANRVDPRKR
|
| |