Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42571 |
Symbol | |
ID | 7195954 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 466297 |
End bp | 470958 |
Gene Length | 4662 bp |
Protein Length | 1319 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176588 |
Protein GI | 219109668 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGGAGTCAAG GTCCTTCTTT TTTGGCCGTA CCGAACGATA CGTTCGACAA TGTCGAGTAT TGCGGGGGCT GCGGCAAGTA AGACCCTTTG GGGCAAGATA AAGTCGGTAC CGATCCAACA CCCCTTCGCC TTTGGTTTCT TCCTTTCCGG CTTTAAGACT TCCTTTTCGG ATTTGCTCGT TCAGAAGGTA GTGGAACAAC GAGAAACGAT TGACTGGAAG CGCAACGCCG CTTTCGCGTC CTTTGGATTC TTCTACCTAG GAGGAGTGCA ATATGCTATT TATGTACCAC TCTTTAGTCG CATGTTCCCG GGAGCAGCGG GGTTCGCCGC CAAGTCGATT CGGGACAAAC TCAAGGACGC CAAGGGAATG TTTCAATGCG GAGCCCAAGT CGTCCTGGAT CAGTGCGTGC ACCATCCGCT CATGTATTTT CCCGTCTTTT ACTGCACACG AGAATTGGTG GTACACGACA AACCCGACTT GAAGCGTTGT CTGAACGAGT ATCGGGGTAA TATGAAAGAA GATTTGGTGG CTCTCTGGAA GGTGTGGGTG CCGTCGACCA TTATAAATTT TGCCTTTATG CCCATGTGGG CGCGAATTCC CTTCGTGGCA GCTACGTCTT TGCTATGGAC GAGCATATTA TCCGCCATGC GGGGTGGAGA CGTACGTTCG TTCACCTTTT TCCCCTGGGA AACGGGGCTT GCCTGGTTCG TTCGTCCGTT TTTGCTCACG AGCATTCTTT CTTTTGTTTG GTGGCAACTT TTTTTAGGTC TCTCACGGCG AGGAAATGGC GGGAGGTGCG GTAACGGGAG CTACGTTGAC CATGGTAGAA GCTGGCTTTG AGTTGTTGTT TACGTCGAAC GTCGAGCTGG ATCGGGATAG CAACCATCTC GTAATAACAG CGTCGGGACC CGATCGCGTG GGCTGGATTG CGACACTGAG TCGAGCGATT GCTGATCAAG GTGGGAATAT TACGCATTCA AAGCAAGTGC GGTTGGGGTC CAGTTTCATC TGCGTCTTGC ACACGGCAGT GGACCCCGAA CTGCAACATG CGTTGATGAA ACGATTAGAA AAGATTCCCG AGCTAGAGGG ACTTTCCTTG CAGTGCAATA TGTTGACCCG ACGAGCTACG GGGAGCTTCG ATCCGCCCGT CATGGGAGTG CGTTTGCATT GTGCGGGAGA AGACAAGTAA GTGCTTTTGG GTGCAACTGG ATTCTTCGCT GATGCCCTAG TCATTGTTGT TGGTGCGTAA CACTGACGTT GCTTTATTGT GTTATTATTA TTGCTGGGAT CATGTTCAGA CCAGGAATGT TGGCGTCGAT CACCGAAAGT CTTGCCAACC ACGGCTTGAG TTTGGAAAAT GTAACCACCA GCGTCCGACA CAACAAAAAA AGCGGCTGCG ACTTTGTGGT GGACGCGGAC TGCACGTTGA CTCGTCATTT GGACCAGGAC CAAATCAAAG CCATGGTGGA CGATCTCAAC CACTTGAAAC AAGAACTGGA CCTGAGTACG GTCGATATTC GCGTGCAACG CTTGGCCGCG GAACGACGCG GTACCATCCG GACCAGCGAG TGATCTTCGT CCCGTGTGGC AGTCGCACGC TGGCGGGTCC GTGATCGTGA TCGTTTTCCC ATTTTTGTAA ACCTCTCCTT TTTACGTTTA TCCTAATCTT TACTTGGAAT AGTTCCGTAT GGACCACGTT GGGATCGTGA GTTTTCTAGA GTGTCAAGAT AGTAATGTTT TCTGAGTGTA AATATAGGGT AGTATGGGTC GGCTGGTGGT TGGGCATTGG TAGATGATGG CTGGGCTTCG GTGCCAGTTC CGCCCTTATG GTATTGTCCG AATTGACATA CCATAGACAC AGGATTAGAA TACGATTGGT TCCCAATGGT CGAGACGTTT CCGACTGCGC CCAAAGCAAA AATCGCACGC GGTTTTAAAT TCCATGGCGG GAACGCGCCC ACCCGTCCCT GGTTCCCGGC ACGCCGGACC GCTCGCGCCG GAATCCGCGT CCCACGACCA CGCAACGTCC GCACCGTGTG GTACGAGCTG TACACTGGTT CCATCCGACC CGACCCGTGG CGCACCAACA CTTTCGGGGT CGCCATGCCT GGCATTGTGT AGTCGTTACT ATCTCTCTCT CTCTCTCTTT GTGTCTATCT GTTGCGCAAC CAAACAGTGT GGCATGGGTC GAGACGTGAC GGTGTCGTTC GCGTCGATTC GCGTCCAAGT CCACCGCGGG GCGTTGCGGG AAACCAAACC GCTCCTGACG GACACCGATG CCGTCGCGAC GCTGACGGGC CAGACGCTGT TGATCGGCGC CAGAGGCAGC TGGTACATGA AAGCCAAAGA TCTGACGGCT CTGGACGTAT CGGATCGTGT GATTACCGTG TCGTGTGCTT CCAAAACTGT GGAATTGCGG GACGCCCGCA ACGAACGGGA CTTTCGGATG TTTCGGCAGA AACTGCAACA CTGGTACGAA ACGCAACGTT CCGAAACTGC CTTTGGTGAT TTCTTGCACA CCAGTGCCAA AGCAGCTTTG GGTAGTCCGC GGAGTCGCGT TTCCACGTCC CGATCGACCC AACGCCGGCG GATTCGGGGA ACGTACGGAT CGCAAGCGAC TCGAAGTGTA TCCACTGCCG TGGCTCCGAC CAATCGATCC TGGTTGCCCG AAGTCTTTTC GGAAGATGAG GGAGAAGACA CCCACCTGCT GACAGATAAC GCAGTGGAAA CACCCCGCGA GAAAGCGGTG GAGAAAGCGG ACGACGACGC ACTCTTGCCG GCGGTTTCCA ATGGTAACTC GGTTGGCAAG AAACGCCCCC GCCTACAGAA ACTCAAAAAG GCTCTAGACG ATACCCAGAC GGCGCTAGAA GACGACTCGG ACGACGATGC TTTATTCGAC GACGTCCATC CATGGACTAC ACCCGCCACA CAGCACATTG TGTCGCCGGG AGGAGCGCTC ACGGAACGCA AGTCGCCCGG AAGGAAGCAA AAGAGACTGT CCAGTTTTTT TCCACGCAAA CCTACCCAGA AATCTTTCGA TCCCGCGACT GCGGTCACCA CCCCACCACG CCCGGTACCG CGGACCCCGA CACGACTGTC TTCGCCTGCC GCCACCCGTC TCGTCAAGTC GGCACGCAAA TCCCTGACCC ACGACGCGGC CTGGCTCGAA CGCTCGCCTG CCGCCAAATC GCCGCACCAA AGCAGCGCGG AACGACTGTT TGGGAAACAC AATTTCTTCC ACTCGTCTCA TCGGAATATT AAAAGTGAAC CCGAACCAGA AGACCCCATT CAAGAATTTG GTGATCCCAC GTCCGCGACA CCAACATTCC AGCTCAAATT GAAGCCGCCT TTGTTCTCTC CGTGTGATAC TAGTACACCC ACCAAGAATT TGTTTCCCGA GGTAGATTGT TCGCCTTCGT TGCAACCCGA AGAGACCAGT AAGCTGACCC CACCTCCACT CATTCCTCGT TATCCATGCC GGGGCTTACG GAACCTCGGC AATACGTGCT ACCTGAATTC GTCGGTCCAA ATGCTCTGTA CCGTGCCGGA TTTTACCTCG CGACTAGACC AAATAAATGA CAACGCTCCA CTCGCGACGA GCCTCGTTCA AGTCGCTCAC GAGCTGAGAG ATACCAATGC TCCTCTGTCC GTAAGACCAC GAGCAATCAA AGACGCTATG GACGAGAAGA CGCACAAATA TCAAGGATTC GAGCAACGTG ACGCCCACGA ATTTCTTAGT GATTTAATCG ATCACGTCCA CGAAGAGTTG ACCGAGAAGA GCAAAGCGGA ACCGTCCAAA GAATCGGAGC AAACTCCGCC AACCGACGAC TTTCGTTTGG TCGTGCGGGT GTGCCTCAAG TGTACTTCGT GTGGGTATTC TCGGTAAGCA TTTCAATCGG AATTGTCTTT TTTTTTGTAA CGCAACAGGC TGCTGACGTA TGCTTCACTC CGTCTCCTGA TATTGTAGGA ACAAGGACGA AATTTATCGA CACCTGTCTA TTGATGTCGT GGGCGATGCC ACCTCGGAAG AAGTATCGGA TGTTTCGCAA GCTTCAGTGG AGCAGGGACT GGCCCGGTTC TTTCAACCGG AGACGCGCGA AATTTTGTGT GAAAAGTGCA AACGCGGTAC TCACGCTTTC CAGACACTAC GGATTGTTCA AAAGCCCAAA GCGCTTTTGC TGCATCTCAA ACGGTTTCTA GTGGTGGAAA AGCCGCGACC GTTATCACCC AACCATGCAG AGGCCACTAC TGACGAAAAC AGCCCACCTA ACAGTCAAAG TAGCTCTCCT ACAAAGACAG CGCCCCCGCC GCCCGAGTAC GTTTTCCGAA AGAACAAGGC ACCCGTTTTG ATCCCTGCGA CTCTGTCGTT GGACTCCTAC CAAACAAAGG AAACTGTGCA AGACGGCAAC CAGGTTGCCT CTACTTTTTC GCTGCAAAGT GTAGTGCATC ATGTTGGCAA TCGGTCGTCG TCGGGACACT ATACAGCCGA TGCGCTGAGG CTGGTGAACA AGAACGGCTC AGAAACGGAG AGTGCTAAGA CAATGCAGTG GATTAGTTTT GATGACGGCT GCTCCGGTAG AACGAGTTTG GAAGATGTTA TCCTAGACCC GGTCAAGCAA GCAACCGCTT ACATTCTACT CTATTCTTCA ACACCAATCT AA
|
Protein sequence | MSSIAGAAAS KTLWGKIKSV PIQHPFAFGF FLSGFKTSFS DLLVQKVVEQ RETIDWKRNA AFASFGFFYL GGVQYAIYVP LFSRMFPGAA GFAAKSIRDK LKDAKGMFQC GAQVVLDQCV HHPLMYFPVF YCTRELVVHD KPDLKRCLNE YRGNMKEDLV ALWKVWVPST IINFAFMPMW ARIPFVAATS LLWTSILSAM RGGDVSHGEE MAGGAVTGAT LTMVEAGFEL LFTSNVELDR DSNHLVITAS GPDRVGWIAT LSRAIADQGG NITHSKQVRL GSSFICVLHT AVDPELQHAL MKRLEKIPEL EGLSLQCNML TRRATGSFDP PVMGVRLHCA GEDKPGMLAS ITESLANHGL SLENVTTSVR HNKKSGCDFV VDADCTLTRH LDQDQIKAMV DDLNHLKQEL DLSTVDIRVQ RLAAERRVPY GPRWDRLEYD WFPMVETFPT APKAKIARGF KFHGGNAPTR PWFPARRTAR AGIRVPRPRN VRTVWYELYT GSIRPDPWRT NTFGVAMPGI CGMGRDVTVS FASIRVQVHR GALRETKPLL TDTDAVATLT GQTLLIGARG SWYMKAKDLT ALDVSDRVIT VSCASKTVEL RDARNERDFR MFRQKLQHWY ETQRSETAFG DFLHTSAKAA LGSPRSRVST SRSTQRRRIR GTYGSQATRS VSTAVAPTNR SWLPEVFSED EGEDTHLLTD NAVETPREKA VEKADDDALL PAVSNGNSVG KKRPRLQKLK KALDDTQTAL EDDSDDDALF DDVHPWTTPA TQHIVSPGGA LTERKSPGRK QKRLSSFFPR KPTQKSFDPA TAVTTPPRPV PRTPTRLSSP AATRLVKSAR KSLTHDAAWL ERSPAAKSPH QSSAERLFGK HNFFHSSHRN IKSEPEPEDP IQEFGDPTSA TPTFQLKLKP PLFSPCDTST PTKNLFPEVD CSPSLQPEET SKLTPPPLIP RYPCRGLRNL GNTCYLNSSV QMLCTVPDFT SRLDQINDNA PLATSLVQVA HELRDTNAPL SVRPRAIKDA MDEKTHKYQG FEQRDAHEFL SDLIDHVHEE LTEKSKAEPS KESEQTPPTD DFRLVVRVCL KCTSCGYSRN KDEIYRHLSI DVVGDATSEE VSDVSQASVE QGLARFFQPE TREILCEKCK RGTHAFQTLR IVQKPKALLL HLKRFLVVEK PRPLSPNHAE ATTDENSPPN SQSSSPTKTA PPPPEYVFRK NKAPVLIPAT LSLDSYQTKE TVQDGNQVAS TFSLQSVVHH VGNRSSSGHY TADALRLVNK NGSETESAKT MQWISFDDGC SGRTSLEDVI LDPVKQATAY ILLYSSTPI
|
| |