Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44406 |
Symbol | |
ID | 7198061 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 449850 |
End bp | 455748 |
Gene Length | 5899 bp |
Protein Length | 1782 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178513 |
Protein GI | 219115435 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0950443 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGAAT CCAAGCGAGA CACCGAAATC GCAAGGTCGA CGGAACAGCC CACGGATATT CAGAAGTCCC TGCCGGTAAC GGCATTGCCC GTCGGAGACA AACAAGTGCT GGATACCGAA GGCGAAGAAG GACTCAAGAA TAAGGAAGGT TTGAAATGTA TAGTGGATGC CACTGGTATT CAGGCGTGCG AGAAAGCACT AGAAGATTCC AGCGAAGGAA GGGCGACACA AAAGGAATCC ATTTCCGACA AACAACAAAA AATGGAACCT CCTGGAGAAT CAGTCTCTAC ATTCAACGAA CAGGAAACAT CTCAGCTTGA AGACCGAAAG TTCCCAGCTG CTCCGCCGAA TTCGTTGGAA GACCAATTGT CCGTGAACGT GCTCCCTGAC GAAGCTGCTG CTGAAACAAC TGTAAAGGAA TATGTATGTG TTGGGCCCTC TTCTGAACAT GCCGACATAG GGCCTGTTAT TGACAGATGC GGCGATCGGG AAGACGTAGC AACTAACAAC TCAGCTCCCA CGAAGGCCGC TCCTCTTATT GACCAGGTTT TGATGGCCGA CCTCTCCACT GAAAAAGAAT TTTCACCGCC ACGGCTTTCA AAACTTTCGA AGCCGGCGAA CAAATTTGAT GCCGCCACAA TGGACCCGAC TGCCGTTTCA CTGGAAGATG CCTTTGATTC TTATGCGTCG GTACGGAAAC GAAAATCGAA GAAAAGAGAA ATCCAGATAA AGATCTCATC AATGGCGACA GTGAAAGACG AAAGGATGTC TTTTGGTGAA ATTGCCCCTG CTCGCAGGGG TTCTGCCCAC CTTGTCTTTA GGACTGAGTT GCGCCCAGTA TGTGAGGTAG GAAGTGAGTT TTACGGAATT TCTGGTCTTG CCGATAAGTT TGAAGTAGAT ATGAAAGACT CGCTATCATT TTTTGAAGAG ACGCATGAGG AGCAAGATGC GGCCTTCAAG GAGTTTCAGC TTAAACGTGG ATTGGAAGAG ATGCAACGAA AACTTCGCGA AGCCGAGAAC GAGGACAAAA AAGGACAAAA GGAGATAGCA GAGTTGATTG GGGCCATGGC AAAAGATAAA CAACTTTCGG CTGAACGCAG TTTTGAAAAC TACAAGGCCA AATACACTGA CGATGAGAGT CGAGACCTCC AGCGATTGCA AGCCTACTAC CAACAGAAAA CGGCCTCCAG TCAAAATAAA ATCAATCAAG GTATCAAAAT TCTTCAACGG CGTCATCAGA GAAATATGCA ACAAGCTATT CAGCAGCATC GGGAACAGGT CCAACAGCGC CACCTCCCGG AACAAATGGC TTCTGCAGAA TGGCAATCCA CATCTCAGCA GATCCAAGCA AAGCAGCAAA GGGAAATGCA AGACTTTAGT GCGAAAGGTG AAGAAATTAA GGCCAAAACC GATACAGATT GCAGGCGAGA GCAGGAGAAG ATTCGCAAGC AGTATGAACA AAAACGTCTG AATATCGAGA TCAACCGTAA AAAGCTATTT TCGAAACTCT ATACTCAATT CCAGCAGCTT CGGCAGCGCT ATCTCAAACG ACACCTGCAA ATGGTGATGC AGAGGAGAAA GAATATAGAA AAAGCTTATG CTGTTGAGCA CAAAACCATT GATACGGGAA CGAAACCATT GGGCAATGAT AGCGCCTACA CTCCATTGGA GCTGGCAAAA AGCACGTTGC AAGAAAAACC TGAACATCAA CCACCGTCTC CGATTAAATT TTCTCACCGT TGGACTTCCC ATGAAAAGTG TCATTCTGCT GGAGGTGGCG CACGGCACAA GCACAGAAAG GGAGTCATGA GTCAGGCCAA TAGACAAATA TCCATCGAAA TACACAATGA AGGGATTTGG ATTTCCAGTC TCTCGAAGAG CTGTCAGCCA GAGGAAGGGG CAGAGAATGC CAGCCAGGAT GAGCTACTTC CATGGGGTAC CAACGCCTAT AGTGTCCTCG AATCCATTAT ATGCGGCGAG GTTCCACATT ATTTAGACAC CTTTGATTTT GGAGAAACAG CTACAGCTCA AGGCGGTCAA CTTCGGTGCA TTCTCACAGA TCTTCGAACC AGCGATGAGA CAGCATCATC CCAGCGGTCC CTTGCTTTCA GGGAACAAGA GGACGCAAAC TTGGCAAACC TTGAAAAGCA GGTTACCGAA TTGCTCGAAA AGACCTCCGA GGCAGAAAAG GCTGTGACAC GATGCAGCGA AGAAGAGAAA CAATGTCATA CTTCGTTACA AACTGCATGT AAAGAATTGG AGAAGGCCAA GAGGACGCAG GACGACTTCA GGTCAAAATT TCGTAACTTT CTTGGCCCAG GTGTGTGATT CCCATCAGTT TTCATGCTGT TGAGCTATCG TCTCGTAACT CATGCAATAT TTGCTTTTCG TCTTGCAGAT GGCAATCCGA TTGCCACAGC CAATCCAAAT GACAGACAAG AGCTCATGAA GGCGATGCTT CGATATAAAT CAAACCTTGA GAATGCTATG AAGAATGAAA AGTCAGCCAA GCAGTCATTG GAAGACACGC GGACCAGGCT CCAGAAGCTT CAACTGTTGA TGAAAGCTGG ATACAAAAAT TCAGGCATCG CTACTAACGC ATTGAAGAAG AGGAAGACTG TTCTCGCTGC AGCGAGGGCG GGCCGAAGTA GTCGCTTCTC GAAGCAGTCA GAGGTCGCGG ATTCTGAGCG AGCAGCTGTT CGCGTATCGG ACATTTTGTC TGCTCTAAAG AGGACTGCCG AAAAGAGACG CGATCAATCG AATCAGAAAA AAAGCAGCAG CTTTAGTAGC GCCTGGATTC AATCTTTTCC TGGGCTACCT AGCTCATTGA AAAAGAGTCT TTGGCACAAG ATGCATCGCA GAAAGCATCA AATAGTGCTT CGTCCGACGC AGGAGTCAAT GATTAACGAC TTGCGTAAGT ACGTGGAAAC CAAAATAGGT GGCGAGAAAG CGAATCGAAG TCGAATCGAA CAGGAAGCTC TTGAGTCAAA GCTGTTGAAC GCAGAGCAAC TGTTCCTACT TGCGATGCAT CCTGCGTCTG ATCTGGAATT TCCTCGGGCG CCTTCATCGA AATCAAACGA GGAATGGGCT GAGCCTGGCT GGCAAATGGT TCTGACAGTG CCATCCAAAA AGAATAGCGG GATTCTACCA TGCACACCTA GTTTTTCACT TCCGGAAATT AATACTTGCG AAATGTTGTC GGCAACTGGT CGCCAGGCTT CGTCGCTTTT TAGAACTTCA CAGCTCAAAG GGTTGGTAGC ACCAATGTCT GCATTCGCGG TCGCTTCAAG TCCAGCTGAA ACGACCGCCT CCTTGGCAAA CCCAGGTGCG TGCTACAGTC CTGTCAATTG TCCTTCAAAT TTATCAGTCT TCTCATCGTT TCCTCGTTGC CGTAGTTATA TGGTCATCGG GGGATCCCTT CTTTGCTACA GATGAGGATA CGGCCCTTGG TTACGCATTC CATGCAAAAC AAGCACCAAC GAAGCCCCCC CTTCAAAGCA AAAAACCATC CACCGCAAAA GATCCATCAA AATTCGATAC GATGAAAGCT TTGTACTCGG CATCACAGAA GAGCAAAACA AGCGGACTTA AAACTGTAAA TCCAATTACC GGTACAAAAC GGAGCTCGGT TGATAGTGCA AAGCTTGATA AGTTTCTGCC AGTGACCACA AAGACAAGCT CAGAAGCTAC ACCACACAAA AGAAAGACGA GCGATGTATC TCAAAGTCAA AGTCGAAAAT TAGGACGGAA ATTAGATCTG ATCTCGGAAG GACAGACTGC TGTTGCAGCA GGGAACACAG CAACACAGTA CAACCAGTAC ATGAAACAGC AAATACAGCA ACCGCAGCAC TCCCAGCCAC CAATTCAACA GCAGCAGTAT TCTGATCCAG CATCCCAGGC TATAAAAAGA CAGCAAAGTT TGACCAAGCC ACACCAAAGT CAACCATCAC AACAAGTACA GACTCAACAG CACCACCACC AACAACAAAT TCGGCACCCT GCACAGCAGC AGCAGCTCCA GGGTACACCG CAAGCAAATC AATATTCATC TCCACAACAA CGTCAGCTTG CACAGCTTCA GATGATGCAA CAACAGCAGC ATCATCAACA ACAACAACAA TCAACACCAC AGCAACATCA CCAACAACCA ACTCAGCATC CTCATCAGGC GCAGTCACCG CTTCATCAAA TCACCCAACA GCCATCTGTG AAAGCAAGTT TTAGTGCTTC TCAAGCAATG CAGCCCATGC AACAGTTCTA CGCTCCGCTG ACCCGTCCTA CTGCTCCAAG TGGACAACAT CCACAGTATC AATCGCCAGC TCCAATTCAG GGACGGGTTA TGCCAAATAG CAATTCGTTC AGCGGCAATA TCGGCCCCCA CGACGGCCAG AATCCGGCGA ACGGTCCACC ATAACTCTTT AGTGTAAAGT AGCGATTGTA CGGATGTTGG GCGGTACTAC ACTCACCTGA GCTTCTTGCG TCAAGGATTT TGGGACGTGA AATAAATATA AGCTCTTCCA TACCACCTTT TACAGGGTGA ATCAACTCGA CAGGGTATTT CACCCTACCC TTACTTTAAT TGTAAGTCGC TAAGTATCTA CTGTCGCATA ATTCACAGTC AAATGTACTG TCGCCGCCAA CTCGTTGGCA TTTGCGCTTC GGTGAAATTG ATATGGGAAG CTCATCGCAA CCAAGCTTTC TGTCAACACG AGAACAGAAT ATACTACACA AGCCTTATCG AATACCCAAG AGCGCAGTCT CTTTATTTCA CACGCTGCCA ATGACAGCTC TCAGCAAATC ATCGTCTTTT CGAAGCGTCG CGCCCTGCCG CGTCTGCTTG TTTGGTGAGC ATCAGGATTA CCTCAATCTT CCTGTGATTG CTTTCGCGGG CCCAATTCAC TGCTGCATCC ATGTTTCACC CTTGGTCGAT AGAGAAGATC ACGGAGATAC CACGTTGAAA CTCCATGTAC CTTCTCTTAA TAAAACAGCA ATCTATAATC TGCGGAGTCT TCCCCCTCGG CAAGGTCCTG ACACCTCGAA TCCCGACTTT GCTTTGGCAG CGATACATGA AGCTATCGAC GATGGCTGGA TACTAAGTTC AGCCGAAGCG ATATCAACGA CCGATATTCC GATGCAAGCC GGCTGTAGTA GCTCAACCGC CTTTGTCGTT GCATGGGTCC AGGTACTTGC AACCCTTGCG GGTGAAGTTT TGACACCAGT AGAAGTTGCC CAGCGAGCAC ACCAAGCTGA GGTGACACAT TTTGGAGCGC CGGGAGGGAC CATGGATCAT GTCACCATAA GTTTAGGTGG CCTTCTCCGG ATCGGTCCAG CTCCGTGGGA CTACGAGATC ATTCCCAATC CTGAAAACGG CGTTTGGATC TTGTCGTACT CAGGAGAACC AAAGGAGACT TTAAAACATC TGAAGCGTTG CAAATCGGAG CGTCTAGCCG TGTTCGAAAA GCTACGTCAC GATTGGGATA GTAGCGCAGT ACAAGATCTT GACAACAATG AGCTAACACT TCTGCAAGCC ACTCGTACAA ACCGAAATAC AGAGAGAGAA GCCTTTCAAC TCTGGAGATC TACAGCAGCT AGCGGAACAA ACGGAAAGGC CCTTGGCTTT CTCATGTCCA CACACCACGG AGCTTTGCGC GATGGTCTTG GTCTAAGTAC CGTTTCTTTA GAGACTATGA ATAGGGCTGC TTTGGATGCG GGTGCGTGGG GTTTCAAGTT GTCGGGGTCA GGGGGTGGAG GTTGTGGGGT GGCTTGGGCG TCTGTTGACA AGGCTCATAA TGTACGACGT GCTTTGGAAC TATGCGGAGC TGGACCTACA TGGATAATAC ATGAGCCATC AAAAGGAGCT TCAGTTGAAT TTGATATGGA TAAAAAAAAA GCATCATGA
|
Protein sequence | MEESKRDTEI ARSTEQPTDI QKSLPVTALP VGDKQVLDTE GEEGLKNKEG LKCIVDATGI QACEKALEDS SEGRATQKES ISDKQQKMEP PGESVSTFNE QETSQLEDRK FPAAPPNSLE DQLSVNVLPD EAAAETTVLM ADLSTEKEFS PPRLSKLSKP ANKFDAATMD PTAVSLEDAF DSYASVRKRK SKKREIQIKI SSMATVKDER MSFGEIAPAR RGSAHLVFRT ELRPVCEVGS EFYGISGLAD KFEVDMKDSL SFFEETHEEQ DAAFKEFQLK RGLEEMQRKL REAENEDKKG QKEIAELIGA MAKDKQLSAE RSFENYKAKY TDDESRDLQR LQAYYQQKTA SSQNKINQGI KILQRRHQRN MQQAIQQHRE QVQQRHLPEQ MASAEWQSTS QQIQAKQQRE MQDFSAKGEE IKAKTDTDCR REQEKIRKQY EQKRLNIEIN RKKLFSKLYT QFQQLRQRYL KRHLQMVMQR RKNIEKAYAV EHKTIDTGTK PLGNDSAYTP LELAKSTLQE KPEHQPPSPI KFSHRWTSHE KCHSAGGGAR HKHRKGVMSQ ANRQISIEIH NEGIWISSLS KSCQPEEGAE NASQDELLPW GTNAYSVLES IICGEVPHYL DTFDFGETAT AQGGQLRCIL TDLRTSDETA SSQRSLAFRE QEDANLANLE KQVTELLEKT SEAEKAVTRC SEEEKQCHTS LQTACKELEK AKRTQDDFRS KFRNFLGPDG NPIATANPND RQELMKAMLR YKSNLENAMK NEKSAKQSLE DTRTRLQKLQ LLMKAGYKNS GIATNALKKR KTVLAAARAG RSSRFSKQSE VADSERAAVR VSDILSALKR TAEKRRDQSN QKKSSSFSSA WIQSFPGLPS SLKKSLWHKM HRRKHQIVLR PTQESMINDL RKYVETKIGG EKANRSRIEQ EALESKLLNA EQLFLLAMHP ASDLEFPRAP SSKSNEEWAE PGWQMVLTVP SKKNSGILPC TPSFSLPEIN TCEMLSATGR QASSLFRTSQ LKGLVAPMSA FAVASSPAET TASLANPVIW SSGDPFFATD EDTALGYAFH AKQAPTKPPL QSKKPSTAKD PSKFDTMKAL YSASQKSKTS GLKTVNPITG TKRSSVDSAK LDKFLPVTTK TSSEATPHKR KTSDVSQSQS RKLGRKLDLI SEGQTAVAAG NTATQYNQYM KQQIQQPQHS QPPIQQQQYS DPASQAIKRQ QSLTKPHQSQ PSQQVQTQQH HHQQQIRHPA QQQQLQGTPQ ANQYSSPQQR QLAQLQMMQQ QQHHQQQQQS TPQQHHQQPT QHPHQAQSPL HQITQQPSVK ASFSASQAMQ PMQQFYAPLT RPTAPSGQHP QVNQLDRVFH PTLTLISNVL SPPTRWHLRF GEIDMGSSSQ PSFLSTREQN ILHKPYRIPK SAVSLFHTLP MTALSKSSSF RSVAPCRVCL FGEHQDYLNL PVIAFAGPIH CCIHVSPLVD REDHGDTTLK LHVPSLNKTA IYNLRSLPPR QGPDTSNPDF ALAAIHEAID DGWILSSAEA ISTTDIPMQA GCSSSTAFVV AWVQVLATLA GEVLTPVEVA QRAHQAEVTH FGAPGGTMDH VTISLGGLLR IGPAPWDYEI IPNPENGVWI LSYSGEPKET LKHLKRCKSE RLAVFEKLRH DWDSSAVQDL DNNELTLLQA TRTNRNTERE AFQLWRSTAA SGTNGKALGF LMSTHHGALR DGLGLSTVSL ETMNRAALDA GAWGFKLSGS GGGGCGVAWA SVDKAHNVRR ALELCGAGPT WIIHEPSKGA SVEFDMDKKK AS
|
| |