Gene Information |
Locus tag | PHATRDRAFT_35741 |
Symbol | |
ID | 7201130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 643591 |
End bp | 647703 |
Gene Length | 4113 bp |
Protein Length | 1370 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180285 |
Protein GI | 219119037 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACATA GTAGTACAAC TACAGGGGCT GAAGCGCAGA ATCGAAAGAA AATAGTTGCA CTGGATCTAA AATCTGCCGA CTCTTCAAAT GAAAACAACA TGAGTACACA CGATGCGTTG ACAACCTTTA CCACGGACTC CACACCTGAG GCATACAAAG TCAGCACCTC CGATATTCGA AGATGGAAGA AAGTAGGCTG GAAATCCAAG TTGACTCGTC GTCAAAAGTC AACGGAGTCT TTGGAAGATG AGTTGAAACA GCACTTGCGC GTTGCCGTGG AACAATTGCT AGACAGCTTG TTTATCTACC TTTCGAATCT ACCGGAGCCA TCTTCCACTT CTTCCTCAAC TACAGTCGGA CTGACATTGC CGGCTTCAGC GGTGGGCTGG TTATCCCAAA AATTGTTTTC GGATACTGAT TTTGGAGCAC AGAACGAGGG TGGTCTGGCT GAGCCAAATT CATCTTTGCT AAGCGGCTCA TTGCCTGGAC GCCTGGCGTT GATCAAGTTT CTGCTCCCCC GTGCTACGCA TGTGCGTCTG ACATCAGAAC GCTGGCCATC TCGAAGAAAC CAGAAGGCAA TGGAGGCTGA GGATACGATA CATGTCATGT GTGACCACAA TCCGCGGGGA GATACTTCCG GCTCGTTTTT GGAATACTAT GATGCTTTGA TACACAGGCC GCGTGTAGAC ATGCGCGTTT TTTGCCAATG CAAAGTACTT CTGATTGCTC AGGTGCCACC ATCATCAATT GTCAACCTTT ACTGTATCCG AAATACTTTG CAGATTTTGA GAGTTGAGCG CAGCTGTATT TTCGACCTGC CATCTTTCTT ATTACCGGCT GAGAAAAATA ATACACGTTC CGATTTCGAA GACCCGCTAT CGTCGCTGGT AGCCCCAATA ATATACCCCA ACTTAGTTCA TGCTAAATTA TCGTTCTGCG GCTTGGATGA ATTGTCAGGG ATGAGGGGAC GACGAAGAGC AGATCAGACA CGCGACCAGC CACCATTGGG TTCTCTAACG GCTTTACAGT CGTTGAACCT TTCACACAAC GAAATTGTCT CGGAACGAAC AGCTCTCTCT AGCGCAAGGA GAATGCCATT TCTGCAGCGA TTGGATTTAT CTCACAATCG GATTACAAGT TTACGAAACG CACATTATCG TCTCGGTAAT ATTCAAACTC TACGCTTGTC CTACAATCGC TTGAGATCAG TACAGGGCAT TGACCGGCTT TACTCATTAG AAAGTCTGTG GCTAGATCAC AACGAGCTTG AGGACTTGAC AAGTATAAGC GGCCTATCGC GTCTTCCTGA GCTTCAAACC TTGCATCTGC GCGGGAATCC ACTTGAACTC TTGCGGCTGC GCACGTACCG CATTGGAGTG TTTGACTTGT TTTGTGAGCG CCGCTTGGCC AATTTGGATC AAAATGCAAC CTTCCGCCAA TTACAGCGGG CTTTGCCTAG CCTTGACTTA GAGCGCCCAT CTTTGATGGA AATGAAAGCT TTGCGTGAGC GCACTTTCGC ACCGACATAT GCTGTTGCAC GCCCGCATCG AGACTCTGAA GGTGCAACGG GTCTTGTCAC CGAAAATGCT AACATTTCCT CGACGCAAGA AAACGGGAAT TTCGGTGCGT TGTCAAGGGC GACTGTCGCA AGCTTCGCAA TGCCGAAGCT TACAACAGGT ATCAAACGGA GTGCAAAAAG GCGACATGCT CCAATTCGTG GAATGGATGT CGATTGGAAG CAGGGCAGAA TACGTCCTTA TCAAAGCAAC GCATGTTCAC CGATTGTGGA TTTCACTTCT 
ATTGATGTCC TTGTGTCCAT GTCTGAAATT TCTGAAGCAC AGTTCGAGTC AGTGACGTTG ATTGAAACGA AGCATTCACA CGACCAGTGT GAGCAAGCTG GCGAAAATGG AGTTGAGAAT AACATCAAAA TTGAACCATC TTTGTTTGAA GAATACCTCA GAGCAGCGGA TCGGGTGGGG TCGATACCAG TCGACCCGGA CGGAGGCAGA GGCACTACTT TCTTCAAGGC AAGCATGAAG ACTGTAGTAG GTACCGATAA GGATGGTGAA GTTCCTGGAG AGAGACTAGG AGATGCATGT GGCAATATCG ATATTGTTAA GGGCGACGAA GCTGCTGCAG CCGACGAGGA AATTTCCGCG AAATATCCTA AAAACACCGA AGCTCACGAT TCACAGTCAA TCACGACTTT CCTTGAAACG ATAAAAGAAG GACAAGGCAT GCCAACCACT AATGCTTTCG ACTCTTCTAG CGAGGTACGT GCGTACGTAG GGCTTAAAGG AACCCCATCG AATGTGAATG AAAACACTTC GCCACACTTG CTCTCTATGT CAGTCTTCAA TGATAACATA GACATGGACA GTGATTTTGA TGATGAGCTC GGAGGTGATC TGCTGTCGAA GGGTCCGTCA ACAAGGAATC ATGGGAGAAG CGGTGTTATT GGCTTGACGA ATCCGTCGCT GATTGCAGAA CAAAGCATAC CAACAGATCA TAATAGAATG AATCGATTGT CTCTTTCCCC AATCAAACGA AAACCGCCTA ACCAAGACTT GACAATTCCA CAAGCCACCA CAGAAGCATT TTCGGAGGCT TCCTTTGATG TGGATGGTGA AGCGTCAACT CCTCCACGGT CTGTCATAGT GAACGACTTA GATGAGGGAA ACTGGGACCA GCATCGCAGC CTCAGATCTG AGACATCTTC ATGCGTGCAA ATGTCGAACG AGGAACAACT CGTCATAAAT GTCACATCAT TCCCCGACAG TTTGTGGCAC GACGACACCA ATTCGATCCA CAGCGCCACC GTGACGCCTA CAAGAAACAA AAATTCTGAT CCTGGCAAAT TCGTCCTTGC TGAAAGAAAC GCAACCTATG AAGGTCCGGT TCATTGCAAA AACTTCTTGA TTTACGACAA TCTGGATTTT TATTTTCAAT TATTTGTTTT TCCGCCGCGG AGTGGGAAAA TGTGTGAAGC CGCTGCACCT ACCTCGTGGC CTGGGGGCGC AGAAGAAGAC TGGCGTGGAG TACTCGAGCG CTTCCCACGT ATTCAATTAT GGCTAGTTGA TCGACAGTTG AGAGAAGCAG CCAACCGTGA ATCGATGTCA GCACATACAT TTGAAGAATA TCGTCGAGTC TGGAGAGAAA GGGTGGTCGC TTGTGGTAAG CCTGCACTTC GGCGATTGAC ACCAAATCGT ACAGCCCGGT ACGGTTTTCA CGGAGAGCTA CTCTGGTCGG CAGCTGGCTC ATCGCACTTG AAGCCTGAGA CTGTTGCTGA ATCCCGGAAT GTTCTACTTT GTTTGTCGAA CGAAGCTTTT TACTTACTTT CCGATCATGA TAAGGTCAGC GCAAAAGCTG TGGAACAGAA GAAGACCTTT CCTATTCCCA TACCCGAGCG TGCTAGATTT TCAGATGCAA AGTTTCCACA CTCTATGGCC CGGCACCCGC TTTCGCAGCT TCGATCTATT GCAATTGGTT TTGGCTTCCA GCGCTTAACA CTGCGATTTC GCGATCTGAC GTCGGTAAAT GAAGATGATT TCACTTACAT TCTACTCATA TCCAACAAGA CCCAAACAGT GAATTTGCTC AAGGAACTGC 
AGCAATATGC AGGAGACAAA GCGACAAGTA TTTCCGGATT GATCGCGTCA GATAACGCAG TTACTATTGA GAATGACGAT CGATACGTGC TTGACGCTGT CGGAGTGGCA GTGGCCCCGG ACGTCATCGA TACAATTCTG CACTACCAGA TCTTACAGCA GCGCTGGAAA CACGGCGAGC GTGGGACTGT AAAACGAGTT TGTATTGTAA CTGACGCGAA GATATATCTT TTGGACGAAG ACTACCTTGG CGACGGATCT GAGTCCATCG ATGCAGGGTC GCGTACTCTT GGGGAGCCTA CATATCGTCT TGTGGATTCG GCTTCCCTAT CACTGATTGA TAAGGTCCAG GCAGCCGATG CTGATCCAAA CTCTATTACT ATTGTGATTC AACCTCTCAC CCGGTTGCAA CGCTTTCGAA ATTGGAGGCT CTTGTGCCAC GACAGCCAAG GAGCCGAAAG ACTCGTGGAG GATGTCCGAA AAGCAGTAGA GTTTGCGTCG TAA
|
Protein sequence | MEHSSTTTGA EAQNRKKIVA LDLKSADSSN ENNMSTHDAL TTFTTDSTPE AYKVSTSDIR RWKKVGWKSK LTRRQKSTES LEDELKQHLR VAVEQLLDSL FIYLSNLPEP SSTSSSTTVG LTLPASAVGW LSQKLFSDTD FGAQNEGGLA EPNSSLLSGS LPGRLALIKF LLPRATHVRL TSERWPSRRN QKAMEAEDTI HVMCDHNPRG DTSGSFLEYY DALIHRPRVD MRVFCQCKVL LIAQVPPSSI VNLYCIRNTL QILRVERSCI FDLPSFLLPA EKNNTRSDFE DPLSSLVAPI IYPNLVHAKL SFCGLDELSG MRGRRRADQT RDQPPLGSLT ALQSLNLSHN EIVSERTALS SARRMPFLQR LDLSHNRITS LRNAHYRLGN IQTLRLSYNR LRSVQGIDRL YSLESLWLDH NELEDLTSIS GLSRLPELQT LHLRGNPLEL LRLRTYRIGV FDLFCERRLA NLDQNATFRQ LQRALPSLDL ERPSLMEMKA LRERTFAPTY AVARPHRDSE GATGLVTENA NISSTQENGN FGALSRATVA SFAMPKLTTG IKRSAKRRHA PIRGMDVDWK QGRIRPYQSN ACSPIVDFTS IDVLVSMSEI SEAQFESVTL IETKHSHDQC EQAGENGVEN NIKIEPSLFE EYLRAADRVG SIPVDPDGGR GTTFFKASMK TVVGTDKDGE VPGERLGDAC GNIDIVKGDE AAAADEEISA KYPKNTEAHD SQSITTFLET IKEGQGMPTT NAFDSSSEVR AYVGLKGTPS NVNENTSPHL LSMSVFNDNI DMDSDFDDEL GGDLLSKGPS TRNHGRSGVI GLTNPSLIAE QSIPTDHNRM NRLSLSPIKR KPPNQDLTIP QATTEAFSEA SFDVDGEAST PPRSVIVNDL DEGNWDQHRS LRSETSSCVQ MSNEEQLVIN VTSFPDSLWH DDTNSIHSAT VTPTRNKNSD PGKFVLAERN ATYEGPVHCK NFLIYDNLDF YFQLFVFPPR SGKMCEAAAP TSWPGGAEED WRGVLERFPR IQLWLVDRQL REAANRESMS AHTFEEYRRV WRERVVACGK PALRRLTPNR TARYGFHGEL LWSAAGSSHL KPETVAESRN VLLCLSNEAF YLLSDHDKVS AKAVEQKKTF PIPIPERARF SDAKFPHSMA RHPLSQLRSI AIGFGFQRLT LRFRDLTSVN EDDFTYILLI SNKTQTVNLL KELQQYAGDK ATSISGLIAS DNAVTIENDD RYVLDAVGVA VAPDVIDTIL HYQILQQRWK HGERGTVKRV CIVTDAKIYL LDEDYLGDGS ESIDAGSRTL GEPTYRLVDS ASLSLIDKVQ AADADPNSIT IVIQPLTRLQ RFRNWRLLCH DSQGAERLVE DVRKAVEFAS
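The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon (the gene sequence above ends in TAA); the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp 643591, End bp 647703.
length_bp = gene_length(643591, 647703)
print(length_bp)                  # 4113, matching "Gene Length"
print(protein_length(length_bp))  # 1370, matching "Protein Length"
```

Because the gene is listed as unspliced, the CDS length equals the gene length here; for a spliced gene the intron lengths would have to be subtracted first.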