Gene Information |
Locus tag | PHATRDRAFT_50002 |
Symbol | |
ID | 7198704 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | - |
Start bp | 105504 |
End bp | 109800 |
Gene Length | 4297 bp |
Protein Length | 1122 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184890 |
Protein GI | 219129426 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.736142 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGTGACG CCCTATGCGG GACCGGAGCC CCAATACGGG ATTTCTTTGA TCTTGCAGGG TTCGACCCTC CATTTCCGAC CACCAACTGT ATTATAGCAG TTACGGCAGC CTTTGGATTT GGAATCCTTC CGCCGCGACC CCGTCGAAGG TATCACTTGC TCTTTCCTGC GTGTACATTG ACCCTCTCCT CTTCTCTCTT CTTATCATCA ACTTCGTTTG CCTGGATGCG ATCTTCGCTT GCTGGAACAA CGAATGCTTT AGCTCGTAGG CGAAACGAGG AATGGAAAGC AGCCGCTACA ATCTCAATCC GATGCGCTCA CAGGCAGCAC CCTACACAAT CGGCACCCAC ATCCGCGTGT ACGTGTGATG TGTCGTCTGG ATGGAACCCC GTGTCGTGTC GCTGCTATTC TACGATAGAA GCACTAGGGA AACAACTAGG GCGAACACGA CCCTCACTGC AATTTTGGTT CTCAGCACGG TCGACGGCTG TGCAAACACG ACAGTACGCA AACGGCGATT CGCTCCACGC TGGTGCGGCT GCTCTTCGCA TCTCCACAGA AACACCGAGT GCGAATTCGG GGAGCGGAGC ATACGATAAG CGGAACACGG CTCTCGTGGG TCGTGGAGCT CTTCGCACTT TTGCTAGCAC CGGAAGCACC TCTGTCCACG ATTCTGTAGA CACTGATGGC AGCCTCTACC ACACAAATGA TGCGGCTGAT GGTGTAGTCG GATACGGAAG CGAACTGGCG CAGGTGCGAT CCCATACACA ACATCTTCTG CACGATCTAC CACCAGGTTT CTTGTCAGGG AACTACTTGA CGGAACCCGT CCTCTCCTGG GCTGATGCCT CCAAAATTTT GTACTGGTGG ATCATTCGTA AGACGCCAGA ATCTGTCGAA CGTAGCCTGG AAATACTGTC AGCCTTGGTG CGCGAAGTTG ATCGACTACA GGTAGAAGAT GAGGACGAAG ATACCTCTAT TCGCTTTTTG AATCCGTCTC TGGTCGAATC AATCGTCATC AATTGGAGTT TTTGCTGGAA GCGGTACAAT TCTATGAAAG TTACGCCAAT TATACTCTAC GAGAAGTTGC AAGAGTGTGC CGCACAACTC TCGTCACTCG AGCTCTCCGG CAAAGCATTG ATACATATCA TCCACGGAAC AGTCTTGGCG GAAAGGGAAA TTGTGAAGTC TTCTTCGAAG ACGTACGCCA CGGCCTCGTT TTGCGACCAA GTGTTCCGCA GCGCACTTTT GTTGCCCTAC AGTCCCGACA CGCTGACGCT GACGCTGAAT CTGTGTCTCA AAGCCTGGTT GCGAAGTGGA CATCGCAATG GTTACCGCGG CGACCAGCGA GCTGAACATG CGACCCAGTT GTTCGAGGAC GCTTTGCAAG CTGGAGTCGC TGTGGACGCG ATTTCCTATA ATACAATGTT GCGTATCTAC GCCGATTGTG GTAAGGGCGA ACATGCTCAA TCCCTCCTGG ATGAATGGTG TCGAGCTTAT AACCGTGATC CCTTTGTCGT CGCTGAACCC AACCTACTGT CTTTAAACAT TACGTTGGCA GCCTGGTCCA AATCGGCCAA ACACACTCCC GACGCCGCCT CCCATGCACA AACGTTGTTG CGCCGCGTAT GGGATCCATA CGACGACATT GGCAAAATCG GGATTCAACC CGATACTGTC ACCCTCAATA CCATTCTATC ATGTTGGGCT CGATCAGCAT GGCAACCATC CAATACCGCC GAAAAAGCGC AGACTATGTT GCAAGAAATG AAATCTTTAT TTGATAGTGG TGAATTGGAC 
GTGCAGCCAG ACGTATACTC CTACACAACC GCGATGAATG TGTTTGCTAA AGCCGGTCAT CCAGAATCGG CTGAAGAGTT GCTTAACACT ATGCATCAAA AATACCTTGG CGGCGACAAA AGAATGATGC CAACGTCTCT TATGCATGTT TCCATCCTCG AAGCATGGGG GAGATCTCTA CACCCTGATC GGATCGAAAA GGCGGAATAT GCTTTGTCCC GAATGGAAGA ATTGCGCAAT TGTGGATTCT TGACGACGCA TACCGCAACA CATAGCAATG ATTCAAACGC GGCAGCCGCT TACAATGTTA TGCTGAATAC CTACGTCAAA GCCGATGCCG TAGACAAGGC GGAAGCACTG TTGCGTCGCA TGATGGCTCA TACGGATCCC GTTCTCCCCC CACCCGACGG CCGGACTTTT TCTATTGTTA TCCTTGCCTT GGTTCGTTCG CCTGATGGAA CTCTTCGTGC CGCGGAACTA CTAGATAGCG CATTAGAGCT TTACCGGAAA TCGGGGCCCG GAGTCTTTGA GTTTGATATC CGACCCATCA ATGCTGTTAT CAGTGCTTTG AGTCGCAAAG ATAGCACCGT AACACAAGCT GTTGCCTTAA TACATCGCAT GTTGGATTGG GTTGAACAGG GCGACCAGAG GTTGCGACCG TGTGATGCGA GTTACGGACC GTTGCTGTTC GCATTACATA AAGCTCAGCT TGATATTGAC GGAGCGCCAT ATATTGCTGG CATTCTTGAG CGATTTGAAA AACTCCATGC AAGCGGTATC TTGCCAGCCC CTCCTAAATA TGAGGCATAC AAGTTTCAGG TCTTGGCATG GAGGTTTTCC AAGCAACCGA ACGCGGCCAA GAATGCCCAC GCGGTGCTCA AGTTTTTGAG CAACCAAGCT CGACAAGGTA AAAGGAACTA TCAGCCTGAA GCACTACTAT ACAATGCCGT ATTGGAAATA ATGGCCATGC ACCGATTACC TCTACAAGCA GAAAATTTGC TCCAAGAGAT GTATAACGAC TACTTGACCA ATCCGATCAA TGCCAAGCCA AATTTGCGAT CTTTCAATGC TGTCCTTTGG GCTTGGGCCC GTTTAGAACA ATCTTCAGTA GCGTTAGAAC ACACTCGTTC GCTGCTGGCT CAAATGCAGC GATTACATTC GTCGGAAGAC GAAGCCAAAG GACTCGATGT TGAGCCCGAC ACCGTTTCGT ATAATTGCGT CATGAGTGCC TGGATGAATT CGAGACACGA GGATGCACCG ACGCAAATCG AGCATATCTT CCGGCATTCG CTTTCGGTTG AATCTGTCAG AACCAAACCA GACCAGTACT CTTTTACGTT GGTCATACAG GGATGGCTGC GGGGTCGTAA TATTGAGAAA TGTACTGCCG CCTTGGAAAA CATGTGGCAG GCGTATACGG AGAATCGCAT CCGTGTATAC CCCAATCCCA GACTCATACA ACAGGCAATT GCATTGGCGA CAGTTGACGA TGCCGCACAG GTTGCACGAC TCGAAGCCTT AATGCTCAAG ATCAAGAAGT TGCCAGTCCG GCACAGACCG ACTGCCTGAC AGAACTCACG CCAGCGTGCG AAGCCGTCAA CGGGGTAAAG AAGTGCAAAG AAGGTATATT ACAGAAGCCT GTTGTAAGCA AATATAAATT GCTCCTTCCT GTATCTTTCT GTTGCAGTAT ACGTGCTGAC TGTCAGTAGC ATTATCTCCA ACAAATATTC TTTGGAAGTG GTCATTGACG ACTAGTCGCA ATTTCTTTCG TCAAACGCAA CAGCTTCTCG GTACAAACAC TCGTGCCAAA 
CGTCAAAGCT CCATAGTCGC GTGTTGTCTG AGTCGCTATG CTTCGCCATT ATTTGTAGTG GTTGTCCATC AAGGCAGTCC TGCAAATGGA TACTGCCGAA GAGGGAGCCG AGCTCAATGG AAAACTCCTG AAAGTTGAAC AAGTGCTGCA GAAATTCAAA CGTATATATG TCCTCCGGAT TAGTTCGCAG ACTTGCCTCC TTGCTAGCAA ATACTTTATC AAAGCCCTTT TTCCGAAAGC GAGCCCGTGA CATTGTTGTA CTAGCCGAAC TTGCTTTTCC TAGCAAAGTT TCTTCGTCAC GAGTTGGTTC TTCTAACTTG GCTTCAAGAT CGAATTCATC ATAGTTATTC TCAATACTTG ATGGGGTATT ACGCTGCACC CTGAGGGATT GCGCCGTGCT ACCGAGTGGC GTCAAACTAT GGGGATGGGA ACCTTCTAGT CGCGCGTCTA GTTGTGGTGC AAAGAAGCCA ATGACTTTCA TGGCCCCTTT GACGACATAT TTTGCGGGCA AACGAGCCGT CTTGCGAGTC AGCTCCAAAC CCGCCACACA ACGTGTCCAC GGGATACTTC GCTTGAAACG ACCGCGCATT ACCACTTGGT AACGTCGGTT CATACCCACA AAGTACCCGA TCTGATCGTT GTATGGTTCC GGTGTAGTAC CCAGTGTGTG ACGTAACCGG ACCAGCA
|
Protein sequence | MCDALCGTGA PIRDFFDLAG FDPPFPTTNC IIAVTAAFGF GILPPRPRRR YHLLFPACTL TLSSSLFLSS TSFAWMRSSL AGTTNALARR RNEEWKAAAT ISIRCAHRQH PTQSAPTSAC TCDVSSGWNP VSCRCYSTIE ALGKQLGRTR PSLQFWFSAR STAVQTRQYA NGDSLHAGAA ALRISTETPS ANSGSGAYDK RNTALVGRGA LRTFASTGST SVHDSVDTDG SLYHTNDAAD GVVGYGSELA QVRSHTQHLL HDLPPGFLSG NYLTEPVLSW ADASKILYWW IIRKTPESVE RSLEILSALV REVDRLQVED EDEDTSIRFL NPSLVESIVI NWSFCWKRYN SMKVTPIILY EKLQECAAQL SSLELSGKAL IHIIHGTVLA EREIVKSSSK TYATASFCDQ VFRSALLLPY SPDTLTLTLN LCLKAWLRSG HRNGYRGDQR AEHATQLFED ALQAGVAVDA ISYNTMLRIY ADCGKGEHAQ SLLDEWCRAY NRDPFVVAEP NLLSLNITLA AWSKSAKHTP DAASHAQTLL RRVWDPYDDI GKIGIQPDTV TLNTILSCWA RSAWQPSNTA EKAQTMLQEM KSLFDSGELD VQPDVYSYTT AMNVFAKAGH PESAEELLNT MHQKYLGGDK RMMPTSLMHV SILEAWGRSL HPDRIEKAEY ALSRMEELRN CGFLTTHTAT HSNDSNAAAA YNVMLNTYVK ADAVDKAEAL LRRMMAHTDP VLPPPDGRTF SIVILALVRS PDGTLRAAEL LDSALELYRK SGPGVFEFDI RPINAVISAL SRKDSTVTQA VALIHRMLDW VEQGDQRLRP CDASYGPLLF ALHKAQLDID GAPYIAGILE RFEKLHASGI LPAPPKYEAY KFQVLAWRFS KQPNAAKNAH AVLKFLSNQA RQGKRNYQPE ALLYNAVLEI MAMHRLPLQA ENLLQEMYND YLTNPINAKP NLRSFNAVLW AWARLEQSSV ALEHTRSLLA QMQRLHSSED EAKGLDVEPD TVSYNCVMSA WMNSRHEDAP TQIEHIFRHS LSVESVRTKP DQYSFTLVIQ GWLRGRNIEK CTAALENMWQ AYTENRIRVY PNPRLIQQAI ALATVDDAAQ VARLEALMLK IKKLPVRHRP TA