Gene Information |
Locus tag | PHATRDRAFT_36397 |
Symbol | |
ID | 7201544 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 350619 |
End bp | 352936 |
Gene length | 2318 bp |
Protein length | 759 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180991 |
Protein GI | 219120508 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACGGACG ATGGAACATA TTCGGCTCGC TTTTTAGAAG ACAATGGCCT GGAACTTATT ACGGATGAGC AGGGTATGGT CACAGCACGA CCGCTAGGAC TCAACGATGG ACAGGTTCTT GAAAAGGCTG TATCGCAACC CCAACAACAC AAACAGAAAG ATCAAAAAGT GGTTGAATTT GATCGTGACG AAAGAATACC ATTGATCATC TCGACGCAGC CATCCATCGA CAGTCACGTA CAGACCGAAC AGCCGGTAAA ACAAGCCCCG GCGAAAAAGC ATCCAATAAC GGAGGCACCA GCAAAAGTGG TGCCGCAAAC AGGATTTAAT GTGGTGCTGA CACACTGTAC TGCAGATTTC GATTCACTAG CGTCGGCCGT CGGTCTCGCC AAACTCTGGA GTGCGCAAGA TACTTCCTCG TCAACAGCGG AAGCCAATAA AACCTTTGAT TCAGCGTCGG ACGTCCCTAC CTTTGTGGTC TTGCCGCGTG GCGCTCATCC TGGTGTTCAG CGATTCTTGG CACTGCACAA ACACTTATTT CCAATTCGTT CGCTCAAATC ACTACCTTCA GACTTATCGG GGTTGAATCG GTTGGCGCTG GTCGATGCGC AGAGACGGGA TCGTATCGGA CCAGCTGAGC CTTTGCTCAA ACATGCCAAG CGCATAACGG TTGTGGACCA CCACATCGAT CAGGACTTGA TATTCCAGCG ACTGATTATG TAGTGGACAA GGTTGGTTCT GTATCCACGC TTATTTCAGA AAGCCTTCGT AAATCGAAAA TTTTGTTGAC GGAGGCCGAA GCAACGTTGT TAGCATTGGG TGTCCATGCT GATACTGGCT CTCTTTGCTT CGATTCGACG ACGCCTCGAG ATGCTGTAGC GCTGGCATGG TGCTTGGAGC AGGGCGCCAG TCAAGTGGCC ATCGCGGAAC ACGCACAAAC TTCGCTCTCA GCTGAACAGC AAGGTGTCTT GACGCAAGCA TTGATCAATA CGAACTCAAC AGCTATTTAT GGCGTTACCT TGTCGACTGT GCTGTTATCA GCGGACGGAT TCATAAACGG CTTGGCTGCC GTGACGCAAG ATGCCATGGA GTTGAGTAGC AGTGACGTTT TTCTTCTAGC GTTAGTATAC GAAGCCCAAG CTGGTGGGCG TCGGCGAAAG CGAAAAGGCT CAGGGAGTTT ACTGACAAGC CGTTTGTTAA CCAAAGATAA GTCTTCCGCC AATCCAGGGG GAAACAGCAA CAATGACGCA CAGGTATTTG AGGCTGAAGC CTGGAAAGGC GGCCCCGAGT ACATTAAACA GCGTCGCTTA CGGTCTGCCT TTGATCGCAA AGACAACGAT GATAGCGGCT TTTTGGAAGT TGATGAAATC ACGGCTGCGC TAGCTTCATC GGGCGTGATT GCTTCCCCCG AAGCGGTAGC TGATCTAATT CAAGCTATCG ACAAAGATGG CAACGGCAAG ATTGACTTCG ATGAATTTGT TGCATTTTCT GAACAGGCTG AGACAAGACA ATTGGAAAGA GATGCCCTCA TGTCGAAAGG ATCAACGACT ATGATCATCA TTGGCCGAGT GAAAGCGGGA GTCAATCTCA AATCCGTCAA GCTGAACAAG CTGCTAGAAA AGTTTGGTGG TGGTGGCCAC GCAAAGGCAG CTTCCGCAAC TGTGCGTCTT GGTGAAGAAG CGGAAGCTTC GGATGTCTTA CAGGATCTAG TCGATGAGTT GATCGAATCT AGTTTGAACG AGCAACCCAC CGTCGGGGAC TTTATGACGG CTCCTGTTCT TTCCGTGCAA CCTGAGATGA 
CGGAACATCA AGTTGAAGAT CTGTTCACCC GGTACGACGT TCGTGCGTTG CCGGTAGTAA ATGAAGAGAA CGATGTGATT GGCCTCGTCA CCTACAAGGA AGTTGCGGCC GCGAAGCAAC GTTTGTGGAA CAAGGAGCAG AAGCGACTGC GAAGAGAGCT TGAGTTGGTA GAAAAGGGCG AAGTGGTGGA CGAGGCCGAT CAGAAGGTAG CCCAAGAGCG ACGCAAAGGG TCTACCGTCA AGGGTTGGAT GCTGCAACAC GTGCAGCTCG TCGAAGCTAG CAAGACAATG GCAGAAGTTG AATCCATTCT GCTTGAAAAT GACGTAGGAT GTATGCCGGT TGTGGCAGAC GGGACGAAAA AGCTGGTCGG TATGGTTACA CGAACTGATT TGCTTCGGCA ACACCGATAC TACCCTTCTC TGCACTATCA CAATAAGGGA TTCGCTAACT CGATCGCTGA CCGAAAACCG ATCATTGCTC TACGAAAGAG GTTAAAGCAA TTCGATATCG AGGAATAG
|
Protein sequence | MTDDGTYSAR FLEDNGLELI TDEQGMVTAR PLGLNDGQVL EKAVSQPQQH KQKDQKVVEF DRDERIPLII STQPSIDSHV QTEQPVKQAP AKKHPITEAP AKVVPQTGFN VVLTHCTADF DSLASAVGLA KLWSAQDTSS STAEANKTFD SASDVPTFVV LPRGAHPGVQ RFLALHKHLF PIRSLKSLPS DLSGLNRLAL VDAQRRDPHN GCGPPHRSGL DIPATDYVVD KVGSVSTLIS ESLRKSKILL TEAEATLLAL GVHADTGSLC FDSTTPRDAV ALAWCLEQGA SQVAIAEHAQ TSLSAEQQGV LTQALINTNS TAIYGVTLST VLLSADGFIN GLAAVTQDAM ELSSSDVFLL ALVYEAQAGG RRRKRKGSGS LLTSRLLTKD KSSANPGGNS NNDAQVFEAE AWKGGPEYIK QRRLRSAFDR KDNDDSGFLE VDEITAALAS SGVIASPEAV ADLIQAIDKD GNGKIDFDEF VAFSEQAETR QLERDALMSK GSTTMIIIGR VKAGVNLKSV KLNKLLEKFG GGGHAKAASA TVRLGEEAEA SDVLQDLVDE LIESSLNEQP TVGDFMTAPV LSVQPEMTEH QVEDLFTRYD VRALPVVNEE NDVIGLVTYK EVAAAKQRLW NKEQKRLRRE LELVEKGEVV DEADQKVAQE RRKGSTVKGW MLQHVQLVEA SKTMAEVESI LLENDVGCMP VVADGTKKLV GMVTRTDLLR QHRYYPSLHY HNKGFANSIA DRKPIIALRK RLKQFDIEE
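The coordinates and lengths listed in this record can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the protein length does not count the stop codon; the function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table of this record.
START_BP = 350619
END_BP = 352936
PROTEIN_LENGTH_AA = 759


def gene_length(start_bp, end_bp):
    """Span of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa, include_stop=True):
    """Bases needed to encode the protein, optionally counting the stop codon."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


length = gene_length(START_BP, END_BP)   # genomic span of the gene
cds = coding_length(PROTEIN_LENGTH_AA)   # coding bases including the stop codon
non_coding = length - cds                # bases in the span that are not coding

print(f"gene span: {length} bp")
print(f"coding bases (with stop): {cds} bp")
print(f"non-coding bases within span: {non_coding} bp")
```

Under these assumptions the span is 2318 bp, which matches the listed gene length. The 759 aa protein plus a stop codon accounts for 2280 bp, leaving 38 bp, which is consistent with the gene being marked as spliced.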