Gene Information |
Locus tag | PHATRDRAFT_54708 |
Symbol | |
ID | 7202333 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 2430 |
End bp | 5884 |
Gene Length | 3455 bp |
Protein Length | 874 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181467 |
Protein GI | 219122261 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
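As a quick consistency check on the table above, the sketch below recomputes the gene length from the start and end coordinates and compares it with the sequence needed to encode the listed protein. It assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon. Both are assumptions, not statements from the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2430
end_bp = 5884
protein_length_aa = 874

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length_bp = end_bp - start_bp + 1

# Assumption: coding sequence = one codon per residue plus one stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# Whatever remains of the gene span is not accounted for by coding codons
# (for example introns or untranslated sequence).
noncoding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, noncoding_bp)
```

Under these assumptions the computed length matches the listed Gene Length of 3455 bp, and the span is longer than the coding sequence alone, which is consistent with the gene being marked as spliced.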
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGTTCTACAG AGTCTGTACA CTACGATGGT GTATACAACA CGGTGTCACT GTAAAACGCA TCGTTGCCCA GTGTGTAACG AGGGCTCCGT CCAATCCAGC AGGATACATA GGTATCGTGC TGTGGATGGT TCCAAGAGCA ACGTGGTGGA TATATATATA GATAGATGGT CTTGGAATAC GATGCACTCG GGGGAGCGGT TGACGGACAA CCAACCCCTG CCGTCGAATG CACGTTTTGG AATCATCTCT ATTCGAGTAT GGGTGGTACG GTACACACCC GTCCATCGGC TAGTTGGTAG TAAGTAAGTA CACTCCGATG TTTGGGTACG GCATGCATAT TGCGACGACA TCCGGAATTG ATTCTTGTCA AAATTGTTGG TGATTTTGGA CGTGTCTCAT ATATCCATAT ATGCGCTGTT TTGGCATGGG TCCATCGTGT ATATAGAACA AGGTCGGACT TCTTTTCTAT GGAAGTCCCA AGGAACGGAA CACCCTGGGT GTAGGAAAGC CCTTTTCCTG GTTTGGTACG GTTGCGAACG ATTGTATCGT GCTGGGTAGA TTCTTCCGGG ACACGAGATT TCCTATGTGT AGACGGGACA TCTTCATGTG CTATCCGAAG TCGTCCGTAC CCCCTTACAC CCCCACAAAG CTCGAAGTGG GGGGGAATCG TAGGGACGAG CTTTTCCAGG GATTTCCGGA CGAAGTGGGC GCTGCTTCGG ATACCATACC GGCAAGGCAG TATGGTATTT GGCATTCTTG GAAAGTCCCC AAACGACGAA TATTTCTTCC GTTGGCGATA CGTTGAAACG GTAACACGTT TCCAACCCCA AATGGTTCCA ACGGTCGTTT GGAGTCCCGT ATACTCGGTT TCCAACGCTC GTTACCAATC AGAGTAGGGT ATTTCTATCA ATTCGTGACT CCTTCTTCCG GTCTCGTCCA AGGGTCGGCG GCGCGCGGAA AAAACAAAAG TCACGCCCGA CCCAACGGAC GCTACGGCAT TGGTAGATAC CGCTTGGGTA CAACGTCACC CGTAGGTGTG CAAGGAACGC CTCGGACTCT TCTGGTACCC AAGCCCCGCC AGTCCAGTTG GCACGAGTCG AGAGGTCCCG TCCGCGCCGG CCCCAACGTA CGCGTCGTAC GTACCGAAAC GCACTGCAAG TCTTGGGAAC GAGCCGAGTG TTGGCCGGCA TGCGGACCAG GAACCCCGGG AGCCGACGTT CCGCGACCGC GAAAGGCTTC TCGTGTCTCG CGTGCCGGAC AATTTTCGTC CAGCGGTTGA GGGGGTGCGG GTAGGTTGGT GTAGTTTTTG GGAACGGTAC CGTACCGTAC GGTTTATACG AATGGATCCG GTGCGGTGCG GTACACAAAA CCGTTTGGGC GGAGTCCTTT GGTTCTTCGT TCGCAAAACA CTGGCTTGTT TTTGGCTCCA CTGGTCTTGG ATTTACATTA CCACCACAAG TACACACCAA TATACACCAA CACATACACA AGAGACAATG TTGGGTTCCA GAATTGCAAG CACTACGGTG TTGGTTACGG CGTTGTTGGC GTGGACCATG CCGGAGACGC AGGCTCGTTT CACTCCGATC GTCACGTACG CCCCCATGAC CAAGGTGACG GACGTGGCGG CGTTGGACCT CGACCAGAAC GAACTGGAGC GACAACTGGG GGTCGGCGCT TTGAATAACG CCCGTGCCGT CTACGAACGC GGCGGTAACT CGTTGTCCAT TGCCAATCTA ACCCTCGTCA ACCCACCGGG TCCCATGTCG TACCCGCAAG GAACGCAAGT TCTCGGCCGA 
ACCAAGGGCT CGGCCAACAC CACCGTTCTC GGAATTCTAG TCGACGAAAG CGTGACGTGG GGTGCCAACC CTGGAAACGT TACCATCCAC GTGCAGTACG TTACTTCCAC GAACCAAAAG ACCTACTCCG ACTGCCAATC CGGTGGACTC TGGACCTTTA GTGAAGCGAA CCTCAACGGT TGTAAGTGCC AACGGACGGG AACGCTGAGA CTCTTACCTG TCGCACGTGC AATATTCACA GACTCACCCT TTGCCGTTCC CCTTGCTTCT TCGCGTGGCA AATCGTATCC CAGGCTACAA TGATCAAGGT CATGTTACGC TGGTGCCCCC CGGTGAATCT GCGAATCGCG GCGACAAGCT CGCCTACAAG TACGATATTC GACAGGACAA CTGGAACCTG CGTACGCTCC AGAGTCTCAG TACCAACGCG GATACTTCCA TGAGACCTTG CCCCGGATGC CCCTTTTACG CAGACTTTCA GACCTTTGTC AACTACTACG AAACGACCGA GTACGCCGAC TCGTGGATCA TGGCGGCGGC ACTCAACCGA TCCGTCGCCT TTCCCTCCGG TCGAGGAAAC GCCAATTTTG GCGCCGCCTT TACCAGCGGC AGCAAACTTG GACTCGCCGA AGCCGTCAAG AAGGGAACAG CCTACATGAG CGTCTTTATG CAAATCATTC GGGATCTCGA AGACGCCATG ACGGACTGTG ACAACAAGTG CGACCCCGAC GACCTCTCCT GTCAGGATAT TGGAGCGGAA GCGGTCCATT CATTGGACGC CGCTGTCGCT CTCTACGCCG GATCGCTAGA AGGGACTGAT GGGACGGAGA ACAGCGGAGT CATGCCTTAT GCACTCGCCA ACAGACGGGC CGCTGATTTC CGCACCGGTG GACCTCAAGG CGAACGCAGC TCCGGTACCG CCCACGTCAA CTACGAGGTT ATCAAGGAGT TCAAGCACGC ACAGACCAGG CTCCTGGCAG ATGAGTGCAA TTTGGCGCGT CAAAACAAGT GGCGCATTGT CAATCTCATG AAGGTCCCGC TGGTCCAGGG TGTCCTGCGG TACGCCTACA TGCGTGATTT CAACAACCCC AGTGACGCGC CCACCAGAGA AAAGGAGCAA GCCGAGGGCG CCACATTTGC CGCCGCACTC TTGCCATTCG TGCACAAGTG CGATACCGCA GACGCTACTT TGATTTACAA CAACATGCGC ATCGGATCCG ATGCGAATAA AGTCGTCTTT GCCGATGTCA AGGCCGCCCT GGAGCGCAAC TACGGTTGCA TGGGAATCAG CTGTCCTCTG GTGGGCGGAT TCCACAATGG ACAAGACTAC GAGCTGGGTG CGACGCCGTG TACCGAAGGA ACTCTACAGG GCCCCGCGGC ACCCGATTTC TCCCAATTAC CGTCCTCCAG CGGGGGTTCG GGCGGAGGTA ACGCTGGTGT GGCGGTCGGC TGGATCCTTG CGGTATGTCT GGCTGGTGTT GTGGGCTTTA TGGCCTACCG ACGATTCGGC AAACGCAAGA ACATCGCCGA CGTCATGAAG CCGCCAGCGA ACAATTTGGC GGCCGTCTCC GAAATCGCCT AAGATGCACC GAGGATAAAA CTTTTAAAAG TGCATACCAA GGAGATACTA TATACTATAA GAAACTGGTT TTGAG
|
Protein sequence | MVYTTRCHCK THRCPVCNEG SVQSSRIHRY RAVDGSKSNV VDIYIDRWSW NTMHSGERLT DNQPLPSNAR FGIISIRVWV VRYTPVHRLV GSNPKERNTL GVGKPFSWFG TVANDCIVLG RFFRDTRFPM CRRDIFMCYP KSSVPPYTPT KLEVGGNRRD ELFQGFPDEV GAASDTIPAR QYGIWHSWKV CKERLGLFWY PSPASPVGTS REVPSAPAPT YASYVPKRTA SLGNEPSVGR HADQEPREPT FRDRERLLVS RVPDNFRPAV EGVRTMLGSR IASTTVLVTA LLAWTMPETQ ARFTPIVTYA PMTKVTDVAA LDLDQNELER QLGVGALNNA RAVYERGGNS LSIANLTLVN PPGPMSYPQG TQVLGRTKGS ANTTVLGILV DESVTWGANP GNVTIHVQPT PTANPVDSGP LVKRTSTVTH PLPFPLLLRV ANRIPGYNDQ GHVTLVPPGE SANRGDKLAY KYDIRQDNWN LRTLQSLSTN ADTSMRPCPG CPFYADFQTF VNYYETTEYA DSWIMAAALN RSVAFPSGRG NANFGAAFTS GSKLGLAEAV KKGTAYMSVF MQIIRDLEDA MTDCDNKCDP DDLSCQDIGA EAVHSLDAAV ALYAGSLEGT DGTENSGVMP YALANRRAAD FRTGGPQGER SSGTAHVNYE VIKEFKHAQT RLLADECNLA RQNKWRIVNL MKVPLVQGVL RYAYMRDFNN PSDAPTREKE QAEGATFAAA LLPFVHKCDT ADATLIYNNM RIGSDANKVV FADVKAALER NYGCMGISCP LVGGFHNGQD YELGATPCTE GTLQGPAAPD FSQLPSSSGG SGGGNAGVAV GWILAVCLAG VVGFMAYRRF GKRKNIADVM KPPANNLAAV SEIA