Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47869 |
Symbol | |
ID | 7202938 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 296103 |
End bp | 298532 |
Gene Length | 2430 bp |
Protein Length | 721 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182197 |
Protein GI | 219123782 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACATTTTCG TTTACAGTTT ACTGTTACAC TGATTTCCCT GGTGAAAAGA ATTCTTCGAT GACGCAAATC TTGATCTTGA CTGCCCTCCT TGCCGTCATG GCAGAGGCTC TCACATTCTC TCGCAGTGAC ATGACAGGGA ACAATTTTGC GGGCGATATT TTGACATTCA AAGATCGTCT CTACGTCCCT TACGGTCCAG ATTTGATTTC TGATCCCCCA CAGACGGACC AACCTTGGAC GGGATTTGGC TACGGATTGG GAGCAACGGA GCATTGGGCG TACGATCATA AGGAGAAGTA CATTTATTCC CAAAGCGAAG CTGGTGGCTA CGTAACAATT ATTGATTACA ACGCTTTGCC AGGAGTAGTG ACACCTTATA GTATGAATGT TGGTGGACGC AATGTTGACG TGCGAGATAT TGTCGTCTGT TCAGAAGAGG GACTACTCTT TCTGACGCTT ACTGATCGAA GCAAGGTGCT TATGTATGAA ACCGTGAAAC GGAGCTCCCC CGGAACCCCA ACGTTGCTCT CTGAGATTGA TGCCGGAAAC TCTCCAGATG CCATGAAGTA AGTTTTTGAA GGATTTTAGC GTGTCACTTT CAAGTTAACA CTCTGATCAT TACCTTGTGC TGAACCTGGT TGCAGGCTCT CGAACGATTG CAGTATACTT GCAGTTGCAA ACCAGAACGA GGGAACCTCA GTTTTAAATC AAGGCGCTGT CACTTTGGTG ACCAATTTTC GTTCAGCAAG TGGACCCGAA ACAAAAACTG TGCTGCTCAA TACCTTTACG GATGAGTACC TGTTGGGTCG TGAAGTGCAC ATGCCGTTGA CACGAAATGC CATGATATAC TGGAATGCCG AGCTCGGATT GGGCTGGGAT ACTCCAAATG GTCTGATTGA TCAATACAAT CCAGCCCTTG CGTTCGACCC TGAATTCTTG GCGTTTAACA ACGACGGCAC GGAGCTTTAT CTAAATCTTC AACAAAACTC TGCAATGGTC CGCATCAGTA CCGCTACCGG TACTGCTTTG TCCGTTGATG GATATGGCCT TAAAGATCTC ACCGCCGGAT CCGGTGCTGA TATTGTCAAA GACGGCGAGT GCAAGCTTGT GACCAATCCT TGTCTTTTCC TCGCACGTTC ACCGGACGGT ATTGCGACCG TAGAGTACGA AGGCGTCAAC TATGTACTAT TGGCTGAGGA GGGAAGTGAC TTTGATCTTG GTGACTATGA AGAAAAGGCT GACTCGAACG ATATCTTTCA AGGCAATGGA ACTTTTGCGT ATTCTAATTT TACCTTCGAT GCGTCTTTCT TCGCGGAAGG TGACTCCAGC GCTGGTTGCT CCGCTAATTT CAATGCGGAG TGTGAAAGCA ACGATCTTCC TTGGTGCTCC AACTTTGAAC TTACAGTTGG ATCGTCAGCT GTCGACTATA CAGACCCAAC TGCTCCCAAG ATGAACCGCA TCGTTGGGTT TGGAGGGCGA GGAATCTCAA TCTTTCGAGT ACCATCTAAC GTTCAGCAGC AAATTACGAT GGTGTGGGAG TCCGGCTCCG AGTTTGAGGA GCGTACCTGC GCCGACTTTC CGTGGGCGAA CAACGCTCTT ACGGACGAAG AGTTTGCTCC CATTTGTACC GATTCAAACC AAGACTTTGA GTGCGCTCGT TGGATTCTTG TTTCCAACGA CGATCGGGAG GGCATCAACG AAAGGTAAGT CTGTCGCATC TAAACGCCGA GCATTTCTTG CGAAGAGTAC AGCTATGTCA TTCTCACCGC GGGTTGAAAA CACGAATCAG AAATGATCCT CTTGGCGACG GATGTACCTT TAACAATGGT AGCACTGGTG CATGTCCGAT GGGATCAACA GTGGATACAA AGGCGCAGCA AGACGGCCTA GGCGTCGAAA CAGTTGTTGT CGGAATTGCA TGTGACCATC TCGTTGCTCT GGGTTGTGGC GAGAACAATG CGATGTGCTT TCTATACGAC ATATCCGACA TTGAGTCTCC GGTCCATCTC AAAACTTTCA ACTTGAGCCC GTCATCTCGC AATAGAAACC CCGAACAGTC TTATCTCGAC GATCTTGGTG ATATTGATGC TGAAACGATC CAGTTTATTT ATCCCGGCCA GAGCCCTACC GGAAAGTCTG GATTTATATT TGGCGGTGCC ATTAGTGGTA CCCTCTCTTT CTGGGAGTTT GAGTGCGCTA GCGAAGAAAC CGCTCAAAGC GGCTCTGGTG GTGGGCAGAG CCAAGAGTTA AGTGACAGCG ACGAGAGTTT GGAAGGCGGG GCGATCGCAG GAATAGTGAT TGGATCGGTT GTCGGCTTGG CTTTGCTTGC TGTCATTGCC TTGAGGGCCA TGGGAGGAAA CAAGAAAGAA ATAGACACGG GCAAAACAGG CAGTAGCGAC CATACCGAGA CCGTAGATGG TCTAGCTTAA
|
Protein sequence | MTQILILTAL LAVMAEALTF SRSDMTGNNF AGDILTFKDR LYVPYGPDLI SDPPQTDQPW TGFGYGLGAT EHWAYDHKEK YIYSQSEAGG YVTIIDYNAL PGVVTPYSMN VGGRNVDVRD IVVCSEEGLL FLTLTDRSKV LMYETVKRSS PGTPTLLSEI DAGNSPDAMK LSNDCSILAV ANQNEGTSVL NQGAVTLVTN FRSASGPETK TVLLNTFTDE YLLGREVHMP LTRNAMIYWN AELGLGWDTP NGLIDQYNPA LAFDPEFLAF NNDGTELYLN LQQNSAMVRI STATGTALSV DGYGLKDLTA GSGADIVKDG ECKLVTNPCL FLARSPDGIA TVEYEGVNYV LLAEEGSDFD LGDYEEKADS NDIFQGNGTF AYSNFTFDAS FFAEGDSSAG CSANFNAECE SNDLPWCSNF ELTVGSSAVD YTDPTAPKMN RIVGFGGRGI SIFRVPSNVQ QQITMVWESG SEFEERTCAD FPWANNALTD EEFAPICTDS NQDFECARWI LVSNDDREGI NESTGACPMG STVDTKAQQD GLGVETVVVG IACDHLVALG CGENNAMCFL YDISDIESPV HLKTFNLSPS SRNRNPEQSY LDDLGDIDAE TIQFIYPGQS PTGKSGFIFG GAISGTLSFW EFECASEETA QSGSGGGQSQ ELSDSDESLE GGAIAGIVIG SVVGLALLAV IALRAMGGNK KEIDTGKTGS SDHTETVDGL A
|
| |