Gene Information |
Locus tag | PHATRDRAFT_49118 |
Symbol | |
ID | 7195339 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | - |
Start bp | 640618 |
End bp | 643660 |
Gene Length | 3043 bp |
Protein Length | 918 aa |
Translation table | |
GC content | 61% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183786 |
Protein GI | 219127110 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
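The Start bp, End bp and Gene Length fields above agree if the coordinates are read as 1-based and inclusive: 643660 - 640618 + 1 = 3043 bp. The 918 aa protein needs only 918 x 3 = 2754 coding bases, fewer than the 3043 bp gene span, which is consistent with the gene being marked as spliced. A minimal sketch of both checks follows; the function name is illustrative and the inclusive-coordinate convention is an assumption inferred from the numbers, not something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start/end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
length = gene_length(640618, 643660)
coding_bases = 918 * 3  # bases needed to encode the 918 aa protein

print(length)                 # 3043
print(length - coding_bases)  # 289 bases of the span are not needed for the 918 codons
```

The strand field ("-") does not change the length; it only indicates which strand of NC_011688 the gene is read from.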
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00139685 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACT TCCCTCACAA AGTCCTCGAT CCAATCGCCA CCACCACCGT TCCGCCGACC TACGCCACTC TCAAAGTGGC CCAACGTCAA CTCAGTACCA ACGCCGCCGC CATCCCTACG CTCAATGGTG GTGGCGCCCA CGGCCACATG GCCCTCACGC TTACCGCCCG CGCCTACGCC GACATCAGCG ACGTCCCATT CGACATTCCC GTCGCCCCTC CGGCCAACCC TCCCGTCGGC ACCACGCAAC CGCAAATCAC CGAGTTCAAC CGCATCCACC AACGCAATGC CGACGTCTAC AACCTGTACG TCGCTGTCAA TAATGCCCTC CGCCAGCAAC TTCTCGACGC CCTCCCGAAG ATTTACGTAC GCGCCCTTGC ACATCCCATT TTCGAATTCA GCACCGTTAC CTGCCTCGAC CTCCTTTCGC ACCTCTGGAC CAAATATGGT ACCATTAAGC CTGCCGACCT CCAGAAAAAT TTCCAATCCA TGTACACCCC ATGGAACACT GCCGAACCCA TCGAGACTGT TTTCTTACAG CTTGACGAAG CTATCGCGTT TTCCATCGAC GGCAACGACC CCATCTCCGA GGCCGCCGCC GTTCGTGCCG GCTACGACGT CCTAGCTCAC TCCGGCCTCT TCCCTCAGGA CTGCAAAGAC TGGCGCAAAT TACCCCTTGT TTCTCACACC CTTGCCAACT TCCATCAGCA TTTCACTCTC GCCGACGAAG ACCGGCGCCT CACCGCCACC ACTGGATCCC TTGGCTACGC CAATCTTCTC GCGGCCACTC CCTCTCTGGC TCCAGCCACG GTTTCCGACA CCCTTAGCCT TCCTTTCTCC GCGCTCTCTG TGTCCCAGAC TTCCGTCTCC TCTCCAGAAA TGACGTATTG CTGGACTCAC GGAACCAGCA AGAACCGGCG CCACACAAGC GCCACGTGCA AAAACAAGGC CCCTGGCCAT CGCGACGACG CGACGGCCAC CAACACTCTT GGCGGATCAA CCAAGATTTG GACTGCCCCC AGGCCTCCTG AATAGGAAGG AGGGACGGCT ACGCCGACGA TTAAAACTAG TAATACCGAT TCTTTACATC ATATTACTAG TCTTAATTCG TCTGTAGTCC CCTCCCCGCC TAGTACACAC ACCTCCGCCA TTGCCGACAC TGGCTGCACC GGCCACTACA TTACGGTCGA CTGCCCTCAC ACCCACAAGC ACCCAGCAAA CCCCAGCCTC GCCGTCCGTG TCCCAAATGG CGCCGTCCTC CGCTCCAGCC ACATTGCCAC CCTGGCCCTG CCTGGTTTCT CCCCTGCCGC CTGCCAAGCC CACATTTTTC CTGGGCTCGC TTCCCATCCG CTCCTCTCCA TTGGGCAACT GTGCGATGAC GGCTGCACGG CCACCTTCTC TGCCACTCGC CTCGACATTC ATCGCGACGC TACCCTGCTG CTCTCTGGGG CCCGCTCCCC CCACACTGGC CTTTGGCACC TTGATCTTGC CCCAGCTCCC TCTCCCGCGA CGGCCCATGC CCTTGTTCCA CACACACCCC TTGCCGACCG CATTGCTTTT ATCCATGCCT CACTCTTCTC CCCGGCACTT TCCACGTGGT GCCAGGCACT TGACTTGGGC CATCTCGCCA CCTTTCCGGA CCTTTCATCC CGGCAAATCC GCAAGCATCC ACCCAGCTCC TCTGCCATGA TCAAGGGTCA CCTCGACCAA CAACGAGCTA ACCTTCGCTC CACCAAGCTT CCCCCGGTCA GTCCTCCTAC CACAACGACA CCTCCCGTCG ACCACGAGCC TGACAGGGAT 
CCTCCCGATG CCCCACCGGT CACACGCACG CACCACGTCT TCGCTGCGCA CCAGCGTGTT ACCGGCCAAA TCTACACAGA CCAACCGGGA CGTTTCCTCA CTCCGTCCAG TGCAGGCCAC AACGACATGC TTGTGCTTTA TGATTACGAC AGCAACGCCA TCCACGTCGA ACTCATGAAG AACAAGTCCG GCCCGGAGAT ACTGGCCGCC TACAAACGCG CTCATACCCT TTTCACCCAG CGTGGCCTCC GTCCCCAACT CCAGCGTCTG GACAATGAAG CCTCTGCAGC CCTCCAGTCC TTTATGACCT CAGAACACGT TGACTTTCAG CTGGCACCCC CCCATCTACA CCGTCGTAAT GCAGCCGAGC GGGCCATCCG CACCTTCAAG AACCACTTTA TTGCTGGCCT CTGCACCACT AACCCGGATT TTCCATTACA CCTTTGGGAC CGCCTCCTCC CACAAGCCCT TATCACCCTC AATCTTCTTC GTCGCTCCCG CATCAATCCC AAGCTGTCCG CACACGCCCA GCTTCATGGT GCTTTCGATT ACAACCGCAC CCCGCTTGCT CCTCCCGGTA CTCGCGTCCT CGTCCATGTC AAGCCGTCCG TCCGCGAAAC TTGGGCCCCC CATGCTGTCG AAGGTTGGTA CCTCGGCCCC GCCCTGCACC ATTACCGCTG CCACCGAGTC TGGGTCACGG AAACACGTGC CGAACGCGTT GCTGACACCC TTTCCTGGTT CCCGACCCGC ATTCCCATGC CCACCGCTTC GTCCACCGAC CGCGCCCTGG CCGCCGCCCG CGACCTGATC CATGCCCTCC AGAATCCCTC CCCTGCGTCT CCATTCGCCC CCCTCGACGC CACCCAGCAT CAGGCACTCA CCCAACTTGC CAATCTCTTT GCCACCGTGG CCGCCCCGGC CGCCGCCGTC CCTACATCCG CTCCCACGCC TCCGGTCCGT CCTCCTGCCC CAGCACCTCC CCCTTCTCAG GTCCGCTTTG CCGTTCCTCT CGTCACGGCC GAACATGCCC CTGCACTTCC GAGGGTGCCC ATTCCGGCCG CCGCACCTCC GAGGGTGCCC ACCATAGCCA CCTATCACTC TCGCACCGGC AACCCAGGCC GTCGCCGCCG CAAAGCACGC ACACAACCGG CAACCCCAAC CCTAGTACCA GCGCATCCAC ACAACACCCG CACCCGGCCC TTTCTTGTCC CGGCCTCTGC CAACGCTGTT GTCGACCCCG CCA
|
Protein sequence | MSDFPHKVLD PIATTTVPPT YATLKVAQRQ LSTNAAAIPT LNGGGAHGHM ALTLTARAYA DISDVPFDIP VAPPANPPVG TTQPQITEFN RIHQRNADVY NLYVAVNNAL RQQLLDALPK IYVRALAHPI FEFSTVTCLD LLSHLWTKYG TIKPADLQKN FQSMYTPWNT AEPIETVFLQ LDEAIAFSID GNDPISEAAA VRAGYDVLAH SGLFPQDCKD WRKLPLVSHT LANFHQHFTL ADEDRRLTAT TGSLGYANLL AATPSLAPAT VSDTLSLPFS ALSVSQTSVS SPEMTYCWTH GTSKNRRHTS ATLNSSVVPS PPSTHTSAIA DTGCTGHYIT VDCPHTHKHP ANPSLAVRVP NGAVLRSSHI ATLALPGFSP AACQAHIFPG LASHPLLSIG QLCDDGCTAT FSATRLDIHR DATLLLSGAR SPHTGLWHLD LAPAPSPATA HALVPHTPLA DRIAFIHASL FSPALSTWCQ ALDLGHLATF PDLSSRQIRK HPPSSSAMIK GHLDQQRANL RSTKLPPVSP PTTTTPPVDH EPDRDPPDAP PVTRTHHVFA AHQRVTGQIY TDQPGRFLTP SSAGHNDMLV LYDYDSNAIH VELMKNKSGP EILAAYKRAH TLFTQRGLRP QLQRLDNEAS AALQSFMTSE HVDFQLAPPH LHRRNAAERA IRTFKNHFIA GLCTTNPDFP LHLWDRLLPQ ALITLNLLRR SRINPKLSAH AQLHGAFDYN RTPLAPPGTR VLVHVKPSVR ETWAPHAVEG WYLGPALHHY RCHRVWVTET RAERVADTLS WFPTRIPMPT ASSTDRALAA ARDLIHALQN PSPASPFAPL DATQHQALTQ LANLFATVAA PAAAVPTSAP TPPVRPPAPA PPPSQVRFAV PLVTAEHAPA LPRVPIPAAA PPRAVAAAKH AHNRQPQP
|
| |
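The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any length or composition check. The sketch below strips the whitespace and computes a GC percentage; the helper names are hypothetical, and the input is only the first 25 bases of the gene sequence, so the result is not expected to match the 61% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the bases of seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 25 bases of the gene sequence, kept in the record's block layout.
raw = "ATGAGCGACT TCCCTCACAA \nAGTCC"
seq = clean_sequence(raw)
print(len(seq), round(gc_percent(seq), 1))  # 25 52.0
```

Applied to the full Gene sequence and Protein sequence fields, `len(clean_sequence(...))` can be compared against the Gene Length (3043 bp) and Protein Length (918 aa) values in the Gene Information table.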