Gene Information |
Locus tag | PHATRDRAFT_37299 |
Symbol | |
ID | 7201946 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 664125 |
End bp | 666628 |
Gene Length | 2504 bp |
Protein Length | 713 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181244 |
Protein GI | 219121794 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000102085 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTTGGACT TTATCATCCC TGATAACTTT CCTCCTGACA ACCCCACAGT GGAGACTACG GAACCAACTG CAACCATTGC ACCAATCCCA AATCCTGATC CGCCTGAAAA TGTCAATGTC AACACCACCT TGGATATCCC GGACGCATTG AAAGATCTCC TTTCTAACGT ATCAAACACT GATGCCACAG TTTCTGGCGC CTACTACACT TGATACACTC ACTAGTTTCG TGCCTCTCTC TCGTCAACGG ATTATTACAC GTACCAAACT CAAGCATTCA AGGGACTCAT CAAGGAATTC AAGTTTGATG CTGATAACCC AATGGATGTT CTCACCAAGG CCAAGTCAAA CATGGAGGAA GCCGCTTTTG CTCTTACAGC AAAAGGTATC ATCAATTGAA AGAACAAATT GGAAAACTTC CTTACCGAAT TTGGACTTCG CGACCCATTT GATACCATTT ACACACAGTG GCAAACCTCC CCTAAGGGTC CCATTCCTGT CTTCACGTCA AGTAAAAGTC TCTTTCACGA CTTTCATTTC ATCTCCTTGT CCAACGTTGT CAATACGGTG GAATTCATGA AACAGTACAC AAACTTGACT CACCCAACCA AAGGAAAAAT CAACAAAGAA CATTCCCGTG ATTATTCCAT GTCCGGAACC GTTTTATACA ACTCATGTGA ACCGTCTCTC CAGTTGTGGT TAGATACCCA GATTAGTATC AGCACCGACA CCATTCTCAA ACGCCATGGC AACTCAGGAC CAGTCCGTTT TTATCTCATT TGGTCTCGCT ACGCCAATGT CGATGGAGCC GTAGCCACGT CTATTCAAAA CGCTCTTACC AAGCTTCAAG TGCGCGATCT TCCCGGTGAG AATGTGTCCC TTTACTTTGA CACCATTACC ATTATTGAAG AGTATCTTAG CTCCATGGGC CGTACCATTC CTGACTTTGT TTCACACGTT ATTGACGTTT TGATCAATGT GTCTGTTCAT GACTACTCCC TGTTTCTCAA GACACAACAG TTTGTCTCAA ATCCAGCGCT TCGGAATATA CATGCCCTTC GCCAGCTTGT CTGTGACCAA TACCAGCTGC TTCTCAATTC TGGCAAATGG CACCCTACAG CAAAAACTGG TGCCGCATTC CACGCTGTCA AGAACTTCTC CATTGAAACT GGTTTCCCCA ACGACACTCC CAACACCAGT GCCAATATCA ACCAGTCTCC TGGACATTCT AAGCCCCGAC TCTCTCGCGA AGAGTGGGAA AAGACTATTG ATCGATCTCC CCCGTCTCCG GGCTCCCCAG ACTGCCGAAA GTCGACAAAA GGGGATTTCA ACGAGTACTG GTGTGTCACC TGCAATCGCT GGGGCAATCA CCCCACCGAC AAAACTCGTC ATCCCACGGC AAAGCTAGAC CACACTCAAT TTCTCGAAAA ACGAAAGAAG CGATTCACTA AACGAGAGAC TCAAGACCCA TCTCCGGCTC CCAGTAACCC TCCTACTCCA CCACATGGCA TCAATTCCTC TGGGGCACTC CAATTCTTGT GTACTTCCCC ACTTACCCAG TTCCATTCCT TTGGCGTTCC CCCGGCGAAT TTTTAATGGC ACTGATCTTA CAGATAAGCC CTTTCCTTTC CTTTGACCCC GGTGGATTAT TGTTTCTTGT TGGTCTCGTC TGCCTCCTTC CTTTCCTTCT CTACCAGACT TGCTGCCTCC TTTTCCTTCT GGGGGTCGGG CGAGCTATGC TCCCATTTCT CGCTTCCTTC CTGCCCTGCT CCACCTGGAC TCCACACCGC ACTCATCGAC 
ACTCCAAATG GCGCCTCACC ACCGCTTTTC CTACTTCGTT TCTTCTCCTT TCCTCCGTTT CGGTCTCTCG CACTACAGCG ACCTTTCTCG GCACCCTTAA GATCACGGCT GTTACTCCTG CCGCACATCA GCTCCTCACC TTCCGTACTG TTTACCTACG TCCCTTTCAA CGCTGTCGTC GTCCTGGTTT ATCACACTCC CGCGCTGGTC ACTTCACCAC TTATGTCTCC AGCCGTCAAC TCTGTTTGGA ATACCGCACC CTTCTTAACG ACTTCAAACT GTGCCGTTTT TCGGGCCTCC TTGATCCTTC TCTGGCGTAT TTTGATCATC CACAATATGA TCTTGAACGT ATACTACCCT ACGGACCTTG GGAAGATGAT CCAACCATTG TTCCCTCCTT TTCTCCTCCT GTCGAACCCC TGTATGTTGT GGACTCCCGT CTCGCCTCTG CCCTGAGCAC ACGTCATCTC CAACAGCTTT TTGACTCCGC TCTACTACAA CACAACATGG TCTCCGAAAT CCGCCATGCC CACAGTTCCG ACAATGTTTG TTCACCCCTC CTACGGCCCG GTTGCGCCAC GGCTTCTATT GCTGGTGGCA CTCTCCCTTA TGACACCCTC TATAGTCATC GCTCAACCCC TTGGTCCATG GGGTATTCTC CTCCGTCCTC TCTACACTGT GCCTACGCTC ATGATAAAGT ATAG
|
Protein sequence | MLDFIIPDNF PPDNPTVETT EPTATIAPIP NPDPPENVNV NTTLDIPDAL KDLLSNFRAS LSSTDYYTYQ TQAFKGLIKE FKFDADNPMD VLTKAKSNME EAAFALTAKG KINKEHSRDY SMSGTVLYNS CEPSLQLWLD TQISISTDTI LKRHGNSGPV RFYLIWSRYA NVDGAVATSI QNALTKLQVR DLPGENVSLY FDTITIIEEY LSSMGRTIPD FVSHVIDVLI NVSVHDYSLF LKTQQFVSNP ALRNIHALRQ LVCDQYQLLL NSGKWHPTAK TGAAFHAVKN FSIETGFPND TPNTSANINQ SPGHSKPRLS REEWEKTIDR SPPSPGSPDC RKSTKGDFNE YWCVTCNRWG NHPTDKTRHP TAKLDHTQFL EKRKKRFTKR ETQDPSPAPI PFLWRSPGEF LMALILQISP FLSFDPGGLL FLVGLVCLLP FLLYQTCCLL FLLGVGRAML PFLASFLPCS TWTPHRTHRH SKWRLTTAFP TSFLLLSSVS VSRTTATFLG TLKITAVTPA AHQLLTFRTV YLRPFQRCRR PGLSHSRAGH FTTYVSSRQL CLEYRTLLND FKLCRFSGLL DPSLAYFDHP QYDLERILPY GPWEDDPTIV PSFSPPVEPL YVVDSRLASA LSTRHLQQLF DSALLQHNMV SEIRHAHSSD NVCSPLLRPG CATASIAGGT LPYDTLYSHR STPWSMGYSP PSSLHCAYAH DKV
|