Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39534 |
Symbol | |
ID | 7195355 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | - |
Start bp | 72802 |
End bp | 74858 |
Gene Length | 2057 bp |
Protein Length | 572 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183668 |
Protein GI | 219126864 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCTTGA CTTCCTTGGG TAAGTCACAG AATGCCACAC CTCTAGCGAA GTCGGACCGA ACCAGCCCTA CGAGCGACCG TATCAAATCC ATCGCGAACA AATTTTCATC TCTCTAGGGA AGGTGTAAGG TCAGTCGACC GAAAACCATC GACGCTAATA CCAACGAAAC CTCTTCTGCA GTGCAGTTTA CTGTGAGCTC CTGAATTTTC TCGTTAGAGT ATTTCCATAG CTTTCGCTCT GTTCTATCCC TTGCGGCGAA CGAACGACAT CCAGACATAA AAATTGGGAT GCCTCCTACC GAACGAACCA GCTTGATGGA TCCATTGATG GGTGAGGAGG CGAATACAAT CAGAAACAAC AACAAAGGCA ACTACCAGGC AACCACGAGG ACGTCAATGT CTTCTGCTTC GGCTTCTCCT CCTACACACG CTCGTCATCA GTCGATCCAA CTGTCTCTGG ACGAGCTTGC ACAACCGTCT CCGCCCGCTT CGCCCTCCGC TTCTCAAAAC GTTAGAAACG CTGCCAAAAC CACGGCATAC AAACGAGATG TGACGTACAA TGGAGATAAT CACCTCGAAG TGCTCTTTCA AATGCACGGA TCCGTCTGGC CTCACGTTTT TCCCTGGTGC ATTGCGACGA TCGTATTCAC CTACGCAATC ATTCTGCTGC GGAACCACAA AATTGTCGAC CTCACCATTG ACAACAACAA TGGCCATTCT TTCATGAGCA TTCTCGTATC CTTTTTGGTT GTCACTCGGG CAACCATTAC CTATAATCGC TTTTACGAAG CACGACAATT CCTGGCGGAT TTGTTTCGTT CCAGTCGGGA AACCATACAG TACGCTTGTT TGCTCACCAC GCTGGACCGG GGGACCAAGG CACAACAATG GCGCCAAGAC GTCGCCTACC GGACTATTGT GAGCTTGCGT GTAGCTATTG CAGCCGTCGA GTTTCGGTCG CACGGAGTGA GCGCATGGGA GACCTTGCCG GATGAAGACC ACGAGTACAC ACCACTCCTT TTGCCACGAC AAGTACGTCA GCAACCCACT AGTATCTCGA CGACTGCTTT GCAAACTTTA CCAGCCAACA CTCCAACTGC ACGTGACAAG GACGGTGTGG ATGATTCGCT ACAACCGATT GGACAAAGTG CCTGCGCTAC AAAAGATCAG GAATACATTC AACAGTACAA GCCCGTCACG GACCATTCCC AATTTCTGGA AGCCATGCGG CACGGATCCC GTACAGTGAT GGACGAAAAT TTGCGAGCAC CTATCGTTTG GTGCTACAAT TTGCGGGAAA AGATTCTCGA ACCTCGCAAG GGTGGTATAC TGGTAACTCA CCCACCACAT ATCAACGAGG AGCTCCGATT ACTGGCCATT ACAAGTGATT GGCTGACCGC TTTTCACGGT TTGAAGATGC TGCTGACTAC GCCGTTTCCC TTTCCCTTCG TGCAAATGAC AAGAACTTTT CTGTTTGTGT GGGTCTTTAC TTTGCCAATG GTGCTGATTG CTGACAATGA TCAAACTTTA GAAGTGTTGG TACTTATGTT CTTCATCACA TATGGTTTTT TGGGTTTGGA GTATGTCAAT ATGGAATTGG ATGATCCGTA TGGCACTGAC CCCAACGATT TTCCGGGAAA GTAAGTACTC GCGTACTGTT AGTGACCCCG ACGTCATGAG TAGCAAATTG TGTTCTCCAC AAAATCCTGT TTATCAGTGC TTTTTTGTGG TTGTCGCTAA CTGCTTCTCC TGCCTGCTTG AATTCCTCTT TTTGTTTCAG ACGATGGGCA GAGCTTGTTT ACGAGGACAT TTACATTACC CTTTACAAAA CGGACGGATT CGATTCGGCC ATGGCCTTGC GGAATCGCAT AACGGAAAGA ATTGCGAGGG GGACGGCACT CGACAACTTT AACGAAGACA TGCACAATTC CAAGGCAAAT TTTTTTGGAA CCCATTCCTT TAAACACAGC TCACAGAAGA CACAGGCTAG TAGTGACTTG AGTTCTTTAC CCCCAAATCA AGACAACTCG AACAGGAATT TTGGAAACGT CGTTTAA
|
Protein sequence | MFLTSLEYFH SFRSVLSLAA NERHPDIKIG MPPTERTSLM DPLMGEEANT IRNNNKGNYQ ATTRTSMSSA SASPPTHARH QSIQLSLDEL AQPSPPASPS ASQNVRNAAK TTAYKRDVTY NGDNHLEVLF QMHGSVWPHV FPWCIATIVF TYAIILLRNH KIVDLTIDNN NGHSFMSILV SFLVVTRATI TYNRFYEARQ FLADLFRSSR ETIQYACLLT TLDRGTKAQQ WRQDVAYRTI VSLRVAIAAV EFRSHGVSAW ETLPDEDHEY TPLLLPRQVR QQPTSISTTA LQTLPANTPT ARDKDGVDDS LQPIGQSACA TKDQEYIQQY KPVTDHSQFL EAMRHGSRTV MDENLRAPIV WCYNLREKIL EPRKGGILVT HPPHINEELR LLAITSDWLT AFHGLKMLLT TPFPFPFVQM TRTFLFVWVF TLPMVLIADN DQTLEVLVLM FFITYGFLGL EYVNMELDDP YGTDPNDFPG KRWAELVYED IYITLYKTDG FDSAMALRNR ITERIARGTA LDNFNEDMHN SKANFFGTHS FKHSSQKTQA SSDLSSLPPN QDNSNRNFGN VV
|
| |