Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45852 |
Symbol | |
ID | 7200959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 468270 |
End bp | 469976 |
Gene Length | 1707 bp |
Protein Length | 536 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180054 |
Protein GI | 219118569 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0205914 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGCACGATTC ACATCAGGCT ACTCACGACA ACTGCGTGTT TTTTCACATC AGGCAACACA TTCTCTTTGT GATTTCAACC ATCCACCATG AAACGATCCA ATCGCTTGAT AGGAATACTT GCCGTTGCAA CCTCCACAGC CGTGGTTCTC CTGCAGACGT CAAGGATTGA TGCCTTTGCA ACGGCAACTC TCCCTTCGAC AACGTCTTAC CGTATTCCGG CGTCATTAAA AACTACACAA ACATCATCTC GGGCCTTTCT CACGTCCAAA AAGAAACTCC ATTCGTTGCG GTCTAGCGAT AGAATAGCCA CATCCTTCTT ATCAGCCGTC GTTGTCGACA CTTTGTATAG TACTACTACG GACACTGGAG GGAAACTACA ATCCAAGAAA GGATCCTCCA AAACATCATC TTCCCAACCA AAGAAGGAAG GCCACAATCT CATGGACTTC CTCGGTCCAG TACTGTCCGC CGCACTCCTC GTAACGGGCA ATACCTTTGG GGCGGGTTGC CTTGTTTTGC CGGAATTGGC GAAAGGTCCA GGAATGGCCG CGTTTTCGTC TCTTTTTCTC GGTGCCTGGA CCATCAACTT GGTTTCCGGG TTGATACTGG CGGAAGTTGC CATCCGTCAA AAGGAAGCAG CCGTACAGCA GGGGGACGAT AACGTCGTGC CCAGCAGCTT CAAAGAATTT GCTCAGGTCA ATCTCAACTC TTCCACTGCC GCAAATGGTG TCAGCGCCAT TTCGGTCTTT GTCAATTCCT GTGTCATGAG TTTCGATCTT TCCCGAGTCG GCCAAGTAGG TGCCAATCTA TCCGGACAAG CCGTTCCCAA TGAGCTCATC TCCGCGGGCT GGGGTCTCTT GCTCGTCACA ATTTTGTCCA CGCAGTCCAG CAAGAATCTA AGTCGCGCCG CGAGCATGTG TGCAACGGTA CTGTTCCTGT CGTTTGGATC TTTATTGTTG CCCGGACTCG CACACGTGAC CGATCCGTGG CAAGTCTGGA CGGCACCAGC GGCGATAGAC TTTACCGGCA GTCTAGCGGC CGCAGCCCCC GTGGTACTCA TGTCCATGAT ATACCAAAAC GTGGTACCGT CCGTGACCAA AATTTTGAAC TACGATCGCG TCCAGACCGT GTCCGCCTTG ACCTTGGGAT CACTGTTGCC TCTCGTCATG TACGTGGCCT GGTGCTTCGC CTGGGTCGGT GGAGGCATTG ACACGACGAT CGGTACCACC GCTGGTATAG GTGGACTCCT CATGACGGTG TTTTCGCTCG CCACCTTGGC GGGATCCTCG ATTGGTTCCG GTATGTCGCT TTCGGAAGAA ATTGATACCT TTGTCCAGCC CAACGCTGCT CCCCGTACGA AGAGTGGTGA TACAGAAAAT GAACAGGCAG TACGCGGCGA GTACCAGATT CCTTCGGTGC TGCTAGCCGT CACGGCACCC TTACTGGCTG CCAACGTCAT GACCGCGACG GGTCATGATA CTACGGAAGC CCTGCGATTG GCCGGTTCGT TTGGATCACC ACTGCTGTAC GGCGCCATTC CTGCTCTCAT GGCCTGGCAG CAAAGGGCAC AACAGTCGCC AAAATCGCCG CATATGATAC CGACGGCTGG CCTCGGTGCC TTGGGACTCC TATCGACGGG ATTCGTCGGC AACGAACTGG TTGATGTTGT GGAGCGTTTC TTGGCCGTGC CCGTCTGAGA AATCCCA
|
Protein sequence | MKRSNRLIGI LAVATSTAVV LLQTSRIDAF ATATLPSTTS YRIPASLKTT QTSSRAFLTS KKKLHSLRSS DRIATSFLSA VVVDTLYSTT TDTGGKLQSK KGSSKTSSSQ PKKEGHNLMD FLGPVLSAAL LVTGNTFGAG CLVLPELAKG PGMAAFSSLF LGAWTINLVS GLILAEVAIR QKEAAVQQGD DNVVPSSFKE FAQVNLNSST AANGVSAISV FVNSCVMSFD LSRVGQVGAN LSGQAVPNEL ISAGWGLLLV TILSTQSSKN LSRAASMCAT VLFLSFGSLL LPGLAHVTDP WQVWTAPAAI DFTGSLAAAA PVVLMSMIYQ NVVPSVTKIL NYDRVQTVSA LTLGSLLPLV MYVAWCFAWV GGGIDTTIGT TAGIGGLLMT VFSLATLAGS SIGSGMSLSE EIDTFVQPNA APRTKSGDTE NEQAVRGEYQ IPSVLLAVTA PLLAANVMTA TGHDTTEALR LAGSFGSPLL YGAIPALMAW QQRAQQSPKS PHMIPTAGLG ALGLLSTGFV GNELVDVVER FLAVPV
|
| |