Gene Information |
Locus tag | PHATRDRAFT_29381 |
Symbol | |
ID | 7203516 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 476793 |
End bp | 477994 |
Gene Length | 1202 bp |
Protein Length | 329 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182542 |
Protein GI | 219124504 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.575053 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGATTTACGT CGTACCATGA AAGGGTGCTC CATCGACGGA ATAAGAGCAT TAGCTTACTG TTAATCGCAA TCTTTCCTTG CAACGCAGAA GCGCCTTACT TCCAGTTTCT GCACTCCAGC GCGCTCGACT ATCAATGGCA AGAAACATGT TGACAGAAAA TGAAAAGACA ACGATTCGGG CAAGGGCAAC AGAAACACTG ATAGCTTTAG AACTTGACGC CAGTCGAGAC CCCCAAGCGT CTTCATGGCT GAACAAAGCT CTCCCCGCAT CGCGATCTGA GCGATGCCAG TATTTTGATA CTGAAGGATT TTTGCACAAT CCAGCATTTG CCACAGACTC GGAATGTGCT GCAATGAAAG AACAAATGAG AGACTTGGTG GAATCGTGGG ATCCGTCGCA AGCCCTGGAC TCATTCGGGA CCGACGATAA GCAGAATTCC AAAAGAGGCG ACTATTTCCT AGATTCTGCA GACCAAGTGC ATCATTTTGT TGAGCCCGAC TCTCTAGACG AAGATGGGTT GCGCCTGAAG CCGGAGAATC AATCGGACAA ACTCACTGCT CTCAACAAAT CAGGTCACGC CCTGCATTTG ATGCCAGGGG CATTTCACGA CTACTGTACA TCCGAAAAGG TTCGGTCGCT TGTTACCGAG ATGGGATGGA AGGATCCGGT CGTGCCGCAG AGTATGTACA TTTTCAAGCA AGCGCGAACG GGGGGAGTGG TCAATTCACA CCAGGACAGC ACGTTTTTGT TCACGACTCC TCGACAGTCC TGTCTCGGTC TGTGGCTGGC TTTGGACGAT GCAACACTAG ACAATGGGTG TCTATGGGTT CGACCTAAAT CGCATAACGA GGCCACTCGG AGACAGTATA AACGCAACAC CAAATACTTT GGCTTGGAGT CAATCCAAGC CCGAAGCAAC GAGTGCACGG GGGATTCTTC CGAGCAAAAA TTTCTTATGG AGACACTGCA TGACAACAGC ACTGATTGGG AAGGTGCTGT TCCCGCAAAC GGCTGGCAAG GTTTGCTAGA CGAAGGTTTT GTTCCAATTG AATGCAGGAC TGGCGATTTG TTAGCCTTTT GTGGCGAACT CGACCATCTT TCACTTGCAA ACCAAAGCAG TCGTCCACGA CACACATTCC AACTTCATCT GGTCGAAGGT CCAGAGGCGG ACGTAATATG GTCTCCTTTC AATTGGCTAC AG
|
Protein sequence | MARNMLTENE KTTIRARATE TLIALELDAS RDPQASSWLN KALPASRSER CQYFDTEGFL HNPAFATDSE CAAMKEQMRD LVESWDPSQA LDSFGTDDKQ NSKRGDYFLD SADQVHHFVE PDSLDEDGLR LKPENQSDKL TALNKSGHAL HLMPGAFHDY CTSEKVRSLV TEMGWKDPVV PQSMYIFKQA RTGGVVNSHQ DSTFLFTTPR QSCLGLWLAL DDATLDNGCL WVRPKSHNEA TRRQYKRNTK YFGLDTDWEG AVPANGWQGL LDEGFVPIEC RTGDLLAFCG ELDHLSLANQ SSRPRHTFQL HLVEGPEADV IWSPFNWLQ
|
| |
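The derived fields in this record (Gene Length, Protein Length, GC content) can be sanity-checked against the coordinates and sequences listed above. The following is a minimal sketch, not the database's own method. It assumes that Start bp and End bp are 1-based and inclusive, which agrees with the reported 1202 bp, and that the spaces in the sequence fields are only display grouping. The function names `gene_length`, `strip_groups`, and `gc_percent` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def strip_groups(seq: str) -> str:
    """Remove the whitespace that splits a sequence into display blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = strip_groups(seq)
    if not bases:
        return 0.0
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the record above.
print(gene_length(476793, 477994))  # 1202

# First two display blocks of the protein sequence, joined into one string.
print(len(strip_groups("MARNMLTENE KTTIRARATE")))  # 20
```

Passing the full Gene sequence field to `gc_percent` and rounding the result gives a value that can be compared with the GC content field. Passing the full Protein sequence field to `strip_groups` and taking its length gives a value that can be compared with the Protein Length field.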