Gene Information |
Locus tag | PHATR_33631 |
Symbol | |
ID | 7204072 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 1418941 |
End bp | 1420301 |
Gene Length | 1361 bp |
Protein Length | 400 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186249 |
Protein GI | 219113331 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTAGCG AACGAAGTTC CCCTTCCGGC TCTAAGGAAT CAACATCTCA AGATGTTGCC GATGTAGCGG TGATTGGAGC CGGCTGGTGG TCACAAGGAT GGCATATACC ACAGTTGAGT CGGAACACTC GGGCCAATCT CGTAGGAATT GTCGATTCTG CCTCTCAGCC CCGATCGAAT CTCAATCCTC ATCTCGAGTC GCTCGAAGCG CTGGCACGAA AGTACGACAC TGCCGTATTT TCCTCCGTGT CGGACCTTCT GGCGAACACA CCCACTTTGG ACGGTGTAAT TGTGGCAACG CCGCATTCGA CTCACTACAA CATTGGTAAA GAAATCTACG ACGCCAACAG GAACAGGGAA AAACCAATCC ATATTCTCAT GGAAAAGCCC ATGACAACCA ACATAGAGGA AGCCTACCAG CTGCACCAAC TGGTGGCGTC CCGTCCGGAG GTTTCCTTCC TAATCAACCA TTCGGCCAAC TACCGTTCGC AAACAAAGGC AGCACATCAA GCTGTCCCCC AACTAGGGAG TTTGCGGCAT GGCTCGATCT TCATGGCGTC TGCTCTGAGT TGGATCTTTG TACGTCCATG GAATGTTGGC TAGCGTTGTG ATGCGACTCG AGGGCTTGAA TGTGAGCCAT ATCTTCTCGG GGTTTGCATT GTCATTGCGA GCTTCTGTTA CTTTGGTCAC TCACTGTCCT ATGTTTTATT CCTTCTTTGG CTCGTTCACG CATCTAGGAA GACCCTTCGA ACACTGGTTG GAACGTTCCG GGGCCTGACA TGCTGGGGAA CGGCTTTGCG TGGGGTCAAT CGTCGCACGT CTTGGCTTGG GTGTTCTACG TATGCCCGCA ACTTAGCCCT ATCGAGGTAT ACTGCCGCAT GACGCATTCT GCCGCCACTG GAGCTGATGT TGCACTTTCT GGAACGATCA TTTGCCGAGA TTGCCACTCC GACAATGAAG TCATCCTTTC CGTTGCCGGT ACCAGTTTAC TTCCGGGCAA TGAGCACTCG GACCCTCCGG TTGGCAAACA AATTCAATTC AAACTCTACG GCGAAGACGG TGCTATTATC TACTGCGGCG ATACACGGGA TGAGAACACC GGAAATCTGG AACTGCGCCG CGTGTCGATG GATGGTGTCG TGGAGTTGCC TGTTGGTCCG GGATTTGCCT TTGAGAATTT AGAAACAGAA CATGATGGGC CTGAGTCATT GCAGGCTTTC TTGGATGCTT GTGTAGGAAA ATCAACGCAT GTGGGGGCCG ACAGTTTGGT TGGACTCAAA ACGGTTCAAG TATTGGATGC CATGTACCGC AGCCACGCTT CCGGTCAATC CGAACCTGTA CGGCATGCGG ATGCGGAATA A
|
Protein sequence | MRSERSSPSG SKESTSQDVA DVAVIGAGWW SQGWHIPQLS RNTRANLVGI VDSASQPRSN LNPHLESLEA LARKYDTAVF SSVSDLLANT PTLDGVIVAT PHSTHYNIGK EIYDANRNRE KPIHILMEKP MTTNIEEAYQ LHQLVASRPE VSFLINHSAN YRSQTKAAHQ AVPQLGSLRH GSIFMASALS WIFEDPSNTG WNVPGPDMLG NGFAWGQSSH VLAWVFYVCP QLSPIEVYCR MTHSAATGAD VALSGTIICR DCHSDNEVIL SVAGTSLLPG NEHSDPPVGK QIQFKLYGED GAIIYCGDTR DENTGNLELR RVSMDGVVEL PVGPGFAFEN LETEHDGPES LQAFLDACVG KSTHVGADSL VGLKTVQVLD AMYRSHASGQ SEPVRHADAE