Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_24019 |
Symbol | |
ID | 7199201 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011697 |
Strand | + |
Start bp | 191978 |
End bp | 193524 |
Gene Length | 1547 bp |
Protein Length | 414 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185289 |
Protein GI | 219130265 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAGGCT TGCGTGTGGA AGATGCTTTA TTGTATCTAG ATGACGTCAA GCGTGAGTTT AGAGGTCGTC CACACGTCTA CAATGAGTTC CTGGGGATCA TGAAGAACTT CAAATCGCAA GAAGTCGATA CGCCTGGAGT CATTGCGCGG GTTTCCAAGC TGTTTCGGGG ATATAACAAG CTTATTCTGG GTTTTAACAC TTTTTTGCCG GAAGGGTATA AAATCTCGTT GGCAGATCTG GAGCGTCAAG AAGCCGAGCT GCGCAAGCTG GAGGCCATGC CACAGGCGCA ATTTCGCCAA GGACCACCTC CGGCGAACTC GGAAAACGGA CCATCGCCGG CCAGTCCCGG ACCACCGCCA CAACATCCTA CATCGCAACA GCATCCACAA CCAGCTTGGT CCCAATCATC AGAGAGTGCT CCCGCGCCAC CGAAGGCCAA AGGTAATGGG CCGACACATC CCAAAAAGAA GAAGGGCCCG CGCTCGTTAC CAACCAAGAA AGCCGCCGCT GCAGCGGCGC TACCTCCACC CCACCGCCAC CGCCCCCAGA AAATCAGAAC CAAACGGTGG AATTCGATCA CGCGATTTCC TACGTCACAA CAATCAAGAA GCGGTTCGTC CACGATCCTA GTACGTATCA CCAGTTTTTG GAGATCCTCC ATACCTATCA GAAGGAGCAA CGGGGCATCA AGGAAGTCTT AGAACAAGTT GCCCATCTGT TTCAAGATCA TCCGGATTTG CTCAAGGAAT TTACTTTTTT CTTGCCCGAT GCCGTTCAAG AGCAAGCCAA GGAACGCCTG CATCGCGCTG CGGCGGAGTC GGAAACGCGA CTCGCCGCTC AACGCCAGCA GCAATTGGAA CAATATCACG TTGATACTAA TGTCAAGAGT CCCGCGAATG CCGTCAATCG CCAGCCCAAA TTTATCGACA TGACACACGC CAGTCAGGCC GCTGCTCTGG CTCCAGGCCC GGAACCACAG CGTAAGCATC CTTTAGAAGG GGCGCCCCCG GCCTCGGCTA TGGCCGATCA AAGCGAATCG TACGTTTACA ATGCTGCTGT CGAACGCCAG TTCTTTGACG CTGCACGTGA AGCCATGTCA TTCAGTAGGG ACGGCGGACA GGCCTGGGCC GAGTTTCTCA AGTGTCTCGA TATGTACGCC CAAGAGATTT TGACGCGAAC GGAGATGTTG ACATTCGTGG AGCAAATTCT GGGCAAGCGG AACGCGAAGT TGTTGGAAGA GTTCAAGCGC ATACTGGTGT CGGCTGGATC TCCTGATGGC CCCGCTCCTT TGCTGGAAGA TTCGTGGTAT TCTGTACCGT TGTCTGAGAT TGACTTTTCT CGATGCCGTC GCTGCACTCC GTCGTATCGT GCGCTGCCTC GTGATTACCC TGCACCACCT TGTGGTGATC GAAGCGACGA GGAGGCAAAG GTTTTGAATG ACATCTGGGT CTCTTTACCG GTCGGCAGTG AGGAAAGCTA TACATTTCGA CACATGCGCC GTAACACGTA TGAAGAGACG CTCTTTCGAG TTGAGGACGA ACGGTTT
|
Protein sequence | MRGLRVEDAL LYLDDVKREF RGRPHVYNEF LGIMKNFKSQ EVDTPGVIAR VSKLFRGYNK LILGFNTFLP EGYKISLADL EQNQNQTVEF DHAISYVTTI KKRFVHDPST YHQFLEILHT YQKEQRGIKE VLEQVAHLFQ DHPDLLKEFT FFLPDAVQEQ AKERLHRAAA ESETRLAAQR QQQLEQYHVD TNVKSPANAV NRQPKFIDMT HASQAAALAP GPEPQRKHPL EGAPPASAMA DQSESYVYNA AVERQFFDAA REAMSFSRDG GQAWAEFLKC LDMYAQEILT RTEMLTFVEQ ILGKRNAKLL EEFKRILVSA GSPDGPAPLL EDSWYSVPLS EIDFSRCRRC TPSYRALPRD YPAPPCGDRS DEEAKVLNDI WVSLPVGSEE SYTFRHMRRN TYEETLFRVE DERF
|
| |