Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_14990 |
Symbol | |
ID | 7203702 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 611373 |
End bp | 612980 |
Gene Length | 1608 bp |
Protein Length | 467 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182871 |
Protein GI | 219125194 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTCGG TGGACTTTTA CCGGCGTGTC CCGAAAGATT TGACAGAGGT AGGCCGATTG CTGCAAAGAC CGTAGTTTGC TGAAATACAC AATGCGTTGT ACCCACACGC GGCAGCACAC TGTCTCTCAC GTTTCTCTTA TCTTTCTTTG GATGTCGTGT GGTAGGCAAC CAGTTTGGGT GCCATTATGA GTGTGTGTGC CCTGGTGGTA ATGGGGGTGT TGTTCCTTTC GGAAACGGCC GCCTTTGCGC GCACGGGTAT TGCCACGTCC ATTACATTGG ACGAGAACAC GTCGCCGCAA ATTCGCTTGA ACTTCAACAT TACGCTCACG GATTTGCAGT GCGATTATGT TTCGATCGAC GTGTGGGACG CCTTGGGCAC GAACAAGCAG AACGTGACGA AAAACATCGA CAAGTGGCAA CTCGACGCCC AAGGGATTCG AAGGATCTTT TCGGGACGCA ATAGGGAAGG TCGGGAAGTC GTACACGATA GTCACGATCG GAGTTTGGAC GAAATTCATT CGGAAGATGG CAAAGCCGTG GTAGACCTCA CGGCGGATAC CTTTGACGAT TTCATGGAAG AGCACGAAAT GGCCTTTGTG GACCTTTATG CTCCATGGTA CGTGTCCATC GTTCATAGCA ATGCCTCTAT TGCCTGCCTC GTACGCGCTG CAACTTACCA TCCGGTCTCA CCCTGTTTTT CTAGGTGCGT TTGGTGCCAA CGGCTCGCGC CCACGTGGGA ATTGTTCGCG CAAGAAGTCA AAAAGGAAGG CATGCCCATT GGTGTCGCCA AGATTGATTG CATGGCCGAA GCCGACTTGT GCCGCGCACA GCGCGTCATG GCCTTTCCCA CGTTGCGCTG GTACCACGAG GGCAAAGCGG TGGCGCCCGA CTACAAAATG GATCGAACCA TCCCGGCCCT TACCTCGTTC GCCAAACGTA AACTTGATAT GGACGAAAAG TTTAAGGAAT GGCATTCCAA AGCGTCCGAT AGTGCCGATC CAGCCGAAGT GGAAAAGAAA CGACAATTGT ACCAACAGAA CCGACCGGAT CATCCCGGTT GTCAGGTATC GGGACATTTG ATGGTCAACC GGGTGCCGGG CAATTTTCAT CTGGAAGCGA AATCCAAAAG TCACAACTTG AACGCCGCCA TGACTAATCT ATCGCACGTG GTGAACCATT TAAGTTTCGG GGAACCCATT GACGAAAATA ACCGCAAGTC GAAACGGATA CTCAAACAAG TCCCGGAGGA GCACCGACAA TTTGCACCCA TGGATGGGCA AGCGTTCCTA ACCAAAGCCT TCCATCAGGC CTTTCACCAC TACATCAAAG TCGTTTCGAC GCACCTGAAT ATGGGCTCGT CGGATGCGAA TTCCATGTTG ACGTATCAGT TTTTGGAACA ATCACAAATT GTCTTTTACG ACGATGTCAA CGTACCGGAG GCACGTTTCA GCTACGATTT GAGTCCGATG AGTGTAGTGG TCGAAAAGGA AGGGCGGAAA TGGTACGACT ATTTGACCTC GCTGTGCGCC ATCATTGGAG GAACTTTTAC GACCCTGGGT TTGATCGATG CGACGCTGTA CAAGGTGCTG AAACCGAAAA AACTGTAA
|
Protein sequence | MSSVDFYRRV PKDLTEATSL GAIMSVCALV VMGVLFLSET AAFARTGIAT SITLDENTSP QIRLNFNITL TDLQCDYVSI DVWDALGTNK QNVTKNIDKW QLDAQGIRRI FSGRNREGRE VVHDSHDRSL DEIHSEDGKA VVDLTADTFD DFMEEHEMAF VDLYAPWCVW CQRLAPTWEL FAQEVKKEGM PIGVAKIDCM AEADLCRAQR VMAFPTLRWY HEGKAVAPDY KMDRTIPALT SFAKRKLDMD EKFKEWHSKA SDSADPAEVE KKRQLYQQNR PDHPGCQVSG HLMVNRVPGN FHLEAKSKSH NLNAAMTNLS HVVNHLSFGE PIDENNRKSK RILKQVPEEH RQFAPMDGQA FLTKAFHQAF HHYIKVVSTH LNMGSSDANS MLTYQFLEQS QIVFYDDVNV PEARFSYDLS PMSVVVEKEG RKWYDYLTSL CAIIGGTFTT LGLIDATLYK VLKPKKL
|
| |