Gene Information |
Locus tag | PHATRDRAFT_47350 |
Symbol | |
ID | 7202405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 367919 |
End bp | 369337 |
Gene Length | 1419 bp |
Protein Length | 415 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181540 |
Protein GI | 219122414 |
COG category | |
COG ID | |
TIGRFAM ID | |
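The Start bp, End bp, and Gene Length fields above are mutually consistent if the coordinates are read as 1-based and inclusive (369337 - 367919 + 1 = 1419). The sketch below shows that check, plus one common way to compute a GC percentage from a nucleotide string. The function names `gene_length` and `gc_percent` are hypothetical helpers written for illustration, not part of any database API, and the inclusive-coordinate convention is an assumption inferred from the numbers in this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters (for example the spaces that
    split the sequence into blocks of ten) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the record above.
print(gene_length(367919, 369337))  # 1419
```

Passing the Gene sequence field from the Sequence section to `gc_percent` would give a value to compare against the reported GC content. The record does not say how that figure was rounded or computed, so small differences are possible.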
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATAATGGCA CTACCGTAAT TGTCTGATGC ACACCGTAGG GCAAACCAAA TCCACGTATC CACGAAACCA ACTGCTGAAT TTTTGGTCTT GTACAAGTAT TTTGCGACAA CCGTAACTCA CTTCTTGTAT TTCTAAAAGC TAGTCCGTAC AGAGGTTTCT GTTAATCTAA AATGGACTGG GGGGCGCTCG ACGAAGAAAG CTGCAACTGC GACGAAAGCC CGAGCCACAA GAACCGTGAT ACTAAAGACT ACTTTTCAAC CAGCAGTGAA AGAAGAGTTT CCGGTCCAGG GAACAGTCAA TTGTCGAAAC AGACGTCTGG CCAAGCTTGC GCCGATTCGA AGCTGAAAAA GCAGCCCAAG CAGACAGGGA AATGGAAGAA GCCCCCCGGG ATGCCCAAGC GGCCGCTTTC TGCGTATAAT TTGTTTTTTG CACACGAAAG GAAGCAGCTT ATTGCCTGTG GCGTCCTTGC CAGTGGCAGA CAGAAGAAGC ATTATACAAA GCAATCGACG ACTTTACGTA GACCTCCCCG AAAAATGGGA TTTGCAGGGC TAGCGCGAGC GGTTGCAGCC AAATGGAAGA TGATTGATGA TAGTACCCGG CATACCTTCA ATCAACAGGC AGAATGTGAG CAAGCAAAAT ACAAAATAGC AATCAAACAG TGGAACAGCC AGCATCTAGA TTTGTCAGAA ACGAACGTAG TCAATCATGT TGGGCAAGAT AGAGCCACGT GCAGCGAATA TGACTTCAGC AGCAATCCGC TCCTCCATAC TCCGAAAGAA GGACAAGCAA CAAAAGGAAA GAGTCTTCCA ATTCCCACTG TGTATGTCGC AAGGCCAGGA AATGGTACTG TTGGGTGTAA TGTTCATCAA AGTGAAGACA CCCTTCCGCA TGGTTTACAA GAAAGTCGCG TAGTGGATCG GGCTCGTTCC ACAGAGAGAT GGTACAGCGG GGCACAACCT AGTGGATTTG TTAGCCTTCC TTTCACTCAT GAGCAGTCCT CTACAAACCG AGTCTGCAAT AAGCGAGCCA CGTCCCTAGG ACAAGCTGCC GGCGTTGTTC CACCGCGTAT TATGCCGCCA ATTCTTCAAC ATGACGACTT CAAAATGGTG CCGCAATCCC GCAAGAATCG CTCGTCGACA CAACGAGAAC AACACATAGC AATGGGCCGT TATGGGTCTG CTCATGCATC AGGGAGTCAA AGCCGTAGTA AAATGGAGCC CGCAAAAGGC TTCTGTAGAC ATGATGAAAA GGTGGAGTCA AAGCATCTCA TAAAAGTACT GTCATCTTCC GGAAGTGAAT TTCCGACGAA AGGAGCAAGA TCTATATTGT CAGCAAAAGG CAGGTCGTTT CGCCAATTAA TATTTGATTT GGATGATGAT GAAGTAGATC TGCTGCGGGA ACTGGCTAAA AATCCCTAA
|
Protein sequence | MDWGALDEES CNCDESPSHK NRDTKDYFST SSERRVSGPG NSQLSKQTSG QACADSKLKK QPKQTGKWKK PPGMPKRPLS AYNLFFAHER KQLIACGVLA SGRQKKHYTK QSTTLRRPPR KMGFAGLARA VAAKWKMIDD STRHTFNQQA ECEQAKYKIA IKQWNSQHLD LSETNVVNHV GQDRATCSEY DFSSNPLLHT PKEGQATKGK SLPIPTVYVA RPGNGTVGCN VHQSEDTLPH GLQESRVVDR ARSTERWYSG AQPSGFVSLP FTHEQSSTNR VCNKRATSLG QAAGVVPPRI MPPILQHDDF KMVPQSRKNR SSTQREQHIA MGRYGSAHAS GSQSRSKMEP AKGFCRHDEK VESKHLIKVL SSSGSEFPTK GARSILSAKG RSFRQLIFDL DDDEVDLLRE LAKNP
|
| |