Gene Information |
Locus tag | PHATRDRAFT_47293 |
Symbol | |
ID | 7202328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 210541 |
End bp | 212283 |
Gene Length | 1743 bp |
Protein Length | 580 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181508 |
Protein GI | 219122347 |
COG category | |
COG ID | |
TIGRFAM ID | |
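The coordinate and length fields above can be cross-checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp / End bp rows of this record.
length = gene_length(210541, 212283)
print(length, protein_length(length))  # 1743 580
```

Under these assumptions the result agrees with the Gene Length (1743 bp) and Protein Length (580 aa) rows.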
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAGGACT TGGTAACCTT TTCGCTTTTT ATCTTGCTCT TTTGCACTTG CTCCGACTCG CTCGCGGTCT ACGGACGGTC CGCTGCGACG ACTTCGCGCA CAGCTCCCCC AATCGTTTAC ACGATTGCCG GATCAGATTC CGGCGGTGGC GCAGGAATTC AGGCGGACCT ACACGCCATT CATTCCTTTG GGTGTCACGG ATGTTCCGCT ATCACCTGCT TGACTGCGCA AAACTCGGTG GGCGTCATTG GTGTGCACGC CCCTCCACCG GATTTTCTCC GAGCCCAGCT GGAAACTTTG TTGGAAGACT TGCCACCGCA GGCTATCAAA ATCGGAATGC TCGGAACGAA AGAGCTTGCA ATCGAAGTGG GAGCCTTTCT GAAGAAGTTG AAGGCTTTAG ATCGGAAAGT TTGGGTTGTT TTAGATCCGG TCATGATCAC GACGTCGGGA CACCGGTTAA TTGAAGAAGA CGCACAGGAG GCCATGGTCA AGCACGTTTT CCCGCATATT GACGTTTTGA CGCCAAACAA GTTTGAGGCT GAAGCTTTGT TGAATCGCAC CCTTGAGACA ATCAGCGATG TAGAAGAGGG CGCAAAGGAC TTGATTGCAC TCGGGGCCCC ATCCGTTTTG ATCAAAGGAG GCCACACGCT GTACGAAGGG GGCAAAGCAA GCAACCACAT AGCCTACGCC CAAGATTACT TTTTATCGTC TGTGCAAAGA AATACCGGTG AGCCACGATT GTGCGATGGG GACCTTGGCG TCTGGTTGCG ATCGCCACGT TACGAGACAG AGCACACGCA CGGGACCGGT TGTACCCTGT CGGCATCCTT GGCGGCTTCT TTGGCGCTGG GGGAACAAGA ACGCCAAAAA CCGGACGGAA AGCGACGAGG AGCAACTAGT GCCATAGATA CAGTGGATGC ATGCTGCTTA GCCAAGGCAT ACGTCACGGC CGGTATTTTT CATGGAATTC AGCTGGGACA AGGTCCAGGT CCGGTTGCAC AGACTGGGTT CCCGTCTTCC CACCAATACT TCCCCATGGT GGTTGCAGAC GCAGCGGAAG ATCATCAAAG ATTTCCACGA ATGAAAGCCT ATGATGACAA GACTACCTAT GATGATAATC GACCAACGCT TGGTCGGATA TTGCCTGTTG TCAACGATGA GGTTTGGGTC CAGCGCCTGT GTCAAGATCC CGGTGTCCAC GACATTCAAC TTCGGGTTAA AGGTATTGAC GACAACAAGA AAATCTTGGA AATTATTAAG AAGTGCCAGA AGCTATGCCA GGCATCGGGC AAGCGCCTTT GGGTTAACGA CTACTGGCAA GAGGCAATAG AATCGGGATG CTTTGGTGTC CATGTTGGCC AGGAAGATCT TTACACATGC ATACGAGCAG GTGGTCTGCA ACTTTTGCGA GAGAAAAGGA TGGCGTTTGG AATTTCCACA CACTCTTACG GTGAGCTTGC CACAGCATTG GGGGTCAAAC CGTCCTACAT CAGCCTTGGT CCCGTATTCG CAACAAGTAG CAAAACCGTT CAGTTTGACC CGCAGGGCTT GTCACTAGTA CGGAAATGGA GAGAGCTTAT ACAAAAAGAG GTCCCGCTGG TTGTCATTGG CGGATTCTCA GATGCTGAAC GAGCAAAGGC TGTGCGAGGA TTGGGGGCCA ATTGTGTGGC AGTCATTGGA GCCGTAACTC AGGCAAATGA CACAGTTGAG GCTGTGTCAC AGATGAACGA AGCCATGCGA TGA
Protein sequence | MKDLVTFSLF ILLFCTCSDS LAVYGRSAAT TSRTAPPIVY TIAGSDSGGG AGIQADLHAI HSFGCHGCSA ITCLTAQNSV GVIGVHAPPP DFLRAQLETL LEDLPPQAIK IGMLGTKELA IEVGAFLKKL KALDRKVWVV LDPVMITTSG HRLIEEDAQE AMVKHVFPHI DVLTPNKFEA EALLNRTLET ISDVEEGAKD LIALGAPSVL IKGGHTLYEG GKASNHIAYA QDYFLSSVQR NTGEPRLCDG DLGVWLRSPR YETEHTHGTG CTLSASLAAS LALGEQERQK PDGKRRGATS AIDTVDACCL AKAYVTAGIF HGIQLGQGPG PVAQTGFPSS HQYFPMVVAD AAEDHQRFPR MKAYDDKTTY DDNRPTLGRI LPVVNDEVWV QRLCQDPGVH DIQLRVKGID DNKKILEIIK KCQKLCQASG KRLWVNDYWQ EAIESGCFGV HVGQEDLYTC IRAGGLQLLR EKRMAFGIST HSYGELATAL GVKPSYISLG PVFATSSKTV QFDPQGLSLV RKWRELIQKE VPLVVIGGFS DAERAKAVRG LGANCVAVIG AVTQANDTVE AVSQMNEAMR
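The sequence fields are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a field might be normalised, translated, and checked for GC content. It assumes the standard genetic code, since the Translation table field of this record is empty, and all helper names are hypothetical.

```python
BASES = "TCAG"
# Standard genetic code (assumed), codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 18 bases of the Gene sequence field, copied with the block spacing intact.
print(translate("ATGAAGGACT TGGTAACC"))  # MKDLVT
print(gc_percent("ATGAAGGACT"))          # 40.0
```

The translated fragment matches the first six residues of the Protein sequence field. Passing the complete Gene sequence value to `gc_percent` is one way to compare against the GC content row.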