Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35855 |
Symbol | |
ID | 7200858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 909335 |
End bp | 910576 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180343 |
Protein GI | 219119153 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0016146 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTTAT CCAACAAATC TTTCCACGAT ATTTTTCTGC AAGTCATGGT AGGGCACCCT GACGACGATA TACATAGGGA CGCCCACATG CAGGATACAC GGCGTTCCTC CCACCCCAAC GATACCTCAC CGCTCCCATC CGACGTCGTG ATCGGATGGC GTTCGCTGTG TGGTTCCGTC CTGCCGGAGA ATATCCGTGC CCACGCGGAA GCCTATTCTG CAACGGCCGT ATCACCACAC CATTCCAATA CTCGTCGAAC GACCCAGGCT ACCGCCAATT CTGTGACTAT TAGCAACCGC ACATTCCCGC TCGATTCGAA ACAATTGTCT GATCCTAGTA ACGTGCCTGC CTCCAACCAC AACGAAAACA GCTATCTATA CGAGCGTCAT AGTCCTGTGC CACTTGTGAG CCCATTTCTA CCCGATATGA CGGTTTGGGA CAATTCTATG CAGCTTGGTT TACAACCAAT ATTTACGGAT CACGATTCAA CGTTTGCCTC TGCTTTCGGC AAAACGTACG TATCTCCCGA AACCTTGCCG GTTCCTGCTC TCCCGGCGCC TCCCCCCGCT TTCGATCGGG CGCACTCCTT GGCCTTTTCG GTTTCCAGCG ACTGTACGAG CCAATCTGCT CTCTTTTTTC ATCCTATATC TAATCCCACA CACGATCCAC CCGTTTCACC ACTCAAACCA TGTTTCTTGG ATACACAAGA GCAGCCCGTG GTGCCAGACA TCGTCCACAA CTTGCCAATC GAACCCCGTC CCACCAAGAA GCTACGCACA ACGGAAACGA TCAATAATGA TGATCTGTGG CCTCGCGCGT TGTGGCCACA ACCTACACCG CAGCTAGCGA TCAACACAAC GTGTGCGCCA CTCGCCCCAC CACCGGCCCA GCACGACTCC CAACCAGTCG TATTGCTCCC CATGCCAATT GTTTGGCCCA AAACACCACC AGCCTTGAAC ACAAACCTGA CTTTCGACAC GCCGTGGCTC TCCGCCGCTC AAGTGGCCAC CAAGAAAAGT GACCCCGTCG TGAAAAGGAA TAAGGACAGC GAACAGATTT CCGCCAACAT TCTTGTCGCC TCGCGACCTA GCGTGGTCGA TCCACGGTTG GCTACGTTCT TGGAACGCTT TGACAACGCC GAATGGCGAT TGCAAGCCCT GCAGGCCAAA AACGCCGAAC TCCAAGCCAA AGTGCAGGAA GCCGAACGGC AAAAACGCGC CATGCAACAG TTGGCCAGGT AA
|
Protein sequence | MDLSNKSFHD IFLQVMVGHP DDDIHRDAHM QDTRRSSHPN DTSPLPSDVV IGWRSLCGSV LPENIRAHAE AYSATAVSPH HSNTRRTTQA TANSVTISNR TFPLDSKQLS DPSNVPASNH NENSYLYERH SPVPLVSPFL PDMTVWDNSM QLGLQPIFTD HDSTFASAFG KTYVSPETLP VPALPAPPPA FDRAHSLAFS VSSDCTSQSA LFFHPISNPT HDPPVSPLKP CFLDTQEQPV VPDIVHNLPI EPRPTKKLRT TETINNDDLW PRALWPQPTP QLAINTTCAP LAPPPAQHDS QPVVLLPMPI VWPKTPPALN TNLTFDTPWL SAAQVATKKS DPVVKRNKDS EQISANILVA SRPSVVDPRL ATFLERFDNA EWRLQALQAK NAELQAKVQE AERQKRAMQQ LAR
|
| |