Gene Information |
Locus tag | PHATRDRAFT_49984 |
Symbol | |
ID | 7198693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | - |
Start bp | 35405 |
End bp | 37141 |
Gene Length | 1737 bp |
Protein Length | 572 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184879 |
Protein GI | 219129403 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCGCA AAACGAAAGA CTTCACTATT GCCAACAATA GTCTTCCCGA ATTTGGAAAT CCCCCTGACA AGCACAGCAG AAAGAAGAAA AGCAAAATCC GGAAGAAGGA TCGACCATTG TGGTCGGCGC CGCGCACGGA CGCGCCACTT ACTGCTACAG ACGGGGAAAC CAACTCCGTT AACGAAGCTC TGGGCTATTC TCAATTTCAT TCTCACGATG CCGAGCACGA ATCTGACGAC GGTTGGTCTG ATACTTCCGG TAAAAGTAGT GAGCCAGACC TGGGGAAAGG ACGGCAGGGT CTAACAAAGA ACGCCGCCAC GATCCAAACC AACGTCGCAG AATCTCGTCA AGCCGATCCT CCGATGATTT GCAATCTCCC TGTACCGGTA TCCTTTAGTT CCGACGATGA AGACTCTAGT GGAGAGGATA GTGACCATGG ATTTGCGCAT TCCGCGCTCG GAGCATCTCC TTACACTGTC AAACAGCGGT CAAATGAAGA CTTGACGCGA GACATCAATC AGCGACCTCC TTCCCCTCCC GCATCTCCGA AAAAGAATAC GTCCGAACAT TTCATTGGGT ACCGTTCTAT GGACACAAAC CGCAAGGCAA TTCCCCATTA TTTGACACGC ACAAGTAGCC GTCCGGACCC TAGCGATAGT CCAAATTTTC ACAGCAGCAA TGCCCGTCAA CATCAGCACG AGCCGCAGTA CAATCCAAAC GAGCAGAGCT CTGTTGGTAG TCCTCGTATA AATGGGATTG AATTTGCCGG GAACTACGGA GATGGACGAG TTTCCGGCAA TGAGAAGGGT ACGCGAGGGT ACGATCCTGA CTACGAGAGT GATCAGACCG GCCTACAGTA CATTCCAAGA GATCCGTACC AGGATTCCGA TTATGATCTC CAAAACGATC CTCGATACTA TCCCGAAACG GCTGAAAGCG GTAGTGAATA CATTCCCGCA ATCTATAACG CCGAATTTCC CGATGCCTCG AAGACAGACC CCGAGACCGG GTATTTGCCA TCACAGTCGC AACTACTCCG AGAGGCCAAT GATTCGATTT CGTCCAGCCA AGTTCAGAAG CGTGACCGGC GCACAATGAC ATGGCTCATT GTGTGTCTGG GATGCGCTCT CGTGGCACTG GCTGCTCTAA CAGGAGGAAT AGTCGGAGCT TTGGTGTCCA AAGAGGATGC CGATGTGGTT GAACTGTCGG AGCCAACGGA AAACAGTTCC CCAACCACGC CCGCTGCCAA CATAACAAAG GCACCAACAA TCCCAACTCT AGTACCCACT TCTTCGCCAG CCGATATTGA AGAGAGAACA GAGGGACCAA CTCCATCTCC TCAAACATTC TCACCCACAT CATTGGCCAC AACGATCACA AGTCTTCTAC CGACACCTTC GCCAACGAGA ATGGAACAGA GTACAGAATT TCCTACTCAA GCTCCGCAAA CAATTTCACC CACGTCCTTG GCACCAACGA TCCCAAATCC AGTTCCGACA CCTTCACCAA CGGACACTAA AGAAAGTGAA GGAGAACCAA CGCAATCTCC TCAAACAATT TTTCCCCCTA CAACCGCTTC AAGCGAAGAA GGTACTAGCA ATGCGCCAGC GGCCGCCATT ACAAATACCC CACAAGTTGC GACAACGCCT GCGCCTGTTA CCGGCGGTGG TGGTTTCGGA ACTAGCGGTG GATTTGGCAC CGGTGGATGG TGGCGGTAGG GTGGAAACTG GCAAGTG
|
Protein sequence | MSRKTKDFTI ANNSLPEFGN PPDKHSRKKK SKIRKKDRPL WSAPRTDAPL TATDGETNSV NEALGYSQFH SHDAEHESDD GWSDTSGKSS EPDLGKGRQG LTKNAATIQT NVAESRQADP PMICNLPVPV SFSSDDEDSS GEDSDHGFAH SALGASPYTV KQRSNEDLTR DINQRPPSPP ASPKKNTSEH FIGYRSMDTN RKAIPHYLTR TSSRPDPSDS PNFHSSNARQ HQHEPQYNPN EQSSVGSPRI NGIEFAGNYG DGRVSGNEKG TRGYDPDYES DQTGLQYIPR DPYQDSDYDL QNDPRYYPET AESGSEYIPA IYNAEFPDAS KTDPETGYLP SQSQLLREAN DSISSSQVQK RDRRTMTWLI VCLGCALVAL AALTGGIVGA LVSKEDADVV ELSEPTENSS PTTPAANITK APTIPTLVPT SSPADIEERT EGPTPSPQTF SPTSLATTIT SLLPTPSPTR MEQSTEFPTQ APQTISPTSL APTIPNPVPT PSPTDTKESE GEPTQSPQTI FPPTTASSEE GTSNAPAAAI TNTPQVATTP APVTGGGGFG TSGGFGTGGW WR
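The numeric fields in this record can be cross-checked against one another. The following is a minimal Python sketch, not part of the database itself. The function names are hypothetical, and it assumes that the coordinates are inclusive (consistent with 37141 - 35405 + 1 = 1737) and that GC content is the percentage of G and C bases in the gene sequence.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start/end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used to display a sequence in blocks of 10."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Coordinates taken from the record: Start bp 35405, End bp 37141, Gene Length 1737 bp.
assert span_length(35405, 37141) == 1737

# Illustrative fragment only: the first three blocks of the gene sequence.
fragment = clean_sequence("ATGTCTCGCA AAACGAAAGA CTTCACTATT")
print(len(fragment), round(gc_percent(fragment), 1))
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` in the same way gives a value to compare against the reported GC content of 52%.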