Gene Information |
Locus tag | PHATRDRAFT_38585 |
Symbol | |
ID | 7203321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 452553 |
End bp | 453935 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182537 |
Protein GI | 219124494 |
COG category | |
COG ID | |
TIGRFAM ID | |
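The length fields above can be cross-checked from the coordinates. The sketch below is only an illustration and rests on assumptions the record does not state outright: that Start bp and End bp are 1-based and inclusive, and that the CDS length includes the terminal stop codon (plausible here, since the gene is listed as unspliced). The function names are hypothetical.

```python
# Values copied from the record; coordinates are ASSUMED to be 1-based and inclusive.
START_BP = 452553
END_BP = 453935


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count of an unspliced CDS, ASSUMING the stop codon is included."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1383, matches "Gene Length"
print(protein_length(length_bp))  # 460, matches "Protein Length"
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.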
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAGCAT CGAATGCGGC TTTGTGTGTC CTTCTCATCT TGACAGGATG CCAACTTGAA GGAAGCAGGA AATACACCAG CAAATTCAAC CCGATAGTAT GCGATGATGG TTGCGTTTCA AAGTCTAGGA TTACACCACT CAACGGTCAG AAAGAATTCC ATAGCCATCA ACGCCCCAGC TACTATCCTG ATTCGTCCGC TTTGCAAAGT CGACTTTCGT GGTCGTCGGT GCCAAGAGCG GATAAGATGC ACAACGAAAC ATTTGTACAT CGACAGAAAC GAACTCCTCT GAAAAGCAAC CAGCCTATTG TCTTTTTGGA CGTAGGTCCG CAGAAGACAT CCACTTCGGG TATTCAGAGC TTTCTGGCGG ACAACGAAGA CGGATTGAAG CTGCATGATA ATATCACTCT ACCTGCGCCT TTCCCTATAA GAATTGGTAA CTCCACAAAA ATGCTGAGGT TTATAAACTC GAGACATTTA ATCTTTTGCT TCTCCAGTAA AGAGACTAGG CCCAAGCAAT GGAGACAATA CCTGAGCTTA TCCCCCCCTC TCTTTCATTG TGAGGAGGTT TTGAGCGCAT TTCTCGACTT TGTCCGCACC GCACGAGCAA GTTCCAAAAA CATTCTCATG TCGGTCGAGT GCTTGTCATT TTTAAACGCT GCGCAGATCC AGCACTTTGT CAAAACCTGC TTTGTGGGAT GGGAAATTCG AGTCATAGTC GTTTACCGCC GTTTTGATGA GTGGCTTCCA AGTTTTCACT TTCAAGATAC ACGAAGTGAT CGGGCTCATT TACGCTCCAC GCTGGTGGAA TATCTGGACA GCCCGGAGAG TCTACATGCA GCCGAGTTTG CGTACAGCCA TGCAGTGGCC CAGCGATACC GGAGTATCGA TCCTAATCTC ACGATGTTGA ATTTTCACCA CATAGACAGT AATCGCAGTC TGATAGAGGA ATTCCTCTGT CGTGGGTTGC AGGGTCTGGC ACCGCATACG TGCCGTATCG CGGAGAAATG TGTCGCTCCC AAGGAGAATT CTGGATATTC GTTGGACGCC GGATTCGTAT TGGCCGTAGC CTTGAAAAAG AATTTGCTGT CACGAAACGA CAGAGTCACC AACGGTACCA TCTCACTCCA GTACGAAAAG CTGTTGTTCG AAAGTATCGA CCAAAAGCTG CAAGAAAGTA CGAATCTTCC GTTTTTTTGT GCGACAGAGC CTGTCCAGCA GTACATCAGG AACCGGACCA TGGAATGGTT TCCATTCGAT CTAGATTTTT TAGCCGCGAA ACAAAGCACA GGATACAACA AAAATCGCCC GCCCTTTTGC TCCTTGGATG CATCAGCACT TTGTGAGCGT GACGATTGGC AATTGTTTCT TCGTGGACTT TAA
Protein sequence | MRASNAALCV LLILTGCQLE GSRKYTSKFN PIVCDDGCVS KSRITPLNGQ KEFHSHQRPS YYPDSSALQS RLSWSSVPRA DKMHNETFVH RQKRTPLKSN QPIVFLDVGP QKTSTSGIQS FLADNEDGLK LHDNITLPAP FPIRIGNSTK MLRFINSRHL IFCFSSKETR PKQWRQYLSL SPPLFHCEEV LSAFLDFVRT ARASSKNILM SVECLSFLNA AQIQHFVKTC FVGWEIRVIV VYRRFDEWLP SFHFQDTRSD RAHLRSTLVE YLDSPESLHA AEFAYSHAVA QRYRSIDPNL TMLNFHHIDS NRSLIEEFLC RGLQGLAPHT CRIAEKCVAP KENSGYSLDA GFVLAVALKK NLLSRNDRVT NGTISLQYEK LLFESIDQKL QESTNLPFFC ATEPVQQYIR NRTMEWFPFD LDFLAAKQST GYNKNRPPFC SLDASALCER DDWQLFLRGL
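The relationship between the two sequences can be spot-checked by translating the gene sequence. The Translation table field is empty in this record, so the sketch below assumes the standard genetic code (NCBI translation table 1); the `translate` helper is hypothetical and written out by hand rather than taken from any library. Only the first 30 and last 12 bases are used, copied from the gene sequence above.

```python
from itertools import product

# Standard genetic code (ASSUMED: the record's "Translation table" field is blank).
# Codons are enumerated in TCAG order at each position, the usual compact layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()  # the record groups bases in blocks of 10
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three 10-base blocks and the last 12 bases of the gene sequence.
print(translate("ATGCGAGCAT CGAATGCGGC TTTGTGTGTC"))  # MRASNAALCV
print(translate("CGTGGACTTTAA"))                      # RGL*
```

Both fragments agree with the start (MRASNAALCV) and end (RGL) of the protein sequence, with the final TAA read as the stop codon.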