Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47297 |
Symbol | |
ID | 7202378 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 220761 |
End bp | 222390 |
Gene Length | 1630 bp |
Protein Length | 416 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181682 |
Protein GI | 219122707 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCGCGTCGAA CGAAACTCAC TGTCAACACT TTTTGGCAGT GCGTTTGTTC CTACAATCGT ATCCAAGGCT GCAATCTATA TTACATATTC GTTACAGAAC CTCAGTATTC GATCAGAGCA ATACTCCAGC CTCCCTGGAT CCCTGAATTC AACCATTTTC TACGTTGTTG TCTTGGCTTT TGGTTCGAAG AATTGCATCG TAGAATTTCT ACGTGCTGTC TACGTAAAGT CATATATCGC ATCACAGTCA ACTCCGCTCT GGCAGCGTTG ACCAGGAGTG GGAAAAGACC GACCTCATTC CATGGCTACC GCACAAGTTT GGATGGTCAA CGACACCAAT TGCACCATGA ATCCTTCCTC TTTGCCTATA GCGGACGAGT TAGCTCCTCA CAAAGACAGC GACGCCACAG CTCACAATAT TGCTCACGAC TTTGCCCAAA CCCTAACCAC GCCGGCTAGT ACTGGGCGCC CAAAGACTGC GCCGATATGT ATGGATGAAA ACCTCCGTAG TCAAGCCATG ACTACGGAGG AGCTTGCTGC TTTTGCTTTT ACAGCACCTA TGAAAAGCAA CGTTTCTCTA TTTCCCGTGG TCCCTGTGTC ATCAAACTCC TCCGAATGCA CCCGTGAAAG TGACCTGGAT GTACTTTCCG ATCACGAGCA TTTTTCGAAA GGAGTTTCGG ATCAAAGTGT CGCGTCCACA CCGGTGCGGA TGGACCGGGC CTTGACCGAT GAGCTCAACG CCAAAGTTTC ATTGCGAAAA ACCTCCCGCC AAGGCCGCTC TACGCAGCGT TGGGCTGAAG AAGAGGACAC GGCATCGGGC GCCATTCGAC TCGTCACTGG CTGTGTTCCG ATTCTCAAAG ACGGCAAAAT ATTATTCGCG TCGGCTTCAC GCAAGTCCGA GTGGATTCTT CCCAAGGGTG GATGGGAAGA GGACGAAACC ATGCCGGAAT CGGCAGTTCG GGAGTGTTTC GAAGAAGCGG GGGTACTCGG TGTTTTGGGG CCACCTTTGC GAACGATCCA GTACGAAACC CGTAAAGCAA AAAAACGGCG ATTGGAACTT GAGAACTCCA ATCTTGCCCC ATTACGATCC ACCAAAGCCA AGATGACTGA TACTTGTGGT AGTGTTTTGT CTGACATCGA AGGAGCTATC ACTGACGGAA CGGCATTGCC ACCCGCACCA GTCGTGACCG CACCTACATT AATGCTGTCC GAAGAAGCCA TGTCCAGAAT ACTCGGGCAT CCCTCCAAGC CCGGTAGGCC AACGACAAAA GCTGCGCCTG TACACCCCAG TGACGACACA GTGTCAATTG CATCTACCAC GCTGTCGGCA ACGTATTCGC AGGTGCGCAT GACACTGTTT CCGCTTTACG TGACTAGTGT GATGGATGTT TGGCCCGAGT CGGGGCGCTT CCGGAAAGCG GTCTCGATTG ACCAGGCTTT GCAATTGCTC GAAACTCGAC CGGAACTACA AGCGGCCGTG CGCGAAGTAC AAGAGCGGAG TTTACATGAG GTGTCCAACC TATCGTCCCT TCCGCCAGCA TCGTTCCGAT GACAATGGCA GCAAGTAGCC GAACCTCGAA AGGAATCAAT TCCGCTGCAC AATGAAAATA GATTTACACG AACAAGGTAC
|
Protein sequence | MATAQVWMVN DTNCTMNPSS LPIADELAPH KDSDATAHNI AHDFAQTLTT PASTGRPKTA PICMDENLRS QAMTTEELAA FAFTAPMKSN VSLFPVVPVS SNSSECTRES DLDVLSDHEH FSKGVSDQSV ASTPVRMDRA LTDELNAKVS LRKTSRQGRS TQRWAEEEDT ASGAIRLVTG CVPILKDGKI LFASASRKSE WILPKGGWEE DETMPESAVR ECFEEAGVLG VLGPPLRTIQ YETRKAKKRR LELENSNLAP LRSTKAKMTD TCGSVLSDIE GAITDGTALP PAPVVTAPTL MLSEEAMSRI LGHPSKPGRP TTKAAPVHPS DDTVSIASTT LSATYSQVRM TLFPLYVTSV MDVWPESGRF RKAVSIDQAL QLLETRPELQ AAVREVQERS LHEVSNLSSL PPASFR
|
| |