Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42779 |
Symbol | |
ID | 7196401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1090785 |
End bp | 1092657 |
Gene Length | 1873 bp |
Protein Length | 521 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176721 |
Protein GI | 219109937 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0527893 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TAGAGCCGTT CAAGATTGAG AGTATAGAAA TCGGAAAGCC CCGCAAGAAA TATCGCTTTT GCCTCCTATG AACTCACTCT TCTAACTTTG AAAGGATCTC ATTGTGCTGA CTGATAAATG TGTATTTCAG TCGAAAAACC TATGGTTGCC GACGGCAACA GAAGTTCCCG ATCTTCGTTT TTTGAAACGC CGATTCTACG GCTTCGATAC TCCGTGGTGG GACCGGCTTT GACGATCGAC GCACAAGTCT TGTTGGCTCT TGTATTGATA GCAGCGTATC TTTCTAATTG CTTGCAGGAG GAGCGTGTTT CCTTTCTTAA TAATTTCTGG ACAATACGAA CGATTCTGCA GCAGGAAAGT CTCAGGGCTG TAATTTTCGT ATTTTCAATG TTTGCTTTTG GGCGGAAATG GTTTGGTTCT TTGCTAGGAG CTCAAAAATT GGCTCAAACA GGTTCATCGG CTCTGAAGAG CGTTGCTTCA AACGGAACCA TCGTGAGAAT CCGTAGTTCC ATGTCGATGT CTGTCATATC GGACCGGCTA ATGTCGATGA ACGGCAAGTC GACCTCGCAC AGGGCTTCTC TGTCACTGTC AGACAAAACG TGCTTTGCTC AATTGAAGGA TATTGAGAAA CTGTCGGTAA AGGATATGGG GAGCATTTTT CGCTACGCAA TACAATACAA TATATGGACT GACGCTCAAC TTAAAGCCTT TCTTTCAGAA ATTCGGGTCC AGGCCTTGTC CGTTGTTACC GCAATTGATC ACGCGCTTCC CCCCTTACAT CGTGCGTCAA TGGTGCAAGA ACCAAACGCG CAGAGTAGCA GTCCTAGGAA CTTTGGAGAT ATGGACGTAC TCCTGTTTTC AGCAGCCGTT CGAGTATTTG CGGAATGGAG ACTTCTTCGC CTTACCCCAG CAGGATATCG AAATTACGCA CTTGGGATGG CGCTGACACG TCGCGATTTG GTGCAAAATA TTGGAAAGAT AGAAGCTGCT GTACATGAGC TATTGGAAAG CCAAACTTCT CACAACGGAG TCAATTCTAT TAGACCTACA ATTTGGCAAT TGCTGGATTA CGAAGTACAG CGTGGTGTGC ACCCAAAGCT ACCTTATTTG GTCGAGAAAT CGGGTGCGTC GGGTATCCTT TGGATCATGC GGCAACTGAG TTTCCAGGTG AATTCATTTG AATACATTTC TAAGGTACCA ACAGCGTTTC CGTCATTCAA GATAGCCGTT CGCTCAGCAT ATGATCGAGT TTATGGCGAC TATCATGGAT TCTTCTTAAA GCAAATTTTT TGGAATTCGT TCAAATCGGC TCCGGAAGCT AGCGTAATTC TGAAGTTTAT GGAGGAAACC GAAGAAATTA TGTCCAGAGA TCTGTCTCCA TCGAATTCAT CCGAAAAGGC CACTGTTCTA ACAAGAGATG CTGTTTGTGA TAACATACAA TCTCCTCAAG AGCGGAATCA TTTTATTGGC ATGTTTCTCG AAGTGCAGCA GTTTCTTAAG CAGTGTCACG GGGGGCACCC AAAAGTAAGG CCACCAGCAG TCGACACATC ACTTCATGGC CCTGAACATA TTGGACAAAA GGGAGGGGAT CTCACCGCTT TTCTTCTCGA AATGCACCCG CTAATATCAG GACTTGATGG CCTGATTGGA CACTTCAATA TGAAAGATCC CTCTAAAGTT TAGCCGCCGG AGAGCCTGTA CCTCCTCTAA TCCCATTTAT TTACTTGAAG CTAGCAGGGA CATAGATACC GGCAACTTCA TCCTTGTAAG ATGATTTAGT AAAAGGCCAA ATTGTTTACA GTCAGCAATT TGAAAGTGTG TTGGCCTACC GCATTTTCGG GGACACTAAC AAAAACAGTT GTTGTAACAC CGT
|
Protein sequence | MCISVEKPMV ADGNRSSRSS FFETPILRLR YSVVGPALTI DAQVLLALVL IAAYLSNCLQ EERVSFLNNF WTIRTILQQE SLRAVIFVFS MFAFGRKWFG SLLGAQKLAQ TGSSALKSVA SNGTIVRIRS SMSMSVISDR LMSMNGKSTS HRASLSLSDK TCFAQLKDIE KLSVKDMGSI FRYAIQYNIW TDAQLKAFLS EIRVQALSVV TAIDHALPPL HRASMVQEPN AQSSSPRNFG DMDVLLFSAA VRVFAEWRLL RLTPAGYRNY ALGMALTRRD LVQNIGKIEA AVHELLESQT SHNGVNSIRP TIWQLLDYEV QRGVHPKLPY LVEKSGASGI LWIMRQLSFQ VNSFEYISKV PTAFPSFKIA VRSAYDRVYG DYHGFFLKQI FWNSFKSAPE ASVILKFMEE TEEIMSRDLS PSNSSEKATV LTRDAVCDNI QSPQERNHFI GMFLEVQQFL KQCHGGHPKV RPPAVDTSLH GPEHIGQKGG DLTAFLLEMH PLISGLDGLI GHFNMKDPSK V
|
| |