Gene Information |
Locus tag | PHATRDRAFT_47120 |
Symbol | |
ID | 7201919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 512439 |
End bp | 514685 |
Gene Length | 2247 bp |
Protein Length | 748 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181391 |
Protein GI | 219122100 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGCTT CGGTGCGAAT TTTGACGACA TCGTCTGTTG ATTCCAATCC TGCCATTCTT CTCGTGGAGC CTGATGGATC CAAAATTTTG ATCAACTGTG GAGAAGGAAG CCAACGGTCT TTTCTAGACT CCCAGCAGCG GGTTTCGACA GTGAAGGCCG TCTGCTTGAC GCATCTGTCT TATGAATCCA TCGGCGGCCT ACCAGGGATG ATTCTCACAG CTAGCGATGT TCAGAACGCG ACCATCGAAA ACGCAAAAGC TGCCGCCGCA GCCAAAGTTC GCAAAAGCAA TAATTCTACA CAACTTTTGC CCCCCTTTCC AACGGATACG GCACAGGGCC TCAACGTTTT TGGACCAGAA GGGACTAACT CTTTTCTCAA GTCTCTGCGG CACTTTATGA GACGAGATTC CTTTCGGCTG AACGTCCACG AAGGGCTCGT CGAGGGCATT CGTGTTTGTC TCCCAAAAAC TCGTAAACGC AAAAGTGGTC AACCAGTTTC TGAAGGAGCT TTTTTTTCTG TAAAGAGCTT TCCTTTTGTG GAAAGACGTA TCGGCGATCG CAAGCGTTCT AGGACTTTGC CGGAGCGCAA AACTCTTTCC TACCTCTTCT GGACACCACG TTTTCCAGGA AAGTTCATGG CCAACGAAGC GAAAAGGCTT GGGGTACCGA AAGGACCTAT GTATGGGATG TTGAAAAGTG GGAACAGTGT GACCTTTTCC GACGCTTCTG GCGAACAACG TACAGTGACA AGCAATCAGA CTGTGCAACC AGATAGTCCA GGAATAGGTG TCGCTGTATT GCGGTATCCC GAAGATTTCT TTGAAGAGCA GCTTTTAGTA TTTTTCAAGC AAATGACAAT GAAGAGAGTA ATAAGCTCGG TGGGAGTTGA ATTGGAAATT GCGATCCATA TTGCTAGTCG GAGCTCGTTT GGTGACAAAA TTGCGCGGCA ATGGAGAGAC GAATTTCCGT CCACCGTACA GCATTTACTG TTGGACACCG ATATTAGCGC GGACTCACAT GGCACCCCGT TTCGATCAGC GGCGCACGGC GCGTTATGTC GATCTCTCGT TTGCCCGGAC CTGTATGTAC AGGTTAGAGA GCCAAATACG TTGAGACGAC CATATGGACC TGAGCTGGCT CGTGCGGGCT CAGAATTCGT TCTACTACCT CGGGGTAAGG TTGGCTTTTC AGACTTCGTT GATTATAACA TAGATGATGG CAAAGAGAAA GCGAGAACTT TAGTGAAGGA CTCGGGAGCT TCCACGTTGG CGAAGGAGCT TTTGGCTGAA TGCGCTCTAT GCGTGAATGA ATCATTTTCA GGGGAGCTCT TTTTCACAGG TACCGGGTCC GCAATACCAT GCAAGCATCG GAATGTGTCC GGAATTTGTC TCACCTCACC GAATGGAAAC TCTATTCTTC TTGACGTTGG AGAAGGAACA GTTGGACAAC TCCTCCGCGC AAACAGTGGT CCAACATCAA GTACACTTGC ACACATCAAA GCTGTGTGGA TCTCGCATCC ACATGCTGAT CATCACTTGG GGATTCTACG ATTACTCCAC GATCGGAAGG CGCCCGACCC CTTGTTACTA ATGTGTCCAT CACCCATTAT TTCGTTCCTG ACGGAGTATT GTTCCATGGA TTCTGACCTG TCGAGCGCAT ACGTTGCCGT TAATTGCAAT GATTTGATCC GAGAAAATGC AAAGGCGAGC TTTCTACTGA AAGAGGCTCT CGGAATTGAT AGTAGCTTTG CGGTTCCGGT GACTCACTGT CCATACTCTT TTGGCTTGAT TTTAGAGGGC 
ACTTGTTTTG GTAAGCTTGT CTACAGTGGC GACTGTCGTC CTTCCAGCCA GCTCGCCAAG TGTGCTTTAG GCGCTGACTT GCTAATCCAT GAAGCTACTT TTGAAGACGG AATGGAAGTT GAAGCGGCCT TGAAAAGGCA CTCTACCATT GGAGAGGCTC TTTCGGTTGG AATGGAAATG AAAGCCAAAT GCGTCGTGCT TACGCATTTT TCGCAGCGAT ATCCAAAGGT TCCACCAACT CCAGTCAACC ACGAAGGATC AATCCCGGTC ATCTTTGCGT TCGATTTCAT GCGTCTGTCA CCCAGCAACT TAGTGATGGC CTCCAAGGTG ACTCCGGCAA TTCGTCTTTT GTATCCCGAG GAAAGCGAAG GAAGGCAAGG CGCGGAAACT GAAGCAGAGT CTATAATGGC AATTCCTGGA CTGTTCGCAC AGAGCGAACT CCTGTAG
|
Protein sequence | MTASVRILTT SSVDSNPAIL LVEPDGSKIL INCGEGSQRS FLDSQQRVST VKAVCLTHLS YESIGGLPGM ILTASDVQNA TIENAKAAAA AKVRKSNNST QLLPPFPTDT AQGLNVFGPE GTNSFLKSLR HFMRRDSFRL NVHEGLVEGI RVCLPKTRKR KSGQPVSEGA FFSVKSFPFV ERRIGDRKRS RTLPERKTLS YLFWTPRFPG KFMANEAKRL GVPKGPMYGM LKSGNSVTFS DASGEQRTVT SNQTVQPDSP GIGVAVLRYP EDFFEEQLLV FFKQMTMKRV ISSVGVELEI AIHIASRSSF GDKIARQWRD EFPSTVQHLL LDTDISADSH GTPFRSAAHG ALCRSLVCPD LYVQVREPNT LRRPYGPELA RAGSEFVLLP RGKVGFSDFV DYNIDDGKEK ARTLVKDSGA STLAKELLAE CALCVNESFS GELFFTGTGS AIPCKHRNVS GICLTSPNGN SILLDVGEGT VGQLLRANSG PTSSTLAHIK AVWISHPHAD HHLGILRLLH DRKAPDPLLL MCPSPIISFL TEYCSMDSDL SSAYVAVNCN DLIRENAKAS FLLKEALGID SSFAVPVTHC PYSFGLILEG TCFGKLVYSG DCRPSSQLAK CALGADLLIH EATFEDGMEV EAALKRHSTI GEALSVGMEM KAKCVVLTHF SQRYPKVPPT PVNHEGSIPV IFAFDFMRLS PSNLVMASKV TPAIRLLYPE ESEGRQGAET EAESIMAIPG LFAQSELL
|
| |
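The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, which the record does not state, and that the unspliced CDS ends in a stop codon; the gene sequence as listed ends in TAG. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 512439
END_BP = 514685
GENE_LENGTH_BP = 2247
PROTEIN_LENGTH_AA = 748


def span_length(start: int, end: int) -> int:
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, ASSUMING its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks hold for the values in this record.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length, and protein length are consistent")
```

Under these assumptions the reported gene length (2247 bp) matches the coordinate span, and the reported protein length (748 aa) matches 749 codons minus one stop codon.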