Gene Information |
Locus tag | PHATRDRAFT_49232 |
Symbol | |
ID | 7195697 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 309200 |
End bp | 311894 |
Gene Length | 2695 bp |
Protein Length | 780 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183852 |
Protein GI | 219127250 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGGAGG GGGGAAACGC GTCGAACCCT TCTGGTATTG GCCAGCCGCA AGTTTCGTTT TCTTGACGCA TTCGTCCCCC CCTTTTCTCG CACCGTAGCG CAGTAAACTT TCTTCTACAC CCGCATTCCG TATGAATCCC GCCAACACGG AGGAGGAGGA TGTCGCGGAA GACGGGTCTC TCCTGTCCAC CCGCGTGCTG TGCCGGACGT TACAGAAAGA ACTGCACAAC ATGGGCAAGG CCTTGTCGTG TCCGCTGTGT CTATCGACGT ACCGTGACGC CGTCACGTTG CCGTGTTGTC ACGCCTACTG TCGGTCGTGT TTGACGCAAG CCTTGGCGAC GGGCAGTGCG CGTCGGCCCC CGACCTGTCC GTGTTGTCAA CAACGAACCG CGGGTCGACG GAGTTTGACG GACGCACCGA AACTCAACGA ACTCGTCCGG GCTTACAAAC TCGCCCTCCG GCATTTCGGA CTCGCCCCCG TACGGTACGT ACTTGTGTGT ATATACATAC ATACATACAA GCACTTGGAA CGGTGTACCC GCAACGATTC TCGGTGTCTG TGGAAGGTCG CTCATCTTGT GTTTGTATGT GTCCGTGTAC AGATACGAAG AAGCCTTGCC CATGACGCAA CTCGTACCCT CCACACCGGA CGAGTCACCC CTCGACACGG TCGATGTACA CCAGCATTTG CAAGCCGCAC GTGTCTTTGC GCAGGCCTGG GACGGACCCG GCGACGTCCC CTACCGCGAC GAACAGGATC TCGTCGTGGC GGCCAACCGA CGCTTCTTAC TACAGGCTGC CGTCGCCGCC CAAGCCAAAC CGCAGTCTTC CCGGAATACT AACTCCACGA CTCTTTCCTA TTCACAACTC GCCAACCAAG CCGCTGAACA GTCCGCGGCG GATCGTAACG ACGAAACCTC GCCGTGTTGG TGGGAACAAC GGCACGAAAA GTCGCAGAGT ACCGTGGTAC GGTTTCGGTC GCAGGCGGAA CCCGAGGGCC TCGTGGAGGC CTCCCCACCG CACACGACCG CTGCTGGTGC CATTGCATCA ATGCAAGCTG GGATGGACGG TCCGTCCCTG CGTGACACCA AACCAACGGC TCTCGCCAGT CCAACCCAAG CACCATCGAC CAATTCAGAA CACGATCACG ACGATGACGA TTGTACCGTG GATCCCGACT TGCCGATACA AAACGTGTCG ACGTACGCCA GCTTTGCGCG GGACGTCCCG TCACCCGCCA CACTCTGGCC CCTTTCCCCC TCCAACGCGG CGGAACGCGT ACACGCACAC GACAATGACG AGCTTACCGT CGACCCGGAT ACACCCACGA ACGCGTACCG CATACCATCG CCCCCAACAA TCGCCAAGCT AGTCTCCACC GACAAGCTTT CCCCGGTACG TCCTCACGAC GTGACCGGCA CCACTGTCGA TACAACCATG GAGGTCTCCA TGACAACGGT TACGTCATTC CATACAACGA CGGACACGTC GCGGATGCCA GTAGCGCCCC CTCCCGTTGG CCCAACTCTC CCCACTACCC GCCCACACTC GGGGGTAACG GAATCCACGA AACCGTCTTC TCCTCCGACA GAGCGCTGGC AGAGAACACG TACCCCTTCC CCTGCCCACG TGAGACTCGT CGAGTCGTTC CGGGAACAGC GTTCAAACGA CCTACCATTA ACTACCCTAC CGGCACAGAA CGGAAACGAT GCGACCGTCG CCGAGTCGTC TGACCTCTTC CCCGTGGGGG CCATTGTCCA AGTGCAGCCG CGCACCTGGC CGGGCGTCAA CAAACCCGGT 
GGTGTCGGCC GCGTCGTCAC GGTGCATACC CACGTGGGCA ACGCTGCTGT TCAGTACGAC GTGGCCTACG TTCTCGGCGG CCGCGAGCGT CGCGTCGATG CCGTCTTTGT GGCGAACCAA CCCGCAAAAC TGGGGACCAC TACGGAATTG GTGTCGGCGA CGCCAACGAC TACGGAATGC ATGCCTCGGA GTCGGGCTTC CTACCGAATC AAACACAAGA GAGAAGAAGA AATTCCTAGC TTTTTATTGG AGCAGCTAGC CAAGGAAGGT TTCGATACGA AAGGCACGGT TGCCCCCGTC CAAGAAAATA TGGCCGATGC CGCTGGTGCG GTAGAGAATC AGAGAGCGGA TTCCAAACCG ATTCAGCAGC GCTTTGGAAG AAAAAGATCA GCCACAACAG CGTCCAAAAG CAACCCTACC AAACGAACCA GGCGAAAGGA AAACGTTCAC GCGCCAGCCC GTGCCATGGC TACTTCTACA GCTCCGACGG TAATCCCTCC AGATCCTATT CTCCCTATCT CTCGGGAAGA AGCCTTGGTT CTGGCAGATC AGTTGTACCA GTCTCGCATC CAAAAGGCAA TCCAGTCCGG CGTCATTCAT GTGGTGGCTT CCTCGTTGTC GGATCGCGAC AGAGAAGAAC TCCAATTTTT GTGCAGTGAA ACCGGGAGAG GCAAAGGTAC GGTATACAGC GTCTATTTTG AAACCATTCC GCTCGCTCTT TGTTCTCACG CTACGCATTT TTTGTACTAA TCAAGTCAAA GTTGTCCTCT CGGATACTAT TCAGTCGAAA ACGACGACGC TATGCTTGCT GCCCGTTGAT CCTCATTCCA GCATTACCGA GAATGCGCAA GCCTTGACCC GAACCCTGAA AGCTATGCAA TCGGCACTTG TAGGG
|
Protein sequence | MEEGGNASNP SGIGQPQRSK LSSTPAFRMN PANTEEEDVA EDGSLLSTRV LCRTLQKELH NMGKALSCPL CLSTYRDAVT LPCCHAYCRS CLTQALATGS ARRPPTCPCC QQRTAGRRSL TDAPKLNELV RAYKLALRHF GLAPVRTWNG VPATILGVCG RSLILCLYVS VYRYEEALPM TQLVPSTPDE SPLDTVDVHQ HLQAARVFAQ AWDGPGDVPY RDEQDLVVAA NRRFLLQAAV AAQAKPQSSR NTNSTTLSYS QLANQAAEQS AADRNDETSP CWWEQRHEKS QSTVVRFRSQ AEPEGLVEAS PPHTTAAGAI ASMQAGMDGP SLRDTKPTAL ASPTQAPSTN SEHDHDDDDC TVDPDLPIQN VSTYASFARD VPSPATLWPL SPSNAAERVH AHDNDELTVD PDTPTNAYRI PSPPTIAKLV STDKLSPVRP HDVTGTTVDT TMEVSMTTVT SFHTTTDTSR MPVAPPPVGP TLPTTRPHSG VTESTKPSSP PTERWQRTRT PSPAHVRLVE SFREQRSNDL PLTTLPAQNG NDATVAESSD LFPVGAIVQV QPRTWPGVNK PGGVGRVVTV HTHVGNAAVQ YDVAYVLGGR ERRVDAVFVA NQPAKLGTTT ELVSATPTTT ECMPRSRASY RIKHKREEEI PSFLLEQLAK EGFDTKGTVA PVQENMADAA GAVENQRADS KPIQQRFGRK RSATTASKSN PTKRTRRKEN VHAPARAMAT STAPTVIPPD PILPISREEA LVLADQLYQS RIQKAIQSGV IHRRTPIFVQ
|
| |