Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38879 |
Symbol | |
ID | 7203608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 432176 |
End bp | 434599 |
Gene Length | 2424 bp |
Protein Length | 807 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182835 |
Protein GI | 219125118 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.192146 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCAGCA GCTGTAGCAT TTCCCGAAAT GGATCCACTA GAACTTTGTC AACATCACCC AAGAAGCCCT TGAAGGCGGC TTCTTCTCCT AGAAGTAGTG GCGACTTCCA AGTCTTACGA CAAGGAATCG AAAAAGCTCG GGATACTGTA CATATATCAA GAATGGAACT TGCCTCCTGC CTTGATCGAC TCGGGGAGCA TTATGCCCGA CATCACGAAT TCGACGAAGC CATGGATGCA TTTACCGAAG CGCTCCACGA GAAGCGAAGC GTCCTTTCGC ATATATTACC AGAGAACTTG TGGTCGAGTA AGAGCGCCCT TTCCCCACCG TTGGCAGTCG ATTTCGAAGA TAAAACCGGT GGAGACTCTT TTGACAACTT GACCGACGAA ATTATCATGA CTTTGCGTAG TCTTGGAAAC GTCCACTCTC TTCGTGGGGA ACAAGACGAG GCCATGCGGT ATTTCACAGA AATTACCAAT CTTCGAGCAA GGAAGACGGA GAAAAAAGCT GACAGCGGCG ACCAAGCCCT CTTTTCAGGA CTAGGAATTG ACGAAGACAA CTCGGCACAA ATGGCTGAAA TCAATGAAGA TATGAAAGCT CTAGGCGACA TGTTTCAAAT CGTTTCGTTT CGAGACCGTG AAAATGGTTT GCTCACGCAA AGAAGTCGGA TGACTTCGAC ACTTAGAAGC TCCGCAAAAG AGAACACTTC TTGTTCGAGC AATAAAAGGA GAAAAAGCGA CTCTTACTGC GGGATTATGC CCGTCATCGA AAGTGAGCCC TTCAAACGAT CTTCTTCACT TTGTCTTTAT GCTACGAATA GTGACCTTAG TGAAGCTCTC CGGATGTATA AAGCTGTTCT TGAGTCATAT ACTGGTCCCA AGCTAGAGCA GCACAAGGAC ATTTTTAACT CTCTTGCCTT AAGAGTCGAT CTGCTGGCGG AAACTGGTCA GCAAGACGAC TTGGGCTCGA CAAGCAAAAA CAGATTTGAT AAGAACCTAG ACCTTGCGGT GGAGATCTAT CAGCACACTC ACACAGCACA AGTGGAAATG ATTACGACGG AAAGGTCGGG GTCGGGGTCG AATCCCCAGG CATGTAAGGG TATTGCCTCG ACTTTAATTC GTATGGGAGG TCTCTACTTT AAGCTTGGAC GTCGGGTTGA AGAGTTGAGC ATGTACAAAC AGGCGAAGGA CGTTTACTGT CGAGCATTCG GAGACAAGCA CCCTTTCGTT GCTGGGGCAA GGAAAAATAT TGGCATGGTT ATGGCCGAAA GAGGGGAATA CGACAACGCA ATGGATCAGT TCAAAAGAGC AAAAGAAATT TATCTCGCTG TCAATAGAGG TGACGAAATC AGCAGAAACG TTGCCAGTGC CATATCTTGT ATGGGAAATG TCAAAAATCG AATAGGAGAA CTTGACGAGG CCCTTGAACT GTACGTGGAG GCCCTGCGAA TCTACAAGGC AATTCAGGCC AAGCCAACGG ACAATGAGTG CGACGATGTT TGCACTTTGG ATGTGACAGC AACACTAAAG GTGATTGGGA TGGTACATTC AAGAAAGGGG AACCTTGATA CTGCAATGTC AGTCTTCTTG GAGGCCCTGA CTCTGCTTCG AACGTATGGG GATAATGCTA CAGCAAGCTG TAAAGAAACG ACCTCATCTG TCTTGACTAG GATGGCCAGC ATCTACGCGA AGAAGGGTGA GCTGGACCAT GCGATGGATC GTTACAAAGA GGCTTACGAG ATCTCTGTCC AGAATCACGG GACGACAAGC CATCAAGAAG TCGCTGGTAT TCTGCATTAT ATTGGTGGTA TTTTTCACAA GCGATCAAAT TTTGACGAAG CAATGAACTG CTACCAAGAG GCTATTCGCA TCTACCATGA AACACTCGGG CCTGGAAATG CAGCTGTAGC GGGAACCCTT GTCATGGTGG GAAGCATCCA TTACAAACGC CGAAACCTGG ACTCTGCGAA AATGTTCTAT CGGGAAGCTC TTCGACTAAA CAGGGATGCC TACGGCTTTC ACCACCCAGA TGTGGCTCCT ATCCTCAAAA GTATTGGCAC AATCCTCACA AAGAAAGGAG AATACCAAGA GGCATATGAC ATGTTTAGGG ATGTACTTTC GATCAAGTGC ACGATTCATG GTACCGGTCA TCCCGAGGTC GCTAGTGCCT ACAAAAGCCT GGGGAATGTC CACTACAAGC TCGGTGAGCT TGCAGATGCG GAACGACAAT ATCGACATGC TCTGAATATT TTTCGACGTA CTCGCGGAGA AGACCACGCC GATACAATTG CTGCTAAAAC AACAATTGAT CATATACGCT ACTGGATGAA GGAGCGAGGC CAGCGAAAGC ATGAGCAACG ACAAGCTCGG AGCCGCGCCT TGTCGGAGGG ACGAGATGAG GAAATTGATA AACGCAGTTT CTGA
|
Protein sequence | MRSSCSISRN GSTRTLSTSP KKPLKAASSP RSSGDFQVLR QGIEKARDTV HISRMELASC LDRLGEHYAR HHEFDEAMDA FTEALHEKRS VLSHILPENL WSSKSALSPP LAVDFEDKTG GDSFDNLTDE IIMTLRSLGN VHSLRGEQDE AMRYFTEITN LRARKTEKKA DSGDQALFSG LGIDEDNSAQ MAEINEDMKA LGDMFQIVSF RDRENGLLTQ RSRMTSTLRS SAKENTSCSS NKRRKSDSYC GIMPVIESEP FKRSSSLCLY ATNSDLSEAL RMYKAVLESY TGPKLEQHKD IFNSLALRVD LLAETGQQDD LGSTSKNRFD KNLDLAVEIY QHTHTAQVEM ITTERSGSGS NPQACKGIAS TLIRMGGLYF KLGRRVEELS MYKQAKDVYC RAFGDKHPFV AGARKNIGMV MAERGEYDNA MDQFKRAKEI YLAVNRGDEI SRNVASAISC MGNVKNRIGE LDEALELYVE ALRIYKAIQA KPTDNECDDV CTLDVTATLK VIGMVHSRKG NLDTAMSVFL EALTLLRTYG DNATASCKET TSSVLTRMAS IYAKKGELDH AMDRYKEAYE ISVQNHGTTS HQEVAGILHY IGGIFHKRSN FDEAMNCYQE AIRIYHETLG PGNAAVAGTL VMVGSIHYKR RNLDSAKMFY REALRLNRDA YGFHHPDVAP ILKSIGTILT KKGEYQEAYD MFRDVLSIKC TIHGTGHPEV ASAYKSLGNV HYKLGELADA ERQYRHALNI FRRTRGEDHA DTIAAKTTID HIRYWMKERG QRKHEQRQAR SRALSEGRDE EIDKRSF
|
| |