Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45071 |
Symbol | |
ID | 7200081 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 112983 |
End bp | 115972 |
Gene Length | 2990 bp |
Protein Length | 853 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179363 |
Protein GI | 219117137 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.706986 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACCAT ATTATGAAGA AGAGGTCACG GAGGCTATAC TGCCAGGTTT GGAGTGGCGA GCGGAAAGCC GTGTCCGCCG TCCAATATTG GCCAGAGGAG GCATTATCGC TGACGAGGTA AGCTCGACCG TTCTTCTTCT TCTTTCTCCA TTAGCGCACT CTTGAGCTTC TCTTACATGT GCTTTTGTTC AGGTCGGCTA CGGAAAGACC GCTATCACTC TGGGACTTAT TGATTCTTCG CAGAGGATAA ATGGCGATTG TCCGGAACCG CCGCCCGCTT ACCGTGACCG CTTGATTCAG ACGAAAGCTA CTCTAATCAT CGTTCCGGAG CATCTCATGG GTAAGGACTT ATATCTGTTT GATTAATTTG TATTTCTGGC AATTAGCTCA TTTTGCTAAA CCTTTCAACT AAACAGGACA ATGGCCGGAA GAAGTACAAA AGTTCTTGGG TAGAAGCAAG AGAGTCATTG AGATCAAAAG TATGGCTTCA ATGAACAAGA TTACAGTGGA AGACATTCAG AAAGCCGACA TTGTCGTTGC GAGCTTTCAG ATACTGAGCA ACGAGACCTA CTTCTTGAAC CTTGCCGAAT TTAGTGGTGT GAACGCGTCG GCGTTGCCGT CGAACAAGAA TGGCGGCCGG CATTTTGATT CCTTGTATAA TGACTGCCTC GAGGGTCTCT TGGTGAGAGT GCCACATTTA ATTGGTGACA CGCGCAAGGT TTTTTCAGAA ATACGTCAGG ATGCTGAATG TCACGAAAGT GCGAATGCGG AAGAAAACTT TCGTGTGGAC GGAAAGAAGA GCGCCTATAA AAATGGCTCC AAATCGTCCA AGATGAAGGC GCCCGAGACC AGCAAAATTG GCCGTGAACG TGATCCATGG GGGCTGTCCA CTTCAAAGGT AAAGGCTTCC TACCGTAAGA TGTCCTGTCC ACCCTTGGAA TTATTCTTTT GGAACCGACT TGTCGTTGAC GAGTAAGTCT TCAGCAGTGC GGTTTATTTT TCTAGCTACT GTCATTCTGA CGTTTTCTTT CATTGCAGAT TCACGTACCT TGAAGACAAC AAGCGCCATC GGGCTTTGTC GTTCATACTT GGCGTCAAGT CGTCGTATCG TTGGTTGCTT TCGGGAACAC CCAAGCATTC TAACTTTGAC GACATTCAGA GCCTTTCTTC TCTTCTGGGC ATTCACCTGG GCATTGACGA GAGCCTTCCC GGGGTCAAGA TGAATAGAAG TAGAGGTGTT GTCGCCGAAG AAACGACCGG TCTCGAAAGT TTGTCTTTGT ACCTGGAAAT GCACTCCATG CAATGGCACG AACGTCGTCA CGTGCAGGCG CAATCATTTC TGGACCAGTT CGTTCGACAA AACAAGGCCG AGCACGACGA GATTCCGTGG GAGGAGCACT TAATTTTTGT AGATCTACCC CCTGTGGAGC GCGCTATTTA TTTGGAACTC GAGACTCATT TGAAGAGTTT GGATATGAAC AAGAACGCAC AAAAGACAAA GAGGAAGAGC ACTGGCGACC GCGACAATCG CATGCAACGG ATTCTTCAAG ATTCGGCGAG TGCGGAAGAG GCCTTGCTAA AGTGCTGCAG CCACTTTAAC ATGTCGTCAG AGGCCGCCAC CGCGCTGGAG ACGATTACCG ACATTATAAA GCTTCGAGAC ACGCAAAAGA AGGAATTGGA ACGGGACATT GTCGTCTACC TTGCATCTGC TTTTCGCCAG CAGCATCGGA TTCTCCAGCA TCAGTCGGAT TGGCTCCTCG TGTCTCGATC AGAAAAGGGT GAGGTCGCCA GTGCTCTGCA ACAGTATCTG CGAGAGGTTG AGAAGCGTGA CAGTGTAACG CACGGGGCCG ACGACGAAGT TCACGACTGT ATCTTGCAAC TTGTCCGACA AGCGGAAGAA GCCTTTCACG CAGATCCCTC TCGGATCGAC TCTTTTTTCG ACGTGGACGA GGGTGACGAT CCACAGGAAG GATCCAGCCC CAAAAGGCGT CGGGGTGCCT CCCCGCAGAA ATCGAAGAAA GAAGCGGCCG AAAAATTTAC CGAGCGCCTG TTTGCAATGA AGATCCAACT CCGCGATCAT CTGCACCTTG TGCGATCTAT GGGTAAGGAA CTGTGTGGCC GCGTACGCAG TCTCCGGTAC GTGCAATGGA TTCGCAAATT CCAGGACGCC AGCATCAGGT TCACGTGCGG ACACTGTCGC GCCACGGGTC TCGAGAGTGA CCAGGTCGGG GTGTTGAGTT CGTGCGGTCA CGTTGGTTGT TTGGGATGTT TACGGGTCGA GTCGGCGGCG GAAAAGTGCG TCGAGTACCC GTCGTGCCGC GCGCGCGTGA GCAGCGCCCA CGTTGTGTCG TCGCGTCATT TGGGGTTGCA CCGGGCCGAC TCGAGTGGCG GACGGTACGG CCGCAAGTTG ACCGTCCTGG TGGAGAAAGT CCGGGAGATC ATTGCGATGG GCGACCGCAT GATTGTCTTT TGTCAATTCG ACGACCTCAA GGAGAAGATC CGGCAAGTCC TGTTGGAGAA CGGTGTGCCG TCGTTGGAAG TGGCGGGTTC GGTGCACCGC CAGATTGCGT CGTTACGGGT CTTCCAGAAA GAAATCCCGG GCCCGACCGA TCCACGGGTG TTGGTATTGA AGATGGACGA CGAGCAGAGC GCGGGCTTGA ACTTGACGCA TCTGAACCAC GCACTTTTCG TCCACCCGCT ACTGGCGTTG TCGCGTGCGG AGTACGACGC GTACGAAACG CAAGCGATCG GACGCATCCG GCGATTCGGT CAAACCAAGA CGGTGCACCT GCACCGGTTT CTCGCGCGGA ATACCATGGA TATGGAAATT TGGGAGGAAC GGACCAAGCC TCGGGCGTAG TGTGAATCGT CGGTTGTACC TGTATGCATG TGCGTGTGCG TGTGTGTATG TGGTGCGGAT AGAGGGAGGA ATGGATAAGG AATTGGATCG GGAACGTAAA ATCCAATAAT GTGAGGGGTT TTGACTATGA
|
Protein sequence | MEPYYEEEVT EAILPGLEWR AESRVRRPIL ARGGIIADEV GYGKTAITLG LIDSSQRING DCPEPPPAYR DRLIQTKATL IIVPEHLMGQ WPEEVQKFLG RSKRVIEIKS MASMNKITVE DIQKADIVVA SFQILSNETY FLNLAEFSGV NASALPSNKN GGRHFDSLYN DCLEGLLVRV PHLIGDTRKV FSEIRQDAEC HESANAEENF RVDGKKSAYK NGSKSSKMKA PETSKIGRER DPWGLSTSKV KASYRKMSCP PLELFFWNRL VVDEFTYLED NKRHRALSFI LGVKSSYRWL LSGTPKHSNF DDIQSLSSLL GIHLGIDESL PGVKMNRSRG VVAEETTGLE SLSLYLEMHS MQWHERRHVQ AQSFLDQFVR QNKAEHDEIP WEEHLIFVDL PPVERAIYLE LETHLKSLDM NKNAQKTKRK STGDRDNRMQ RILQDSASAE EALLKCCSHF NMSSEAATAL ETITDIIKLR DTQKKELERD IVVYLASAFR QQHRILQHQS DWLLVSRSEK GEVASALQQY LREVEKRDSV THGADDEVHD CILQLVRQAE EAFHADPSRI DSFFDVDEGD DPQEGSSPKR RRGASPQKSK KEAAEKFTER LFAMKIQLRD HLHLVRSMGK ELCGRVRSLR DQVGVLSSCG HVGCLGCLRV ESAAEKCVEY PSCRARVSSA HVVSSRHLGL HRADSSGGRY GRKLTVLVEK VREIIAMGDR MIVFCQFDDL KEKIRQVLLE NGVPSLEVAG SVHRQIASLR VFQKEIPGPT DPRVLVLKMD DEQSAGLNLT HLNHALFVHP LLALSRAEYD AYETQAIGRI RRFGQTKTVH LHRFLARNTM DMEIWEERTK PRA
|
| |