Gene Information |
Locus tag | PHATRDRAFT_26948 |
Symbol | |
ID | 7199979 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 905006 |
End bp | 909365 |
Gene Length | 4360 bp |
Protein Length | 1276 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179315 |
Protein GI | 219117041 |
COG category | |
COG ID | |
TIGRFAM ID | |
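The Gene Length listed above matches the Start bp and End bp values if the coordinates are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal Python sketch of the check follows; the function name `gene_length` is hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the Start bp and End bp rows of this record.
length = gene_length(905006, 909365)
print(length)
```

Under that assumption the result agrees with the 4360 bp in the Gene Length row. A protein of 1276 aa needs 1276 × 3 + 3 = 3831 coding bases including the stop codon, so the remaining bases would plausibly be introns or untranslated sequence. This is consistent with the "Is gene spliced | Yes" row, though the record does not give the exon boundaries.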
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACACAATCCA GCGACCACGG CAACTCACTC TGTCCGGACC TTGTTTACAG TTTCCCTCCC CCTTTTTTCT TCTCCGCGAC AACGATCATG ACGGAAACGA ACGTACCCAC CATTCCGTTT CCTCTCATTG ACTTGGTCCC GAAGAAAGAA ACCAACGTGC GGCGTTTTCT GGAAGCCTAT CCGGAGTACG ATGGCCGGAA TGTGATTGTC GGCATTCTGG ACACGGGGGT GGATCCGGGT GCGCACGGGC TCGGGACGTT GCCGGACGGC ACGACACCCA AGCTATTGAA CGTTGTGGAC TGCACCGGAT CCGGAGACGT CGACGTGTCG ACGCAAGTCG AACTCCAACG GCATGTAAAT GCCGACGCCG ACGACGGTAG CGACGGAAAC GTCCACGAGT ACTATCACGT GGAGGGGTTG ACGGGTCGAC GACTCCGCTT ACCTACCAGT TGGAATTATC GCGACTTTCC CACACCGAAA GACTCGAACA AAGACGAAGC GACGAGCGCC AACGGCAACG ACACCGCTGT CGACGATACA CCCACGCCAC TCACCACACC GCTTCCCCAG GTGCGATTGG GGGTCAAGCG CGCGTACGAA CTGTTTCCGG CCAAACTCCG GGAGCGTGTG CAAGAAACAC GCCGGCAGGC TTTTCAAGCC CAACTCGATC GCTATGTAGT GGACGTCCGT CAACAGCTGG CCGCCTGGAA TGTCGCACAT CCCAAACCAC CAACGTCTCC GGAAGAAGCC AAAGTACGTG ACGATTTGCA GGCTCGTTTG GATGTTTTGT TGGACAGTGA GTGGAACGAC GACCCGGGCC CACTCTACGA TTGTGTAGTC TTCTACGACG GTACTGATTA TCAAGCTGTC GTTGACGTAC ACGAAACGGG TGATTTGCGA AACGCTCAGC CCTTTACGAG CTTTGCCAAA TCTCGACAAT TTGGTACCCT GGGAACCATT GATCAAATGA ATTACGCCGT ACAGTTTTAC AACCAAGGAA CCATACTCAG TTTGGTGACG GACGCCTCTC CGCACGGTAC GCACGTTGCT GGGATTACGG CCGCCGCCGA AGGTGAGCGC AGCGGAGTGG CTCCCGGAGC CCAATTGGTT TCGTTCAAAA TTGGCGATTC ACGACTGGTA CGTATCGCTT TCGAGTCCGA GGCACCAACG AGTAGCTACA CTTTGTCTGT TTGTGGTTTG CAACCGGTGG GAAACCAGCA AGCAAGTGGC CAGTCATGGT AGACTCACAC ACGCACTTAT CCTGTCATTC TTTTAGGGAA GTATGGAGAC AGGAACTTCC TTGACGCGAG CAATGATTGA AGCCGTGCGG CACAAATGCG ATGTCATTAA TCTGTCATAC GGAGAGGGGT GTGCAATGCC CAATCATGGT CGATTCGTCG AGCTTGCCGA AGAACTCGTG TGGAAACATA ACGTGGTCTT TGTTTCCTCG GCGGGAAACA ACGGGCCAGC TATTTCCACG GTAGGAGCCC CGGGTGGGAC GTCGTCGGCT TGCATCGGTG TCTCCGCGTA CGTGTCACCC GCCATGATGA AAGCTGGTTA CAGCATGCCC GTCGACAATG TTGACGACAA GATTTCGTCC ACCGAAACCG GTACTACTCC GGACGAGCCA GATGCAGAGT ACCATACCGG TACGACGTAT ACCTGGAGCA GTGTCGGGCC AACAGCAGAC GGTGACAACG GTGTTGACGT GACGGCACCC GGTGGCGCCA TTACTTCAGT TTCAAACTGG TGTTTGCAGA AATCTATGCT CATGAACGGT ACCTCGATGA GCAGTCCGCA 
CGCCACGGGG TGTGTGGCAC TCCTGATTTC CGCGTGCAAG GCTGAAGGCA TTCCCGTATC TCCGGCACGC ATCCGTCGAG CGCTACAAAA TTCGGCCAAA CGTCTGCCAA ACCTGTCGAC GCTGCAGCAA GGATGGGGCA TGATTCAGGT GGATCGTGCA TTCGATTATC TGCAGGCCAA CAAGGACGAC GACACCGAAG ATATCTATTT TGATGTTCGA GTGGCCAACC GCAGTGGTTC ACCTCGTGGA ATCTATCTGC GACAAGCAGA CGAATCAGCG ACTCGACAAA ACTTTGCCAT TCACGTGGAC CCAAAGTTTC GTCCAGAAGA CGACATCAGT ACAGATAGTC AGCGACGCAA AATTGACTTC GAAATGCACT TCCAAATCGA AGCGTCCGAG CCATGGGTGA CAGTTCCCGA CCACTTCATG CTGATGAATA ATGGACGCAC GTTCAAGATT GACGTGGATC CAACTGGATT GGAGCCTGGC GTACACACGG CGAGAGTGTA TGGTCTTGAC TCACGAAAGC CAAGCCGTTG CGTGGTGTTC TCGATTCCAA TAACAGTGGT TAAACCTATG GAGACGAAGC ACGACATCTC ACTGGGCGCA TTAGAGTTCA AACCAGCCGA GATAAAGCGG TTCTTTGTAA GGCCTCCCTT GGGATCGACG TGGATGGACA TTACAATTCG TGACCTTCGA GATGCGAATA TAGATGGGGA GTCGTCGACA AAGCTCATAG TGTTGCATAC GGTACAACTG CTGCCACATG CTGCGTATCG CGATTTTGAA CAGCAAAAGC ACTACAACCT TCGGCCCTCG CAAACTGTTG TTGCATCCAT TGCCGTGGAG GATGGGATAA CCTGCGAGAT TGACTTGGCA AGATACTGGT CCACTCTCGG TACAACGAAA GTTGATGTAG AGATTCAATT TCGTGGAGTT CGTCCCGTGC CCAACAAAAT GACATTACGC TGCGGGGAGG GCGGTTCACT GGTGCGGGTA CACAGCGACT TGGCGGATGA GACCATCAAT CCAGTCGCTA AACTAACAAA ATGGCTTACT CCGTTACGTC CCAAAGCTGG TGCCGCAATC AAGCCAATGG GACCCCGTGA CACCCTCCCG TCACGCAACA AAGAAATATA TGAGTTGGTG TTGACGTACG AATTCACTCA GGAAGAAAAA GGCTCATTGA TTCCTCGAGC TCTTGGAATG CAAGGGATCC TATACGAATC TGTCTTTGAA AGCCAAATAA TGCTTCTTTT TGACGGTGAA AAGAAATATC TCGGAGTTGC TGATGCCTTC CCTTCATTCT TAACGGTACC AAAAGGATCG GTCACCATTC GTATGCAAAT TCGCCACGAC GATCCGTCGA AGCTCGAAAA TTTGAAAGAC ATGCCAATCT GGATTGAACG CAAATTGGAA AAGGAAATTG CGTTGTCTGT GTATTCGTCC AGAGAAGGAG TCATGTCGGG GGCTGCTACT TTTCGGAAAC GTGTGCTACA TAAAGGATCC GGATGCTCCG TCTTTTTCGG CGAACCAGCG TCATCAAAGC TCCCTGCTTC TGCCAAGACA GGAGATCTTT TGACAGGTAA CTCCACTTTC GGATCCGCCG ATGCGTCGCT CCCTGGCACC GGCAAGCGTC CCGGAGGATT TCCTCTCGCG TACTGGATTG GACCAAAAGC AGAAAAAACG ACGACCGATT CGGAAGCGGT TGAGCCAAAA GATGAACGCA CTCCGGAGGA AAAGATGAAC GATGCGGTTC GCGATTTGAA GGTCGAGCAC TTGGGCAAGA TTCCTGCAAC AGATAAAGAA 
GTAAATTCAT TCAATGAGTT GTACGCCAAA CTTGAACAAG AATTCTCTGA TCACCTACCT TTGCGAATGA TCAAGCTCAA GTATCTTGAA TCGCGGAAGG ATCGTGTCGC AATTTTGGAC GATATTGTTC AAGTCAGTGA GGCTATTGTG GGTTTGATCA ATGAAGACGA ACTTGCCCTT CACTTTGGAC GAAAAAGTGA TTCTGAGGAC TCAGCAGCAG TTAGGGTACG AAAGAAATTT TGGCTACCAC TCATTATGAA CAAATCCAAA CTTACACAGC TGCCTTGTTT TTCCCCAATG TTTAAACAGG ATCGTAACGA AATGAAGGAA AAGAAAAGCA TCTTGACGGA AGCTCTCGCT CGCATGGCCA TGGCATACGC TGACATAAAA ACAGAGGAGG CCAAGCCAAA GTTTGACGAA ACATTGAAGA AATTGAAAGC GTGGGTAGAT CTTGATTCGA CGTCGAAATA CACTCCCTTG GTACTTGAAA GGGAAGAGCG TGCGGGTCGG TACGGAATAG TATTGAAACT CATCAGCAAA TTGCTATCAA AAGAGGTAAA AGAAAAGGAC TTTGTGAAAC CGCTCTCCAA GCGCGATCTA CTGGAAAAGC GTGCCATAAT CTTGGGAACG CTTGGTTACT CAATTTTGGT GGAGCACGAT AAGAAAACTC GAGTGATTGC CTGTCCGAAA GCGTACGCTC TTTTTTAGAA ATTTGCGGAA CAGCAAACTA GTAAACAAGA AAGCAGTTAC
|
Protein sequence | MTETNVPTIP FPLIDLVPKK ETNVRRFLEA YPEYDGRNVI VGILDTGVDP GAHGLGTLPD GTTPKLLNVV DCTGSGDVDV STQVELQRHV NADADDGSDG NVHDANGNDT AVDDTPTPLT TPLPQVRLGV KRAYELFPAK LRERVQETRR QAFQAQLDRY VVDVRQQLAA WNVAHPKPPT SPEEAKVRDD LQARLDVLLD SEWNDDPGPL YDCVVFYDGT DYQAVVDVHE TGDLRNAQPF TSFAKSRQFG TLGTIDQMNY AVQFYNQGTI LSLVTDASPH GTHVAGITAA AEGERSGVAP GAQLVSFKIG DSRLGSMETG TSLTRAMIEA VRHKCDVINL SYGEGCAMPN HGRFVELAEE LVWKHNVVFV SSAGNNGPAI STVGAPGGTS SACIGVSAYV SPAMMKAGYS MPYHTGTTYT WSSVGPTADG DNGVDVTAPG GAITSVSNWC LQKSMLMNGT SMSSPHATGC VALLISACKA EGIPVSPARI RRALQNSAKR LPNLSTLQQG WGMIQVDRAF DYLQANKDDD TEDIYFDVRV ANRSGSPRGI YLRQADESAT RQNFAIHVDP KFRPEDDIST DSQRRKIDFE MHFQIEASEP WVTVPDHFML MNNGRTFKID VDPTGLEPGV HTARVYGLDS RKPSRCVVFS IPITVVKPME TKHDISLGAL EFKPAEIKRF FVRPPLGSTW MDITIRDLRD ANIDGESSTK LIVLHTVQLL PHAAYRDFEQ QKHYNLRPSQ TVVASIAVED GITCEIDLAR YWSTLGTTKV DVEIQFRGVR PVPNKMTLRC GEGGSLVRVH SDLADETINP VAKLTKWLTP LRPKAGAAIK PMGPRDTLPS RNKEIYELVL TYEFTQEEKG SLIPRALGMQ GILYESVFES QIMLLFDGEK KYLGVADAFP SFLTVPKGSV TIRMQIRHDD PSKLENLKDM PIWIERKLEK EIALSVYSSR EGVMSGAATF RKRVLHKGSG CSVFFGEPAS SKLPASAKTG DLLTGNSTFG SADASLPGTG KRPGGFPLAY WIGPKAEKTT TDSEAVEPKD ERTPEEKMND AVRDLKVEHL GKIPATDKEV NSFNELYAKL EQEFSDHLPL RMIKLKYLES RKDRVAILDD IVQVSEAIVG LINEDELALH FGRKSDSEDS AAVRDRNEMK EKKSILTEAL ARMAMAYADI KTEEAKPKFD ETLKKLKAWV DLDSTSKYTP LVLEREERAG RYGIVLKLIS KLLSKEVKEK DFVKPLSKRD LLEKRAIILG TLGYSILVEH DKKTRVIACP KAYALF
|
| |
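The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before their length or base composition can be compared with the Gene Length and GC content rows. The following Python sketch shows one way to do that. The helper names `normalize_sequence` and `gc_percent` are hypothetical, and how the database itself computes or rounds GC content is not stated in the record.

```python
def normalize_sequence(text: str) -> str:
    """Remove all whitespace between the printed blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C among A, C, G and T; any other symbols are ignored."""
    counted = [base for base in dna if base in "ACGT"]
    if not counted:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(base in "GC" for base in counted)
    return 100.0 * gc / len(counted)


# First two blocks of the Gene sequence field, used only as a short demonstration.
fragment = normalize_sequence("ACACAATCCA GCGACCACGG")
print(len(fragment), gc_percent(fragment))
```

Passing the complete Gene sequence field to `normalize_sequence` gives a string whose length can be compared with the 4360 bp in the Gene Length row, and `gc_percent` gives a value to compare with the listed 51%. Small differences could come from rounding or from a different counting convention in the source database.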