Gene Information |
Locus tag | PHATRDRAFT_39392 |
Symbol | |
ID | 7195129 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | + |
Start bp | 381452 |
End bp | 385502 |
Gene Length | 4051 bp |
Protein Length | 1324 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183352 |
Protein GI | 219126204 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.192889 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAAGACA AGGTAACTGG CGCGGCATCG GAGAATGGCG ACGAACCCAA CAGCAACACT GGCAAAGGCA CCAATAAACC GAAAAGGCCC AGCCCCACCA ATTTCGGTCG CAGCTCTCGT CATGCTTCCC ATCCCAGTCG CGGTGGCGGA GCACAGCGCC GAGTGTCGTT GTATGAAAGC AGCAGCAGTG CCATCAGCGC GTTGAGCGGC AGTACCGGCG ACGACGATGA CGACGATTTT GACTTTTCCC ACATCGCTCC GCTCGGTCGC TCGAAAGGCG AACGGACGAT CAGCACTACC ATACGCCTTT CCAAACAGGG CTTTCGACAC AAGCGGACCG ACAACTTCGC TGTTCAATCG ATCCGACGGG GCAGCGAAGA CTGGAAAAAT CTGATCAAGC CATTCGTGGC CGATTTGGTC TTTCGCTCGC TCATTTGCCG CCGTTTTCAG TCGGAAGTCT CGTTTAGACC GTACACCGCC CACGCTGCCG TTCTCTTTAT TGATTTATCC AGCTACTCCA AAATTACCAC GGCCATTGCT CATCGCGGTG CGCACGCACT CAGCTCTATC GTCAACGCCT ATTTGTCACG CCTGCTCCAG TTTGTGCACA GCCACGGTGG CGACGTTGTC AAGTTTGCCG GGGATGCTGT GCTTGTCGTG TGGGAAGGAG AACAGAGCGA TTTAGCCATG AATGTTCTTT GTGCAGCAAG ATGCGCGTTA GAAATGCAAA AAACGGCCGG GTCGCATCCG ATAGACGGTA GTTCCATGAT ATTTCAAATA CATTGTGGAT TGAGTTGCGG ACGTATCGAG TCGGAAATTT TTGACGCTCC GACACACGTG AACATGCAGC GTTACTATCA CGCTGTTAGT GGCGAAGCAC TGTTGGAAAT CAGTGAGCTT GTGGATTTGG CATGTGCGGG GGAAATTTGC GCATCCGAAG CATGTCTCGG GCTCTTAGGC AGCCATGGAC GGTACCGTAT TATCGATCTC GCCAAGCAGA CCGAAGTGGG TAAATTTGGC ATGGGAAAGA TTTTGACCCA TCTAGATGTG GAGGAGTCCC TATCAAACGA GATGGAACTT CACATTGAAA GTACTCTCAT GGATCGTATG GCGCGACGAA ATAAGCATAT TGAAGAGAAT TTCATTCATC CAAGTGTTAT TCGCCAACTC AACCACGGTG GATTGTCACC AACACAGATT GCGCAGATGC GCAGTCTTTG CGTTCTATTT ATTGCTATGA CTTCCCATGG TAGTGCAGTG AATTGGTTGA TGGAAGTTCA AGGGGTGTTA GACAAGTACC GCTGCCCCAT TGTACAAATC ATCGACGACG ACAAGGGCGT TCACGTTACC GCAGCCATCA ATCTTTACGA AGCAGTTCCA GAAGCCAGTA TTCTAGGTCT GGATCTTTGC AAAGAGCTGG TCGACAAGCA CGTTGGCTGT GCGATTGGAA TGGCTGCGGG AGCTACATTT TGCGGCGTCA CAGGCTCGAG TTCAATCGCC TGTCGCTGGG ATATTACCGG AGGCCCTCCC GTTCGAGCCG CGCGTCTCAT GCAATTCGGT ATGCAATTTG GTCACGAGGT TGTTCTGGAT CAATCAATCT ATGACGACCC AGTGGCAGCA GTCCGCATGG TTGCGTTAAA CGCTGGAATC TACATCAAAG GAACGGATGG GGTGAGTCCA ATATACACAC TCTCCGAGTC TACAGACTAC TCAGCATTTC GAGTATTAGA GACGGTCCAT GGCGCAGTAC ACGACGATGT AGTCCGGAAA ATTCAAAAGA TGATTAACGG AGAAAGCACT 
CGGTGCGCGT GTTTAGTCAC CGGGCCAACG CTCTCTGGTA AGAAGATTGT TTGTCAGAGG GCGGCCGGCT ACGCTGACTT GGTTCCTTAT CTCCACGTTT GTGAAGAAAA TAGCGGCCTA TTACAACTGG CGAAAACGAT GGCCACTTGG TTCAAATATC TTGATGACGA CGTCGTAAAG CGTGGAGCAA AGGACGTCCT TGACTACTTG GAGAAAGGCC ATTGGACAAG GGCGCACGAT GAATGTGTCC GACTTGTCAC TTTGATCATC GACAAAGGAA TGAATGCTTG TTTCGTCGTA GATCGAATCC AGTTTCTAGA CGAGTTTTCG TTTTCGCTCA TCCGTGAGTG TCTTCGAGGA AGAGCAAAAG TCGATCGCCT ATCGAGCCGC CTCTCTGTCG GATCCGAGTC CTCCGATGGT AACGCAACTG GACGTATTTG CTTTCTTTGC ACGCACGTGC CCCTCTACAA CTGGATGTCA GCAACGGATA TCGTGGAACA TATTGCGCGT TCGCACCAAA GGATAAGAAT TCCGATTTTG ACTGTGGCAC AAACAGATAT AGAGTCCCTT CGTACTATGT TTCGAGATTT GGCTGACATG AATGTGAGCG ATCGTTGGCT CACTACGTAT GCGCAAGGTT CCGGATATTG TGCTGGATAT TTTGTAAGCA TACTCGTTTC TGAAATTGAG AGGGATTTGA CGAGAACTGA AGCTGATCAA TGTATTCTTC TTCTTTTAGG TTGAACGATC AGCAGCGTCC AGAATCTTAT CTGCCCAACT ATGGAATGAA GGAAAACCCG GCTATGCTGT GACGACGGAG GACATCACCC TGTACATTCC TCCAGGTCTC ATGGGAAAAA ATTTACACAT GACAGTTCAA CAAACATCAG CAGAGATTGT TATGAGATTT TCACAGGTAT TCGATGAGCT TCCACCCCTC TTTCAGACTT TAGCAAAGAT CCTGACGATT GCGACCCGAA GGAAATTTTT CAAGCTACCT CGCACCATTC TTTGGGAGGT ATTGAATGAT TTGGTGTCGG AAGGCGTAGA TTCTGAGCCA ATGGCTGCTG TCTTGAACGA ATTGGTAGAT ATGTTTCTGT TGAAGATTGA ACAGGTAAAA AACGAAGAGG TGGTGTCGTT TCTTAACCCT GCCTTTCTGG ATATTATTAT TGACGTTTGC ACCCCTGTGC AAATCCGAAG TATCGCGACA GCCCTTATCG AACGCCTTGA ACCAATACTT AAAAGGGACT TCCGAATCCC TCTCGTCATT GCAACGCTCC ATGGGATGCT TGGGCAACTA GAATCGATGC AGAGACACAT GTGGTCCGAA GGCTTCAAGG TACTCCTGCA AGAAGGCAAA GACTGGCCCG AAAGCGAAAG GATGCGGTGG TGTGAATTGA TTCAAGACGA AATTTCAGCT GCAGGATTTA ATGTTAACGA GATCCTCGCT GGTCAAGCGC TCTGTCAAAT ACCGGCAAAG AAAGCTATTA GCCATCGACT GCCTATGCTC AAAATATATC AAGGACCGAT TACGATTGGA CCTATGAGTC ACGCACTTGG AGTTCTTTTT ACAAACATCT TTCATGAATA CGGCGTATTC CATGGTGCTC GAAAGTCTTC AGTTTGTCGG CTCAGAAGAT CAATGGCTAG TTCATCCAGC AGATACTTAC GCCAGATGGA ATGCTTGGAA AATTTTCTGG ACGGGTACGG ACTAAGTATT TCACGTGAAG AAAAGGAAAG AGAACGCGAA ATAATCACGG CTCTGGCGAA GCCTGCCAGC CGCGAAGAAG ACGCTGTAGC AAAAGCTTTC 
GTTGTACTCG ACGACTATAT TCCTCAATTT ATCGAACCAC GGATGGAGCG GCTCTATTCT CTAGTTGCAA AACTTAGAGA GGAAGGGATT CCTGAGGTCG TCATGAATGC GCAGCCTGCC ATACGGAGAG CATATGAAGC ACTGCAAGTC CCAAAAAACC GGAACGATGC TGCCCAGGAT GCGCTGATGA CTCTTGCCAC GTTAAATTGG AAGCCTAAGC CGGTTCCGGA GCAACTCTGC CTTATCTATT ACCAAACTGT TGCTCGACTT CGCAACAAAG TGCTAAAGCG CTTGACACCT AAGCAGTTGG TTGGCTTTCG ACATCAGCAA TCAGTTGACG ACCTGGAGGC GTTCTTGGTC GTCACACCTC TTCTGTACCG ACTCCAAGCG AAAAACCTTC AGGGCATGAA GAGATCGACG AGCACTGATT TTTACGAGTG A
Protein sequence | MKDKVTGAAS ENGDEPNSNT GKGTNKPKRP SPTNFGRSSR HASHPSRGGG AQRRVSLYES SSSAISALSG STGDDDDDDF DFSHIAPLGR SKGERTISTT IRLSKQGFRH KRTDNFAVQS IRRGSEDWKN LIKPFVADLV FRSLICRRFQ SEVSFRPYTA HAAVLFIDLS SYSKITTAIA HRGAHALSSI VNAYLSRLLQ FVHSHGGDVV KFAGDAVLVV WEGEQSDLAM NVLCAARCAL EMQKTAGSHP IDGSSMIFQI HCGLSCGRIE SEIFDAPTHV NMQRYYHAVS GEALLEISEL VDLACAGEIC ASEACLGLLG SHGRYRIIDL AKQTEVGKFG MGKILTHLDV EESLSNEMEL HIESTLMDRM ARRNKHIEEN FIHPSVIRQL NHGGLSPTQI AQMRSLCVLF IAMTSHGSAV NWLMEVQGVL DKYRCPIVQI IDDDKGVHVT AAINLYEAVP EASILGLDLC KELVDKHVGC AIGMAAGATF CGVTGSSSIA CRWDITGGPP VRAARLMQFG MQFGHEVVLD QSIYDDPVAA VRMVALNAGI YIKGTDGVSP IYTLSESTDY SAFRVLETVH GAVHDDVVRK IQKMINGEST RCACLVTGPT LSGKKIVCQR AAGYADLVPY LHVCEENSGL LQLAKTMATW FKYLDDDVVK RGAKDVLDYL EKGHWTRAHD ECVRLVTLII DKGMNACFVV DRIQFLDEFS FSLIRECLRG RAKVDRLSSR LSVGSESSDG NATGRICFLC THVPLYNWMS ATDIVEHIAR SHQRIRIPIL TVAQTDIESL RTMFRDLADM NVSDRWLTTY AQGSGYCAGY FVERSAASRI LSAQLWNEGK PGYAVTTEDI TLYIPPGLMG KNLHMTVQQT SAEIVMRFSQ VFDELPPLFQ TLAKILTIAT RRKFFKLPRT ILWEVLNDLV SEGVDSEPMA AVLNELVDMF LLKIEQVKNE EVVSFLNPAF LDIIIDVCTP VQIRSIATAL IERLEPILKR DFRIPLVIAT LHGMLGQLES MQRHMWSEGF KVLLQEGKDW PESERMRWCE LIQDEISAAG FNVNEILAGQ ALCQIPAKKA ISHRLPMLKI YQGPITIGPM SHALGVLFTN IFHEYGVFHG ARKSSVCRLR RSMASSSSRY LRQMECLENF LDGYGLSISR EEKEREREII TALAKPASRE EDAVAKAFVV LDDYIPQFIE PRMERLYSLV AKLREEGIPE VVMNAQPAIR RAYEALQVPK NRNDAAQDAL MTLATLNWKP KPVPEQLCLI YYQTVARLRN KVLKRLTPKQ LVGFRHQQSV DDLEAFLVVT PLLYRLQAKN LQGMKRSTST DFYE
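
The derived fields in this record (Gene Length, GC content) can be rechecked from the coordinates and the sequence text. The following is a minimal sketch, not the database's own method. It assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the listed length (385502 - 381452 + 1 = 4051), and that the space-separated 10-base blocks are purely a display convention. The function names are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated display blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp rows of this record.
    print(span_length(381452, 385502))  # 4051, matching the Gene Length row

    # Only the first three display blocks of the gene sequence, for illustration.
    prefix = clean_sequence("ATGAAAGACA AGGTAACTGG CGCGGCATCG")
    print(len(prefix), gc_percent(prefix))
```

To compare against the GC content row, pass the full Gene sequence field through `clean_sequence` before calling `gc_percent`. Because the gene is spliced, the genomic span is longer than three times the protein length.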