Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41559 |
Symbol | |
ID | 7199398 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011699 |
Strand | + |
Start bp | 162840 |
End bp | 167263 |
Gene Length | 4424 bp |
Protein Length | 1148 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185494 |
Protein GI | 219130695 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGTCGGTCGC TTGGCGTTGA CTTTGCGCCG CAGATTGGCC GGTATTTCTG CAGCGGAACC GCTGTCGTCA GTCACAACTG TCGTTTTCGG TTTGGCGAAA ACAGCCCAGG GCGATGCGCG ATCCTTCCTT TTCTGGGCAG GAAAGGCCGC GGAAATTGTC AAAAAGGCTC CATGTGACAC CGAGAACGAA GAAACAACTG TCGGTGCCAA AACAGAAAGA ATCGCGCAAC ACCGTACCGA TCGACGACAA CGGCATGTCC TCATGGTAAC GATGGCAATT CCTACGAATG TTTGCTTGCT TCTCTCAGAA ATATTCCGTT TCTTGTGAGT AGTGGTTGTC CGAAAGCGAT TGCCAAGTGT ATTTACAGAA GGTTGTTGTG CTGTGCAGTC ACCCGTACGC AAAATCGATG GTGTCACCGA CTCGCGGCCA GACGTGTCCG GGATGCCTCC GTCGGGCGAC TCCCACAATC CATCACCGAC GTGACCTCGC CGAGACACAG TGCGGGCGCC TCCGCAGTAC GAACCAAGCG ACAGGCATAC GGACACTGGC CAAACCATTG CCTAGCACCC GTTTGAGCTG ATTCTTTTTT TCTGACCCCT GGCAAGCTTT AAGAATCCAG AATGATGGAG ATGGAGGAGC ACGATTTGGC GCAACAAAAA GACCGTAAGA TGCCAGCCCA ACCTCTCCAT CCGAACCATG TGCGTATCTA CAGGAATTTG GAATTTCATT TAGTTCAACT TGCGTTTGTT CATCCTTCAA GCGGTATTCG TCTTCTATGT GGAATAGAGA GTCACGAAAT TTTGGTTGTA TTGAATACCG GGTCTATAAG GAACTGTCTC TGTTTCTCTG TGGAACAGTG ACGTGGCCTT CGGAGAGACA TCCAGTCTAC AGCTATCTTT CGTTACTGTG GTCGCGGATT TGGGATCGAC TGACCTCACC GAGGTTGTTC CCAGAGATGT TTCTGATTGG TTGATGTGTG CCAATTTAAT TTCGCATGCT CGGCTCTGAT TGGTGTTCGT AACCAAACGT TCCTTCCCCA GCGTGCATTG ACTTGACGAC ATCGTATACG ATAGCCGGTA AAGTCTCGAG CTGTTCTGAC AGTAAGGGAG AAGGACACGC GCTCGAAGTG CAACTCGTTT ACGTTTCTTC CGCGGTGGGG TTCCGTGTTT CTTGGCGTAG TGCGTCGTCG TTGTTGTCCT CACGGGGGCG GGGGTATCCA ATCTTGTCGG ATGGTTGTGG TGCCCGTGTC TCTCACAAAC CCCGTTTTCT ACTCGTTGCA CCCTTATTAT ACCCTTATCT GTTGTGTCTA TACCCTTTGC TGTGTCTTGG TTGGTCTTCA TTACCTGCCA TTTGCCACCT GTAATCTGCT ACCTTTGCAC TCTAATTCAC AGTCGCAGTC CACAAACGAG GCAGTTTCCA GGACTACCGA CAGTTCGAAC CACACTTTGC TACAGAATTC GAATCTCTCG CAGCAGCCTC CCCTCCCTCT TCTGCTTCCT GCCACTGATT CTTTGCGGCA TCCCGCAAAC CCTCTCTACA GCAGGAATCG ATCACACGAC ACGAATAGCG CTATAGGCGT TTCCGACCCC GCTACGCATA CGAGCCGTGC CATGAGCAGT CACACGTCGC TGCAATACTC GAGTTCCGGA GGCATCGCGA ACATCTCTAC CACAACCGAT CCTCCACACA AACGACTCAA GTTGGACCAT GCCATGAGCC ACACATCGCT CGGCAACCCA TCCTTGAGCT ATCACGATTT TGCCGCACAT TACGACAGTC GCAGTACCTT ACACACTAGT AGCACCATGG ATCTAGGCGT TTTGCGGAAA GAAGATTCCT TGGGCATGAT GCGCAAGGAC GGCGACGACG AGGACGACGA AAATGATCAG AACGACCCGA TATCCTCCAC AGCTGTACGA CAAGCGACGG TCCAACCTAC TGCTCTTCCG AATGAAAGTG CGAAACCCAC ACACCCCACT ACAGCGAACG TAGCCACCAC AAATTCCGTT TCGTCCTCCG ACAGTCTGCG CGATCTATCC GCACACCGTC CACAACATCC ACAGAATACT ACTCGTCTTC CCGTTTCTTC GTCAACGACT ACGGTAACAT CGGGTTCGAA TTCTCCGCTC TCTGCGGGGC CGGTATCAGC CCAAGCTCCT CCCTCGCCTC TGTTACCTCT CAAGGCTACC AAAATGTCAC ACCTCCGCCA AAAATACATG CAAGAACTAG AGTACATGCT GTGTGAGTTC CAAAAGCTGG AACGTCAGCT ACTAGGTGCC AAGGCGACGA CAGCCGAATC CGCTGGCAGC CGCGAACGTC GAGAAAAACT GCATTCGTTC ATCACGCACC TGAGCGATAC GATCCAGAAC ATACAGACCG GATGTCAGCT AGAGTCGGAG GGAAAATCAA CCGTCGGAGA AGCTTCCAAG CAAGATATAG CCCAGGAGGC CGCGCTGGCA GATTTGACGT GCGAAAAGGG GGAAGAGGAA AACGTGCAAA AGCTGGAAGA GCACATTCTA GCCAATCTGT TGCCCGTCAA AGTCCGGCTC AAGAAACAAC TGGCGGCCCA GCAAGGTGCC AAGCATAACC CGGCGGGGAT GCCGGTTGCG CAAAGGGGAC TAGTGGCACC GAGCGAAGGT GGTAAAGGCA CGTTTGCGGC AGCGGCCGAA GAGCGCAGAA AGCAATTGGC GGACGCGGCC GCCGCGGCAC AAGGCTTCGA TCATACACAC GTACCGGCGG AACCGGTTCA TCCAGACCAG ACACAATTTG GTAAACCACT ACAAGGAAAC GGCTCCTCGT TGACGCGAAA TTTGCATGGA TCCACTTTGG GATCCGCGAT TAAAGTGGGA ACGGATAAGT CCAAAATTTT GTTCGCTGGT TTGGCGATCG GATCGTCGCA AGTAAAGTCG TCGGTCAACG CAGCTTCGTC GGTACATCAG CTCGTAATTA AGGATCCCGC TTTGTTGGAG TTGGCTCGCC AACAGAGCGC GTCAAAACAA CAAGAGGACC TTCCACCGCA AACACAACAA GAAGACTCTC CAACGCAAAG CAAACCCAAT TCGCTGCTGC CTCCTTCCTC GTCCGAGCCG AATGACTCTC CAGAGGATAC AAACCGTAAG GCTATATCAC TAAAAGTTTC GCCTGCTGTT GCTTCTGCAG CAGCTTTGGC CGCGTCTGAG CAACCAGACG CAGTCTTGTC AAAGGCTCCA CCAAGCAGAT TAGATGATGT TGATGCCACC TACCCCGACA TGCCATCGGC AGCTTTAACC GATGAAGAAC GGCGAACCCT CCGTCGTCTC AAACGCCGAA AAAAGAGACG AAAACGCAAG GCCGAAGCAA CTCCAGTCAC GGCAGCGGCC ACGGCAGCAC CAGTGATCAA TCGCCATCAC AAGCCGACGA CAAAAAAACG GGGACCTCGG ACGGTGGAAT ACATGTGTGC TTTGTGTAAC GAAGTCTACA ATTCTACCTG TGATTATAAT CCTTGGTGGG CTCTGGCTCA ACATGATTGT CCAAAATGTC GAAAAAATCA GGTTCGTCGA CTTCGTGTAC CGAATTGCGC CCATTCATAT TCGTACCTTC TTGCGGAAGT TCTCACACCC TTTCTCTCTA CTGACAGATA CCGCGGGTAG ATATTAGCGC ACCTGCCAAT ACGATCGAAT ATCATCCGGC GTTGCTAGCT CACGCAGACG AAAATGGCGG TAGTACTCCG ACACCGCCTG CAGCAATAGT GAAGCCAGTC ACAACTGTGT CGGCTCCTGT CACTAGTGTG CCAAAATGTG GTAATGATTC CGATTCGTTC GGATCTGACT TGTCAGACGA TGATCTTGAC GGCCTGTTGT CAGACACTGA CTCGGAGGGC TCGGGAGAAA TAGGTATGGA AAGAATAGAT GCGCTATCGC CTGCGGAACA AGCAGAGAAT GAATATTTTG GGGTGGAATA CAAGGGGCCA AAATTGAAAG ACAGTGAAGC TGCTCGGCTA CTGATTCTCA TGGGGCATGC GTCGACCTGT CCTTGCAAGC ATCAATCGAT CAAACATCGT GAAACCTGCA GAAATACGAA ATGGATGATG TTGCATGTTC GGGATTGTCC AGGAACTACA TCTTCGTTTG ATGTCTGCCC ATTTCCATGG TGCCGCAAAG TCAAGCATTT GTTGTATCAT CTTGTCTCGT GTCGCGATGC CAAGCACTGT GAGATCTGCT CACCGACCAA GCTCAACCAA AATATGATCC TGTTAAAGGG GTTGAATCAG CACCGCTTCA TGCAATATAG GGAGCGGCTG ATCGGCCGTG GAAAGGCGTT GACAAAGGTG TCAAATAGTG CGCCGAAAAA TACTCCAGCT CAGGCGCAGC ACAAAAGTGT GTCATAAAGC AAACCGGGTG TAATTGGTAT TGTAGCTTTC ATCGACGTTT CGCAAATGCT GTAA
|
Protein sequence | VGRLALTLRR RLAGISAAEP LSSVTTVVFG LAKTAQGDAR SFLFWAGKAA EIVKKAPCDT ENEETTVGAK TERIAQHRTD RRQRHVLMVT MAIPTNVCLL LSEIFRFLMM EMEEHDLAQQ KDPGKVSSCS DSKGEGHALE VQLVYVSSAV GFRVSWRSAS SLLSSRGRGY PILSDGCGAR SQSTNEAVSR TTDSSNHTLL QNSNLSQQPP LPLLLPATDS LRHPANPLYS RNRSHDTNSA IGVSDPATHT SRAMSSHTSL QYSSSGGIAN ISTTTDPPHK RLKLDHAMSH TSLGNPSLSY HDFAAHYDSR STLHTSSTMD LGVLRKEDSL GMMRKDGDDE DDENDQNDPI SSTAVRQATV QPTALPNESA KPTHPTTANV ATTNSVSSSD SLRDLSAHRP QHPQNTTRLP VSSSTTTVTS GSNSPLSAGP VSAQAPPSPL LPLKATKMSH LRQKYMQELE YMLCEFQKLE RQLLGAKATT AESAGSRERR EKLHSFITHL SDTIQNIQTG CQLESEGKST VGEASKQDIA QEAALADLTC EKGEEENVQK LEEHILANLL PVKVRLKKQL AAQQGAKHNP AGMPVAQRGL VAPSEGGKGT FAAAAEERRK QLADAAAAAQ GFDHTHVPAE PVHPDQTQFG KPLQGNGSSL TRNLHGSTLG SAIKVGTDKS KILFAGLAIG SSQVKSSVNA ASSVHQLVIK DPALLELARQ QSASKQQEDL PPQTQQEDSP TQSKPNSLLP PSSSEPNDSP EDTNRKAISL KVSPAVASAA ALAASEQPDA VLSKAPPSRL DDVDATYPDM PSAALTDEER RTLRRLKRRK KRRKRKAEAT PVTAAATAAP VINRHHKPTT KKRGPRTVEY MCALCNEVYN STCDYNPWWA LAQHDCPKCR KNQIPRVDIS APANTIEYHP ALLAHADENG GSTPTPPAAI VKPVTTVSAP VTSVPKCGND SDSFGSDLSD DDLDGLLSDT DSEGSGEIGM ERIDALSPAE QAENEYFGVE YKGPKLKDSE AARLLILMGH ASTCPCKHQS IKHRETCRNT KWMMLHVRDC PGTTSSFDVC PFPWCRKVKH LLYHLVSCRD AKHCEICSPT KLNQNMILLK GLNQHRFMQY RERLIGRGKA LTKVSNSAPK NTPAQAQHKT FIDVSQML
|
| |