Gene Information |
Locus tag | PHATR_44196 |
Symbol | |
ID | 7204109 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 1272675 |
End bp | 1276037 |
Gene Length | 3363 bp |
Protein Length | 989 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186216 |
Protein GI | 219113265 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.362095 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGTGTACCAG TGCTAGCTTC GTTCACTGGT GTAGGGTCAT ACTAACAAAG TCCGCTATAG TATACAGCGA AGGACCCGAA TACATAAGTA GTAAATTTTC AAGAACCTCT GTATTGGTTT GGCTTCACAG GCCACCATGG CACCTGCCAC TTGGCAAATG ACGGGCGGAG CGGTCTATGC GCACCTTTTG GATAACGTGC TTCTTCTTCC CCAAGGGCAC CCTATTCGTC TCAGTTTTGC ACAACAAGGA TACGAATCGG CCGATGACCT CCTATGTATT TTTGAGAATG AACTTGAAAC TCTTGAATAC ATTCCTCTTG CCCCTGCTGA CGGCCCCGAA ACTACGGCAC CGGTTGCCTT ACTCATGGCA TATTGACAGA TCATCTGTCA TTTCCTCCGG TGGCAAGCGT CCCTTGAGCG TCAAAAGGGA ACTCCTTTGA AGAACTCCGA GCTTGCAGCC CTGAACAACG AAGACTTTGT CCTGTACCGC CGATCCGCTC TCGGCCAGGT CTCTTCGACT GTTGCTCCAA TAGTCACAAA CCCCAATGCT GCAATCCCCA CCGCTAAAAC TCGACCTGCT GTGGAAGATT TCAAGCGTGG GATCAAATGA GACAAAACCC ATTACCCCGT GCTCAAAGAC GACAGGTACT GGGATAATTT CTATCGGTCC TTCGTCGTCA CTGCCGTCTC CCATAACGTT GAGAAGGTAC TCGACCCATC ATACTTGCCT ACTGATCCAC TGGAAAAGTC GTTGTTTGAA GAACAAAACA AGTTCGTATA CTCAGCCTTG GAGCATACAC TTCAGACGGA CATGGGCAAA AATATCGTTC GAGAACATAG TTTTGATTTC AATGCCCAGG AAGTTTTCCG TAAAGTGGTC AAGCACTATA CAGAGTCCGC CTCTGCAAAG ATCAGCTCCT CTACCACTCT AGGATACCTG ACCACGGCAA AGTATAGCTC ATCATGGACT GGCACAGCGG AGGGATTTAT CCTACACTGG AAGAATCATT TGCGTATATA TAATGATACC GTCCCTACGG GTGAGCAGCT CCCACAACAG CTTTGTCTCA GTCTATTGGA GAATGCTGTC CATGATATAC CCGAACTTCG TCAGGTTAAA ATCACGGCAA CTTTAGACTT AGCAAAAGGT GGCAGCCCTA TTAGTTATGA CGGTTATCTC AGTCTACTAC TTGCATCAGC ATCACTCTAT GACAATGGCA ACAACCTATC TAATGCTCGT GGCAACAAGA ACAAACATCA TGTTTATTCT ACTGACTTAG TCTACCATCC AACTGACTTC GACAATGATC TAGACGTAAG TTACGATATA GATGTGTCAC CCACAGCAAT CTATGAAGCC AATGCCCATG CACGCAACTC CGGTAATAGT GGCAATCGCA GTCGCAACGC AGCTAGCCCC AGAGACCGAC CTTATATTCC CCGGGAAATG TGGAATCAAC TCTCAGAGGA TGCAAATGGC CGGCCGCGCA ACCTTTTTCA CAGGTGCTAC AAGCCAATAC GCATAGCCAT GGTAGCAGCG AAACCGCGGA CACTTTCCAT GATTGCGCAC CGGAGACTGA GTTGTTGGCT CACCTTACTG ACCGCGTCAG TCGTATGAAC GATGGTGATA TTCGTAAAGT CCTTGCAGCA TCACGTGACA ACGTCTCCCC ACAACCAGGA GCGAGACCCA AATCCATGCA ATCCAATATG CTACGTTATC AAGTCTCTCG GCATAATGTC AACGGTACCA CTGCAGCTCT TGTCGATCGT GGTGCTAATG GCGAACTTGC CGGGGCGGAC 
ATCATGGTGC TCAACAAAAC AGGACGTTCC GCCAATATAA CTGGTATTAA TGATCACACA TTGTCCGATT TGGATATTGT CACCGCTGCA GGATGTGTTG AATCCCATAC CGGTCCTATC ATTGTAATTA TGCATCAGTA TGCGTATCTT GGCACTGGTA AGACTATACA CTCCAGTGCG CAACTCGAGC ATTTCCATAA CAACGTTGAA GACCATTCAC GTACAGTTGG TGGAGACCAG CGCATTGTGA CCTTAGATGA TTATATCATC CCCTTGCACA TCCGCCAAGG TCTTCCATAT ATGGATATGA GGTGCCCAAC AGATGCCGAA TTTACCTCTC TCCCGCATGT GATATTGACC TCTGATGTCG ATTGGGACCC GTCAGTCCTT GACAACGAGA TCGATCTGGC CACCGATTGG TACGACACTG TACAGGATTT ACCCCAACTA CCATATGTCG AACCGCGTTT TGACCACATG GGCAAATATC TCCATCGTCA TATTTCGCTT TGTGACACTC GCCACCATGC CGTTGACTGT ATCCTTCAAT GTCAGCAGCA TGAAATTCAG CGTAATGACC ATGACTACGA AACCCTCCGT CCTTGTCTTG GTTGGGTATC CGCCGATACC GTTCGTAAGA CTATACAGGC CACCACCCAG TATGCACGAG AGGTATACCA CGCACCGTTA CGCAAGCATT ATCAGTCGCG CTTCCCGGCC CTAAATGTCC ATCGGCGTAA CAAGCCAGTT GCCACCAATA CCATTTGGTC AGATACTCCT GCTGTTGATA GTGGTGCCAA ATTTGCGCAA CTTTTCGTGG GCCGCCGATC CCTTGTCACT GATGTTTATC CCATGAAAAC CGAAAAAGAA TTTGTTAACG CTCTCGAAGA CCATATTCGG TTTCGCGGCG CTATGGACAA GCTCATCAGC GACCGTGCAC AGGTCGAGAT TAGTAAAAAG GTCATGGATA TCACCCGTGC TTACAACATT GACCAGTGGC AAAGCGAACC ACACCACCAA CACCAAAATT TTGCTGAACG TCGCATTGCC ACTATCGAGG CTAACACCAA CAACATTCTC AATCACACCG GTGCCCCTGA CTCCACATGG CTTCTTTGTG TCACGTACGT GTGCTATGTA TTCAATCATC TCGCCCATGA ATCCTTGCAC AACCGTACAC CCTTAGAAGT CCTTACTGGT TCCACTCCTG ATATCAGTGT TCTTCTTCAG TTCCATTTTT GGGAACCCGT CTATTATCGA CTCGAAGATG CGACATTTCC GTCTGATGGT ACTGAACAAA CGGGACGTTT CGTAGGCATT GCTGACTCCG TTGGCGATGC TCTTACTTAT AAGATCCTCA ACGATACTTC TAATAGAATC CTCTATCGTT CCAGCGTGCG CTCTGCAAAC CTTCCCGGTG AAACCAACCT ACGCCTTACA TCACGGGATG GGGAGAATGG CCCTAAACCT ATCAACTTTA TCAAGTCTCG TCGAACCGAA AATCTAAATT CCTATGATTT AAAGGAGTTG CCTGGTTTCA CCCCCGACGA CGTTTCTCAC TGA
|
Protein sequence | MAPATWQMTG GAVYAHLLDN VLLLPQGHPI RLSFAQQGYE SADDLLCIFE NELETLEYIP LAPADGPETT APIICHFLRW QASLERQKGT PLKNSELAAL NNEDFVLYRR SALGQVSSTV APIVTNPNAA IPTAKTRPAV EDFKHDRYWD NFYRSFVVTA VSHNVEKVLD PSYLPTDPLE KSLFEEQNKF VYSALEHTLQ TDMGKNIVRE HSFDFNAQEV FRKVVKHYTE SASAKISSST TLGYLTTAKY SSSWTGTAEG FILHWKNHLR IYNDTVPTGE QLPQQLCLSL LENAVHDIPE LRQVKITATL DLAKGGSPIS YDGYLSLLLA SASLYDNGNN LSNARGNKNK HHVYSTDLVY HPTDFDNDLD VLQANTHSHG SSETADTFHD CAPETELLAH LTDRVSRMND GDIRKVLAAS RDNVSPQPGA RPKSMQSNML RYQVSRHNVN GTTAALVDRG ANGELAGADI MVLNKTGRSA NITGINDHTL SDLDIVTAAG CVESHTGPII VIMHQYAYLG TGKTIHSSAQ LEHFHNNVED HSRTVGGDQR IVTLDDYIIP LHIRQGLPYM DMRCPTDAEF TSLPHVILTS DVDWDPSVLD NEIDLATDWY DTVQDLPQLP YVEPRFDHMG KYLHRHISLC DTRHHAVDCI LQCQQHEIQR NDHDYETLRP CLGWVSADTV RKTIQATTQY AREVYHAPLR KHYQSRFPAL NVHRRNKPVA TNTIWSDTPA VDSGAKFAQL FVGRRSLVTD VYPMKTEKEF VNALEDHIRF RGAMDKLISD RAQVEISKKV MDITRAYNID QWQSEPHHQH QNFAERRIAT IEANTNNILN HTGAPDSTWL LCVTYVCYVF NHLAHESLHN RTPLEVLTGS TPDISVLLQF HFWEPVYYRL EDATFPSDGT EQTGRFVGIA DSVGDALTYK ILNDTSNRIL YRSSVRSANL PGETNLRLTS RDGENGPKPI NFIKSRRTEN LNSYDLKELP GFTPDDVSH
|
| |
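The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based and inclusive, which is consistent with the listed Gene Length, and that Protein Length excludes the stop codon. The helper names `span_length` and `coding_bases` are hypothetical and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_bases(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein: 3 per residue, plus an optional stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


# Values copied from the record for PHATR_44196.
gene_len = span_length(1272675, 1276037)  # Start bp, End bp
extra = gene_len - coding_bases(989)      # Protein Length in aa

print(gene_len)  # 3363, matching the Gene Length field
print(extra)     # 393 bases in the span not accounted for by coding sequence
```

A positive remainder is expected here because the record marks the gene as spliced. The remainder is only the number of bases in the span that are not needed to encode the protein; this sketch does not determine how those bases are distributed between introns and other non-coding sequence.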