Gene Information |
Locus tag | PHATRDRAFT_40278 |
Symbol | |
ID | 7195870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | + |
Start bp | 584499 |
End bp | 587303 |
Gene Length | 2805 bp |
Protein Length | 934 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184164 |
Protein GI | 219127900 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGACG CTATATTCTT CTCTATCGTA GCTCTCGCTA AAGCGGGTAT TGAGTGCTGT CGAAACGCCC AAATCTGCAA GGATGAGGCA GCCCGCATCG GCAAGCGTCT GACGATAGTG GTCGCACGAG CTAACGAATG GGGAGCGGTT TGTGAAAGTG CGCGCCTTTT TCATTTCCAT GAAGTCGTGG AGAATGCCTT TCTCTGCTTG CAAGCCGTTT CATCGCCTAG AAGCAAGCGT TCCTCGTGGA ACAAATTGTT CAAGTCTGCG TTGCAATCCA CATCTTTGCT CGACCAAATC CTTGAAGCAG AGAGTCATCT GAATACTGCC ATCAACGATC TACAGATGGA GCAGTCCAAC GCCATCTTCT CACAACTGGT AGACGTCTCG AAAGGGGTTG CAGAGTTGCT TGACCACTTT GGAACTCTAG CTATGAGCCA ATCGAATCCT TCCGTGACCA TCCAGCAACA GTTTGATAAG GTGTTGGCGG ATGCACAGGC ACAAGCCCCA GAAGTTGCCG TTTCTATCAC CGGTGACACA ATAAACCATG CAGTACGAGA GTATGACTCT CTCAACCATG CGGAAGCTGA GATTGTTTTA TCTCAACAGC AACAAAAGAT ATTGTTTCGT CCATCAAAAG AGGATGTACT TGCTATCTCG CTTAATTCAT CATCACTGAA GTTCAGCGAT GACCGAAGAA ACCTTCTGGG CGGTGGAGGA TTTGCAGAAG TGTTTCGTGG AACCTATGAC CACCGGCCAG TCGCGGTCAA GCGCCTCAAG GCGTACGACG GAGATCTGGC GTCTCTCTCT CAAAACCAAA TCTCTCGTGA TGTGGAACGC CTCGCTGCAG AAGCCATTTT GACGCACAAG TGCGGCAAGC ATTCCAATAT CATCCACATT GTTGGATGTC TTACCATACT GAATGAAGTG GAGAGACCTC TCATTGTCAT GGAGCTAATG CATACAACTT TATTTGACGC TCTACATGAT CAAAACCAAA AGGATACTAT GGGATATTCT CGTCGGCTGT ATCTGTTAAA AGGTATTGCC GGAGCCTTAG AGTTTCTTCA TCTGCAAGGA ATTGTCCACC ATGATATCAA GTCTCTAAAT ATCTTGCTGA ACAATCAATT GACTGTTGCC AAGTTGGCTG ACTTTGGCGA ATCTAAAGTA AAAGGCCTCC ACACCACGAA ATTCCGTCTG AATACAATAA TGGCCACCAC CAGTCACAAG GGCAATCAGG TAGCAGGTAC AGCCGCCTAC CAGGCACCAG AAATTCTCTC TGAAGAAGTT AGTGACACAT CACGCGTTTG CGAGATGTTT TCGTTTGGGG TAACAGTGTG GGAGTGCCTA ACAGGCAGCA TTCCACACAT GGGTAAAAAG GAATCATCTA TTGCTCTTTT AGCTGCAAAC AAGAAGCACC TACCCATGCT TGATGTGCCC TCACGTCCGG TTATAGATCT TCCAGAGATG GAATTGGCTT CCTGGAAAGC ACTGAAGGTG ATTGCAACAA TGTGCATCTC GCGCGATCGC TCAATTAGGC CGACTGCTTC TGTCGTAGTG GGACTTTGGC ATAGAGTAAA GACTGCAGGA AAGGTCGAAC CTTTTTCGTT CTCTCTCGAA AACTTGTCTA CAGCCAAAAC TGGTGGCATT GTCAACACCT TGTTGCCTAC AGATTTCGGG ACCCAGGGAT CAGCAAGTCA AGCACAAGAC CACGAGGAAG AGTCCAAAGC TGAATATGGA ACGTTTTCAA AGAAACGTTG CTACAGTGGA CTGTCCGTCT ATGCCAGTAT TGTGGTGTTG 
CTTGGAAGCA TAGTTGTATT GACTGTGACC CTAGTGCCTA GAACGTCTCC GGAGTCCCCT TCGCCATTGT TGGCTCGCCT TTCCTTGCAA ACCACAGAGG ATTTGTATGA TGCTGTTGAT GTCTATGTTG GCGCAACTCG CCCTATAACG TCCATAGCAG CCACTAAATA TGGATATCCT ATTGGATCGT GGGATGTGTC GCAAATCACC AACTTTACTC AAGTTTTTGA CGGATTGGGT CGAAATCGTG CCATTAGGAT GTTTGATGAA GATTTAAGCA GTTGGGATGT TTCGGCAGCA ACAACAATGC AGTCAATGTT CAACGGTGCT GATGCTTTCA ATAGTAACCT CTCAGCTTGG AATGTGAGCC AGGTGACCGA CATGAGTTTC ATGTTTTGGG GCGCATCAAC CTTTAATGGA GACCTTTCGT CATGGAAGGT AGACCGGGTT GAAAATATCA AGTCTATGTT TGGGGGTGCG GGCGCATTCA ATGGTGATCT CTCGGCATGG AATGTGAGCC TGGTAACCGA CATGAGTTTC ATGTTTTGGG GCGCCTCCAC TTTTGATGGG GACCTTTCGT CATGGAGCGT AGACCGGGTC ACAAGTATGG AGTCTATGTT TGAGGGTGCA GGAGCCTTCA AAGGTGATCT TTCGTTGTGG AATGTGAGAC AGGTAACAGA CATGAGTTTC ATGTTTTGGC GTGCATCTTC TTTCAATGGG GATCTATCAT CATGGGATGT AGAGCAGGTC ACAGGTATGC AATCTATGTT TCAGTTTGCA GCAGCTTTTA ATGGTGACCT GTCAAACTGG GATGTTAGAA ATGTAACAAC AATGCAAGAT ATGTTTTATG GTGCAACTTC CTTTACTGGC ACCCTGTGTT CCTGGCTGGA AAAACTTCCA TTGGACTGCA ATGTTGATAA AATGTTCTTT ATGGCACAGT CCTGCACAGA CAGAGCAGAC ATCATACTGC CAATTGGACC AATGTGCCAT ACTTGCACAA CATAA
|
Protein sequence | MADAIFFSIV ALAKAGIECC RNAQICKDEA ARIGKRLTIV VARANEWGAV CESARLFHFH EVVENAFLCL QAVSSPRSKR SSWNKLFKSA LQSTSLLDQI LEAESHLNTA INDLQMEQSN AIFSQLVDVS KGVAELLDHF GTLAMSQSNP SVTIQQQFDK VLADAQAQAP EVAVSITGDT INHAVREYDS LNHAEAEIVL SQQQQKILFR PSKEDVLAIS LNSSSLKFSD DRRNLLGGGG FAEVFRGTYD HRPVAVKRLK AYDGDLASLS QNQISRDVER LAAEAILTHK CGKHSNIIHI VGCLTILNEV ERPLIVMELM HTTLFDALHD QNQKDTMGYS RRLYLLKGIA GALEFLHLQG IVHHDIKSLN ILLNNQLTVA KLADFGESKV KGLHTTKFRL NTIMATTSHK GNQVAGTAAY QAPEILSEEV SDTSRVCEMF SFGVTVWECL TGSIPHMGKK ESSIALLAAN KKHLPMLDVP SRPVIDLPEM ELASWKALKV IATMCISRDR SIRPTASVVV GLWHRVKTAG KVEPFSFSLE NLSTAKTGGI VNTLLPTDFG TQGSASQAQD HEEESKAEYG TFSKKRCYSG LSVYASIVVL LGSIVVLTVT LVPRTSPESP SPLLARLSLQ TTEDLYDAVD VYVGATRPIT SIAATKYGYP IGSWDVSQIT NFTQVFDGLG RNRAIRMFDE DLSSWDVSAA TTMQSMFNGA DAFNSNLSAW NVSQVTDMSF MFWGASTFNG DLSSWKVDRV ENIKSMFGGA GAFNGDLSAW NVSLVTDMSF MFWGASTFDG DLSSWSVDRV TSMESMFEGA GAFKGDLSLW NVRQVTDMSF MFWRASSFNG DLSSWDVEQV TGMQSMFQFA AAFNGDLSNW DVRNVTTMQD MFYGATSFTG TLCSWLEKLP LDCNVDKMFF MAQSCTDRAD IILPIGPMCH TCTT