Gene Information |
Locus tag | PHATRDRAFT_39515 |
Symbol | |
ID | 7195347 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 31570 |
End bp | 33853 |
Gene Length | 2284 bp |
Protein Length | 623 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183534 |
Protein GI | 219126586 |
COG category | |
COG ID | |
TIGRFAM ID | |
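The listed gene length is consistent with inclusive coordinates: End bp minus Start bp plus one. A minimal Python sketch of that arithmetic follows. The function names are illustrative and not part of the record. The intron estimate rests on an assumption the record does not state: that the gene span runs from the start codon through the stop codon, so any non-coding remainder is intron.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the record above.
length = gene_length(31570, 33853)   # 2284, matches "Gene Length"
cds = coding_length(623)             # 1872 nt for 623 aa plus a stop codon

# Assumption: everything in the gene span that is not coding is intron.
intron_estimate = length - cds       # 412
print(length, cds, intron_estimate)
```

A non-zero remainder agrees with "Is gene spliced | Yes", though the exact intron boundaries cannot be recovered from these numbers alone.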
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAAGCG TTACAGATAC TATTAGACAT TGGATGGTTA GAAGGCCTGT TACTACCGCT TCCTTTATGT TAGTTGACCT GTGCTATATC TTTGTACATA GTCACTGCTT CGTCCCCTGC CGTTGGCTCC CAAAAAGCGG CCATCATTTC TGTACTTTCC GCGCGACGAA AACACGATAC GATTTGACGA GACTTACGGG AATCTCTCGA CGTCAAAAAA AGCCACGTGC GAATCTTCAA CGCCGAAGGT AGTCACTATC GACTGACTGT GAGTCGGTGG CAGGAGAATA CGATGATCAA TCTACTATAG AGCACGGTTC AGGAAACGTG GGTTCGAATA CCACACGATG TCTTCTGTAT GAGTTGCCCT AGTTTAGCAA GACTCTTTTG AACCCAACCA GTATGCCGAA CGAGATCAAA ATGGGCAATA CAACCAAATC TTCCTACCAG TCCATCGTGA TTAGCGATGA ATTAAAGACG ACAGAGATCC ATGCTATAAA ACGAAATCAA GAACAGATAC GCAACGGAGT TTTCGACGTG AGCGTAATTG AAAGCGGATA CAGCAGTCAG AATCCGTACA GTTCACCGAG TAACGAGCAA GACGATCTAC TTACACAGTG CTTCAGGTTG TTGGGGTTCG ATTGCAACCC ACTCGTCGTA AATGAAGGCG GAAGAAATTC GGCGTCGGCA GATCTTCGCT TGGCTATGCT GTCAAACTTC TCTACAGCCT ATAACACTGT AAGCATCAGT CTCGCCTTGG CAATCTTACG GAACGTTCGT CCAGTAACTA AAACGAGCAA CCAAGCGTTT TGCTCTTCGG CACTTATTGG AGGTATGATT GTTGGGCAAT TACTTGGCGG CACTATAGGA GATGTAGTGG GACGGCATAG AGCAATGAGT CTTGTCATGC TCCTTCAAAT TGTAGCAGCG ATAGGTTCAG CACTCTCGTG TTCGTTGTTG ATGTGTGGTC GTGAAATTGA TCTGTATCAT ACGTTGACGG GTACGTATCG AGTAAAGTTA CTCCGTTACA TAAGTTTACC AATGTAAGGA CCCCACACTA ACCTCCCTTT ACTCGCAAAA CGTCAGCCTG GCGTTTCTTG CTTGGAATAG GCTGTGGTGG AGTTTATCCT TTGGCGGCTA CCATGACGGC AGAATCCACC CATTGCTCGC AGAAACGCGC CAAGCTAGTA GCACTAACTT TTTCCATGCA GGGTTTTGGG TACTTGGCTG TGCCTTTAAT AGCGTGGCTA CTGGTCGCCG TTCTCGGGGA AGGTTCGAAC ATTTCTTGGC GGGTCCTACT CGCTCTTGGC TGTGCCCCAG GTATTGTGCT ATCGATTGCG CGAACGCAGC GAACACATTC TCTACAAAAG AGACCAATAA GAACGAACGA TTTCTCATCG ACGGCACGTC CACGAACCGC TCCTGTTTCG ATACAAGATG CAGTTATGAA GGAACCCGAT CTGTTCAGGA AACTTCTTGG AACCAGCGGA TGCTGGTTTC TTTTTGATGT GATGTTTTAC GGCAACACGA TTTTTCAGCC TGTTGTCTTG TCTGCCGCCT TCGGACCCGC TGAAACTGTA TCAAAAGTAG CGCAAGATAC ACTGTTGATC AATGCCATGG CATTTCCAGG TTATCTCGCA AGCATTGTTT TGATCGGGCG CCAAAGTCCA AAATTTGTTC AAGCTCAAGG ATTTTTGCTA ATGGGTTTCA TTTACACAGC AATCGGCTCC TTCTTTGGTG AACTAGCCGG GAATCATTGT CTACTCCTAG GGGGATATGG GGCATCCTTT TTCTTTTCCA 
ATTACGGACC CAACTCAACG GTACGTGTCT TGCGTCGTTT GAGGAGACAG AAGATCTTTG ATAAATTTCT CACCGTATCG CCTCCTCTTG AGAGATACAG ACCTACATGC TTCCATCCGT CACGTTCTCC CAAGCATGTA GATCAACTCT CAACGGTTTT TGCGCTGCTT GTGGTAAGGT TGGTGCACTG TTAGGGGCAC TCTGCTTCAT CCCGATTTTA CATGTCTTTG GTGAGGCCTG GGTCATGTTT GCTTGTGCCG CAATTGCTTT CACTGGCTTT GTTTTGACGC TTCTTTTCGT AAACGAAGAG ACAGATTCGT CCGAATCCCT GAAAGACACA GCAGATCATC TTTGTTGCTC CACTTTTGAA GACCAGCAAG GAATGGGGAA CGAATCAATA GATTTGATTG TGACCCACGA TCGCAATTTA CCTCTGAAAG TCGTGTTCAG CCGGCCCTCG CTCTTTGACT ACTACGATGA TTGA
|
Protein sequence | MSLLRPLPLA PKKRPSFLYF PRDENTIRFD ETYGNLSTSK KATCESSTPK FSKTLLNPTS MPNEIKMGNT TKSSYQSIVI SDELKTTEIH AIKRNQEQIR NGVFDVSVIE SGYSSQNPYS SPSNEQDDLL TQCFRLLGFD CNPLVVNEGG RNSASADLRL AMLSNFSTAY NTVSISLALA ILRNVRPVTK TSNQAFCSSA LIGGMIVGQL LGGTIGDVVG RHRAMSLVML LQIVAAIGSA LSCSLLMCGR EIDLYHTLTG CGGVYPLAAT MTAESTHCSQ KRAKLVALTF SMQGFGYLAV PLIAWLLVAV LGEGSNISWR VLLALGCAPG IVLSIARTQR THSLQKRPIR TNDFSSTARP RTAPVSIQDA VMKEPDLFRK LLGTSGCWFL FDVMFYGNTI FQPVVLSAAF GPAETVSKVA QDTLLINAMA FPGYLASIVL IGRQSPKFVQ AQGFLLMGFI YTAIGSFFGE LAGNHCLLLG GYGASFFFSN YGPNSTTYML PSVTFSQACR STLNGFCAAC GKVGALLGAL CFIPILHVFG EAWVMFACAA IAFTGFVLTL LFVNEETDSS ESLKDTADHL CCSTFEDQQG MGNESIDLIV THDRNLPLKV VFSRPSLFDY YDD
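Both sequences are displayed in space-separated blocks of ten characters, so the spacing has to be stripped before any length or composition check. The sketch below shows one way to do that and to compute GC content in Python. The helper names are hypothetical, and the example string is only the first three blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for an empty one)."""
    seq = clean_sequence(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence, used only as a small demonstration.
example = "ATGGTAAGCG TTACAGATAC TATTAGACAT"
print(len(clean_sequence(example)))    # 30
print(round(gc_content(example), 3))   # 0.333
```

Running `gc_content` on the full gene sequence should give a value comparable to the listed 47%, assuming that figure was computed over the same span. Likewise, `len(clean_sequence(...))` on the protein sequence can be compared with the listed protein length.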
|
| |