Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42758 |
Symbol | |
ID | 7196135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1007388 |
End bp | 1011488 |
Gene Length | 4101 bp |
Protein Length | 1346 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176706 |
Protein GI | 219109906 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.59748 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGCCA CACCGGTATC CCCCACTCCG GTGGAACCTA ACGATGCCGA CGACGAACCC GTTATGCCTT CGTTCCATAT CCCCATTCGA TCATCCATCC AATTTCCCGA CAACGACAGT CTGCACCATG CGCGATTCAT TGAAATATTT CCGGAAGAAA TGAGAGATAC CCCCGTCTCG ACACTCCTGC ATGTGTTGAA GGACGAATCG GCCGAACTGT CGACTTGGGC CGACGCCGGA TGGCACTATA TGGTACAAAA AAAGAACCGC GAGAGCTTGA CCATTCTGGA AGAAGCCTGC GACACCACCG CTGCTACAAC AACAACGACA CCAAGCAACG ACGTGGACAA GACGGAACGC GTGCGCATTC TCGCCGCGAC CGGGATTGCA CACTTGTCCA GTAACGGAGC CGCCGATGCC ACTTCCGGCA ACACCGCCAA ACGTTCCAAC GTCCTCGATG AAGCTCGCCA ACAGGCCGAT CAAAAGTTTA CCCAAGCGGG AAAGATTGAT CCCTTCTTTC CCATGACCTG GATTGGCCGC GGTATGCTCA ATTTGTGGCA AGGAAAACAC GATCAGGCGA CATTTTTCTT CCAAACCACC TTGAAGCAAT GTGGCCCCGT TTTGCCGGCC TTGCTCGGCA CGGCTGCCGT GTCTTTTGCT CAAGGAGACT ACACCGCGGC GCAAACGGCG TACGGACAAG CGCTCCGAAA GTACCCCCAT GCCAGTGGGG CCGCCTCCCG CGTAGGATTC GGACTCGCCT CCTACGCACT CGGACAAGTC GACCGCGCCA AAGCAGCCTT TCGACGAGCC ACCGCCATTG ATCCAGAAAA TGTGGAAGCC ATGGTCGGTA CGGCCATTCT CGATATGGCC AGCGTCGACG TGTCGGACAA AGACTACGCC GCCAAAATGG AAGAAGCGAT CCGCGTCATG TCCATGGCCA ATCTACTCAA TCACGAAAAC GCCATGGTGC AGAATCATTT GGCGAATCAC TACTTTTGGA AATGGACTCC TGTGAACGGC ACGGTGGAAG TGACCAAAGG ATCGCAGTTG GTTCAAATGC CGTCGCAAGC CGTGCCCTTG GAACCGGGAG AACGTATACG GATCGGAACC AAGTTTGAAA CCACCGTTCA AGATGTTGAG GACGATAGTA ACAACAACAC GGCTTCGACG TTTACCGTTA CCGACAGTTG GAGCGAAGGC TCCGCGACTG GCCTCAAGGT CTGGAAGAAG GACTACGATC GCGTAATTGC CTTGGCGAAA GGAGCCTACG GTAGTACCAA CGTTCAGTCC ATGCAGGCCG AATCTCTATT CTTCCTCGCG CGGGTCTACC ATGTGCGAGG GGAAACCGAC CACGCCCTCA AATTTTACGA AAAGGCCTGT AAACTGGCGC CAGCCTTGAC CCCGGCGCGG TTTGGATTGG CACAAACATT AATTGTGAAG GAAGACTATA GCGCAGCCAA GAAACAGCTG CAGCAAGTGT TGGCGACGGC GTCCAACGCG ACGGACGCAT TGGCATTGTT GGGATTGCTG GAAGCACGGT CCGGCAAGCA AGTGGAGGAA GGGTTGATGC ATTTGCGCAA AGCAACCGAA CTTGATCCCC TCAATACCGA TTTGGTAGTG TTGGAAGCCA TGGCGTTACA AAAGTACGAA AATAACAACG TCAAATCCCT CGAGCGGTAC AAGAAAGCGT TGCTGTTAAT GGAGCGAAAA ACCACGAAAG TGTCGTACGA GATCTACGCC AATTGCGGTG TCCTATGCCA CGAACTGAAG AGGCACGACG AAGCGCTGGA CATGTACCGA CGTGCGCTAG CTGTCTTGGA CGAGGATGGG TTTAAAGGAG CCGCCGTTTT TCAGCTAGCG GATGGCCCTG AAAGAGGAAG AATTCGACAC GTGGACAACG CCATGTTTAA CGCATTTGTC AATTCCAATC TCTTGGTGGA AGTGCTTGAT AGACACAGTA AAACCTTCGT CAAAGTACTG GATGTACTCG AGGCGGACGT TGCGGGAATA TTGTCGAAAG GGGATCGAGT CTCCTTGGGA GACGGATTTG AAACCAAGGT AGTGAGTTGG GAGAGCAGAG ACGGTGCTAT TGTGCTCGAG CTGGCTGACG AATACGATCC AATGGATACT GACTCCAAGA AGGCGCCTCT CTTGGTTGTT CGCGAGAACA ATCTTCTCTC GATACCAGAA GCAATCACTG TTGCTTTCAA TATAGCTCGG CTCCACGAAG CCACTGGGCG GACTGTCGCA GCGATTGAGA TTCACAAAGC CATTCTGAAG CGAAATCCGG CCTATGTCAA TAGCTATTTA CGTCTAGCAT GCATTGCTGT GGACTGTGGA TCATTGAAAG AAGGTTCAGA ATGGCTCAAG ATTGCGGCTA GTACTGCACC TGGTAATCCT GAAGTCTTGA CGCTGGTTGG AAATCTTCAT CTTTCACTTT GCGATTGGGC GCCAGCACAG TCCGTTTTTG ATGGCCTTCT TTCCAAGAAG ATCCCCAATG TTGATGCATA CGCTTCACTC AGTTTGGGTA ACATTTATTT CGCAAACCTG CACGTGAACG AGGACAAACG GTACGATAAG CATTTACAGT ATGCAGCCGA CTATTATCGC CGCATCCTCG CCAAGGATCC AGCCAACGCC TATGCGGCCA ACGGGATTGG CACTGTGCTA GCGGAAAAAG CTGAAATTTT TAAGGCCAAG GAAGTCTTCA ATCGCGTCCG TGAAGTCAGT GGAGACAGTA TTCCTGATGC CCTGTTGAAT CTCGGCCACA TTTTTTTGGC TCAGAAGAAA CATCCGGAAG CTCTCCAAAT GTACACGAAC TATATGAAGC GGACGGAAGA CGGGACGACG CCTACTACGG CCAAGAGCAG GGTGGATGAC GTGGTCAGCG TGTTGCTGTA TATAGCTTTT GCCTTTTTTG ATTGGGCCCG GCATACCGAA CTGGCCAATG ATTCGAGTGC AGCGCCAGCT GACGGGAGAT ATAGAGAAGC TATGCAGCAT TTGAATCTGG CCATTGGCAA AGGCAGCAAG CAAGATCTCG TACTCAAATA CAATCTCTGC ATGACCAAGC TGCAGGCAGC CAATTGTGTT TTACAAAAAC TGACACGTAA TATCCCCCGA TCTGTCGAGG AAGTCGAGGA AGCCTTACGA GGACTGGAAG AAAGTTTTCA GATTGTGGAA CAGATTGTCA AGGACAAGGC CGACGGAAAA AAGGTCAATA TTTCCTCAAC GACGTTGCAA GATTTCGTGA AACACTGCAA GGCTAACATT TTAAGTGCAC AATCTCATTT AGAAGACGAA CGAAAACGTG CCAAGGAGGC CGAAGTGGAG CGTGAAATTC GCCGGCTGGC CGCTGAAGAG GCGACAATCA AAGAGAGACT CAGAATGGAC CAGGCTGCAA TGGATGCCCA CAAGCTCCAA GAAGAAAAGG ATCAAAAGGC AGAGGCCAAA ATGAAACAGG TAGAAGAGCT GCAATCTAAT TGGCGCGAAG AGAAGGAGAC TAAGCAATCA GAAAAGGAAA AGCGGGCTCG AGGCCGAAAG GATGAAATGA CCGCGGACGA AGTCGGGCTT GTGGTGGAAG ACGACAATCA CCAAGCAACC AATGGTCACG GTTTGTTCGA TGATTCCGAC GATGACAGCG AAATAGTCGA TTCTCTACCG AACGAAACGA AGGGAATTGG AGGTCTAGAA AAGTCGACCT CTTCAACTAA GGATTTGTTT GGAGATAGCG ATGACGACCA GAGTGGTAAT GACGAGGACA GGAAAGGGAC CGTCAAGCCA GACGCTACCA AAGCTGCAAT CACGAGCATG GATTTGTTTG GAGATAGCGA TGAAGACGAC ATCGAGGTTG CGTACGGAGC GACCAAGCCC ACAAGCGAGG AGAGCAAAAA AGAGCCGCCT GCAACAAGCA ATGACTTGTT TGGAGACACT GATGAAGATA GCGATGCCGA ACCGTCCACA AACAGTGCAA AGAGACCCAA CGAATCAGGG ATTGCGGAAT TGGATGATGA TGGACAACCC AACAAGAAAG CAAGGTTGTA AAGGGCAAAG TGACGCAATA TTTTTATACA TAATTCGATA CTTTAAAAGA TGTTAAAAGG T
|
Protein sequence | MSATPVSPTP VEPNDADDEP VMPSFHIPIR SSIQFPDNDS LHHARFIEIF PEEMRDTPVS TLLHVLKDES AELSTWADAG WHYMVQKKNR ESLTILEEAC DTTAATTTTT PSNDVDKTER VRILAATGIA HLSSNGAADA TSGNTAKRSN VLDEARQQAD QKFTQAGKID PFFPMTWIGR GMLNLWQGKH DQATFFFQTT LKQCGPVLPA LLGTAAVSFA QGDYTAAQTA YGQALRKYPH ASGAASRVGF GLASYALGQV DRAKAAFRRA TAIDPENVEA MVGTAILDMA SVDVSDKDYA AKMEEAIRVM SMANLLNHEN AMVQNHLANH YFWKWTPVNG TVEVTKGSQL VQMPSQAVPL EPGERIRIGT KFETTVQDVE DDSNNNTAST FTVTDSWSEG SATGLKVWKK DYDRVIALAK GAYGSTNVQS MQAESLFFLA RVYHVRGETD HALKFYEKAC KLAPALTPAR FGLAQTLIVK EDYSAAKKQL QQVLATASNA TDALALLGLL EARSGKQVEE GLMHLRKATE LDPLNTDLVV LEAMALQKYE NNNVKSLERY KKALLLMERK TTKVSYEIYA NCGVLCHELK RHDEALDMYR RALAVLDEDG FKGAAVFQLA DGPERGRIRH VDNAMFNAFV NSNLLVEVLD RHSKTFVKVL DVLEADVAGI LSKGDRVSLG DGFETKVVSW ESRDGAIVLE LADEYDPMDT DSKKAPLLVV RENNLLSIPE AITVAFNIAR LHEATGRTVA AIEIHKAILK RNPAYVNSYL RLACIAVDCG SLKEGSEWLK IAASTAPGNP EVLTLVGNLH LSLCDWAPAQ SVFDGLLSKK IPNVDAYASL SLGNIYFANL HVNEDKRYDK HLQYAADYYR RILAKDPANA YAANGIGTVL AEKAEIFKAK EVFNRVREVS GDSIPDALLN LGHIFLAQKK HPEALQMYTN YMKRTEDGTT PTTAKSRVDD VVSVLLYIAF AFFDWARHTE LANDSSAAPA DGRYREAMQH LNLAIGKGSK QDLVLKYNLC MTKLQAANCV LQKLTRNIPR SVEEVEEALR GLEESFQIVE QIVKDKADGK KVNISSTTLQ DFVKHCKANI LSAQSHLEDE RKRAKEAEVE REIRRLAAEE ATIKERLRMD QAAMDAHKLQ EEKDQKAEAK MKQVEELQSN WREEKETKQS EKEKRARGRK DEMTADEVGL VVEDDNHQAT NGHGLFDDSD DDSEIVDSLP NETKGIGGLE KSTSSTKDLF GDSDDDQSGN DEDRKGTVKP DATKAAITSM DLFGDSDEDD IEVAYGATKP TSEESKKEPP ATSNDLFGDT DEDSDAEPST NSAKRPNESG IAELDDDGQP NKKARL
|
| |