Gene Information |
Locus tag | PHATRDRAFT_43575 |
Symbol | |
ID | 7197309 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 851928 |
End bp | 854378 |
Gene Length | 2451 bp |
Protein Length | 737 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177700 |
Protein GI | 219111897 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
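The coordinate and length fields above are consistent with each other, and the check can be sketched in a few lines. This is a minimal illustration only: it assumes the Start bp and End bp values are 1-based and inclusive (which matches the stated 2451 bp gene length), and the helper names `span_length` and `coding_length` are hypothetical, not part of any database tool.

```python
# Values copied from the Gene Information table above.
START_BP = 851928
END_BP = 854378
PROTEIN_LENGTH_AA = 737


def span_length(start: int, end: int) -> int:
    """Length in bp of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein: 3 per residue, plus an optional stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_length = span_length(START_BP, END_BP)
cds_length = coding_length(PROTEIN_LENGTH_AA)
noncoding_length = gene_length - cds_length

print(gene_length, cds_length, noncoding_length)  # 2451 2214 237
```

Under these assumptions, 237 bp of the gene span do not encode protein. That agrees with the record marking the gene as spliced, although the table alone does not say how this remainder divides between introns and flanking untranslated sequence.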
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.835069 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTTATGCAA GAGATCGAAC AGCACTCTTA GGCGAAAAGC ATCATCTTTT GTCAGCAATC ATATCAACAT TCATGCAGGT CGCTCGCATC GGATCCAGGC GCTCGGCAAC AAAACTACTT CGCCCACAAT CGACTGGTCC TCCTCCTACT GCTGCAGCTT CTCCCCGTCA AGCCTTGTCT AGAAGCTCTG TAGCATGCGT TAGTCCGTTT GAGAATCGTG GAATTCATTT TCATTCAGGA AAATTTCCTT CACCCTCTAC CAGGCGTAAC TTTTCCTCGT CTTCGATTGA TGGAATCGAC ACCGTGGAAC ATGCGCTGGC TTCTTCCAAT GTGAAGGAAG CACAATCGGC ACTGGACAAA ATTCCGGCGG ATTCTTCACT TACACTGCCT GACCTTCAAA GTCTTCGCGG TAGAGTTCTG GATGCTTGGC TGGAGCATCA AGAGAACCTC CTAGAGATTT ACAACTACAC GAGTCCTGAA CGGAGTCATC TTCGTGAAAT CTGTCTCGCT GCTGAGTCCG CTCACAATTT ACTAGAACAG ATAGAACCCT TGTTTTCCAA CAGCAACTTG ACATCCTACA GTATCTACGG TGAAGAATTC AAATCTGATC TGGACGTGAG TGATGAGGAA GCGCTACGAC GGCCCATGAA GCCGTATAAC ACAGTGCTTA CCCAGCGTTG CAACGCCGTC TTGAGAGCAT GGGCTCGAAC ATCTCGGGCC GGTCAAGGCA TTAACACTCG CCTTACACGA GCAATTCCAC AGCGAGCTCA ATTTCTTTTA GAGGGTATGG AATTGTCGTA TGAAAATGGC AGCGACCGAC TGAGGACAGT ATTGCCCACT GTGGAGAGTT TTAACCAAGT TTTGGAGGCT TGGGCGTACA GTGACGAGCA TCTGCGCGGT GCTATGGCAG AGCAGTTATT TCAAAAGTTA AGGCATGGCA ATCCTGCAGC TGTCCACTTC AATGGAAGAT CGTACCGACT AATCATTGCG GCCTGGTCTT GGAGTCGTGA GCGGAGGCAC GCATTCAATG CCACTGGTCA CTTAATGAAG ATGTTGCGTA AATTAGAGAA GGGCGACGAA AGCATGGAGC CTACCATGGA TGACTATCAT ATGATTCTCA AAGCCTGGAC GAATGCTGAG TATGTGGCTA GAGAAAGGCA TTTTTGAATT CTGTGTCGTT TATGTTGGTG GCCGGGAAAC TCATTTGTTT ATTGTCTTCA TTCTGTCATC TTTTCAGGGA TAGGATGGCT CCAACAAAGG CTGAATCTGT GTTGCAGCTT ATGGACTATG CATATAACAA CGGAATAACA TTTTTGCAAC CCGACATAAC TTGCTACCGC TGTGTGCTGG TGACTGCTGC TCGCCGTCGG TCGCTTCCTG AGCTAGGTAG GCTGGTCGAC AACCTTCTAA TGCGCATGAA GGAGCGTCTC ATGGTTCCTG ATACACTGTG CTATCAATCG GCAATAACGA CGTGGAAAAA TGTTGCTCTG AATCACGAAT TCCCGGAAAA TGTGGAATTA GGAGTGAGAC GAACAATCGA GTTGCTGACG GAGATGAAGT TGGCTGAAAG CCGCAGCACA TTGGTATCTG TCAAGCCATC TACCATCAAC TATAACGATG TGATCGAGGC GCTTACTGCA AGTTCCCATC CTCGCCGAAT ACAACAAGCA CAACATTTAC TTTCCGAAAT GGAAAGCGAA TTCTTTAAAA GAGGAAATAA TCTTCTAAAA CCAAGCGCGA ATAGTTATCG TCTGATGATT GAGGTGTTGA ACAGCGTGGC ATCCGTGAAG AGAGTGGTCG 
AGGCTAAGGC TGTGGTTCTC AAAATGAGTG ATAAGTTCGA TGAGTTGTTC GATGCAAGCC ACATGAGTCG AAAAGAGAAT AAAGCTGTAG TTGTCGCTAC TTTCAACGCT TTTATTCGCC TTTGTGCTAC CACTCCCGTC GAAGCTGAGG ATGAAGGCAT GCGAATCCTA CGTGAAGCCC TGGCGGTTGT CGATAAGATG AGGAGTCATG TCGTATTGGA GCCAAACTCA GCCACCTACG CTGCCCTGTT AGAGGCTTGC AAAGATTTGT TGCCGATAGG TCCAGAGAGA CGTAGGCTTG TGGAGATGGT ATTCCGAGTT TGTTGCGACG AGGGCATGGT CAATCATATC GTCCTGAAGG AACTACGCGA TGCAGCAACT TCTGAGCAGT ACACGAAGAT GGTCGTAGCA TACGGTGAAG AGATGGAAGG GAAAAGAATG GTTCCTGAAG CGTGGACGAT CAAAGCACTA GGGGATCGGG TGTGCACTGA AGACGGTCGA AAGGCAAAGC CGCTGGGTGT TGATGGTCAA CTGGGGGTGA CCCTGGCGAT GCAAGAATTC AAAATGCGAA AAATTCGTGA CGGTCGGAAC AGAAATCTAT TGCGAGGTGG AAGACTTACG ATGGAAGAGC GAAAGGAGTT GGCGTTGGTA A
|
Protein sequence | MQVARIGSRR SATKLLRPQS TGPPPTAAAS PRQALSRSSV ACVSPFENRG IHFHSGKFPS PSTRRNFSSS SIDGIDTVEH ALASSNVKEA QSALDKIPAD SSLTLPDLQS LRGRVLDAWL EHQENLLEIY NYTSPERSHL REICLAAESA HNLLEQIEPL FSNSNLTSYS IYGEEFKSDL DVSDEEALRR PMKPYNTVLT QRCNAVLRAW ARTSRAGQGI NTRLTRAIPQ RAQFLLEGME LSYENGSDRL RTVLPTVESF NQVLEAWAYS DEHLRGAMAE QLFQKLRHGN PAAVHFNGRS YRLIIAAWSW SRERRHAFNA TGHLMKMLRK LEKGDESMEP TMDDYHMILK AWTNAEDRMA PTKAESVLQL MDYAYNNGIT FLQPDITCYR CVLVTAARRR SLPELGRLVD NLLMRMKERL MVPDTLCYQS AITTWKNVAL NHEFPENVEL GVRRTIELLT EMKLAESRST LVSVKPSTIN YNDVIEALTA SSHPRRIQQA QHLLSEMESE FFKRGNNLLK PSANSYRLMI EVLNSVASVK RVVEAKAVVL KMSDKFDELF DASHMSRKEN KAVVVATFNA FIRLCATTPV EAEDEGMRIL REALAVVDKM RSHVVLEPNS ATYAALLEAC KDLLPIGPER RRLVEMVFRV CCDEGMVNHI VLKELRDAAT SEQYTKMVVA YGEEMEGKRM VPEAWTIKAL GDRVCTEDGR KAKPLGVDEI YCEVEDLRWK SERSWRW
|
| |
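The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before a length or GC content can be computed. The sketch below shows one way to do that. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the whole record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown above.
example = "CGTTATGCAA GAGATCGAAC"
seq = clean_sequence(example)

print(len(seq), round(100 * gc_fraction(seq)))  # 20 45
```

Run on the complete gene sequence, the same functions should give a length and GC percentage that can be compared with the Gene Length and GC content fields of the record.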