Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47780 |
Symbol | |
ID | 7202942 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 10844 |
End bp | 13331 |
Gene Length | 2488 bp |
Protein Length | 716 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182307 |
Protein GI | 219124011 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCTAC AGCACAGCTT GCTACCTCAG CTCCAAAGGA GCCGCCGCCT TGGGATCTTA ATCGCTCTGG TTGTCTTTCA AAGTGTGACG GTCCTGTTGC ACAAACAGAC GTCCGTTTTG CATCCGCCAG TTCCTCGAGA CTTGTCCGAT CCCTTTGTAT TGGCCTCGGA ACCTAACGGA CAAGAAGATC TTGAGCCAAT GTCCCGCATT TTTCTAGAGG GAAGAGCAAC CTATACTAGC GAATGCAGAA GGTTCAAGTT GACCAACATC ACCTGGGGTC AATCGGTCGC TCCTTTCGTT GACAAAGTCG GAGCCAAAGT ATTGGTAGAG CAAATGGGAA CTTCCGTCAA GATTGTTCCC ACCATTGCAG TCTACGACAC GGCCAATATA TCAGACTTTG ACGCCATGTA CATGAAGGCC TTGCCAGACT CAATCATCAA ACCAGCTCAC GCCACCGGAT GGACCGCACA AGTCCAAAAC AAGAGCTACG TTTGCTTCAA AGGTTGCAAG CAAGAAACAC ACAAGTTCCA ATCCTGGCTC GGAGAGCCTT CGGAAATCGA ACAGGCGCAC AAAGTCGCAA AGAAGGTAAT GCAGTATACC CTTTCCGACG TTCCCAGCCC CGAGTTTCAA AAGAAAGAAC CCCAGTATGG GTTTGTTCCT CGTCGGGTCC TCATCGAACA CCGACTGCCC GTAGAACGCA TGAAGGAATA CCACTGGTGG ATTGCCAACG GACAACCAGT TTTTGTCTGT ATCCGATGTG ATGAAGGGCG GACGAAACGT GGATCCTACT ATTCTTCCGC CTTTCAAAAG CTGGAAATCA CCAGTATCTT GGAACCTTGC CAGCATTTGT CACGGCCCAA GACTTGGGAA AAGATGATTT CCATTGTGAA AAATATGGGT GAGCACGTTC CTGGGGTAGC GTGCATCGAT CTCTACGCCG ACGATTTGGA TGTCTACTTC TCCGAAATCA CTTTCACACG AGGTAAATGC CGAACATACT TCCAACCGCT CGTGGCCGAT GCCCTCTTAT ACGCAATGAG CAACGATATC CTCGCTGCCA AAAGCATCAC CGCTGACTAC GTCGAGAAAA CAGTTGCCGA TCGATCGTGG GTTCACGTTT CGTTCGACCT CGGCGAGCCT CTTCTTAGCA CTAACAAGGT TGTCAACGCA ACAGGATTTC CGTCGGGGCC TGATCTTTGT CGAAACCAAG CAACTGCCAA TTCCTCCGTC TGCGACCACA CCATAGATTC AGTTGCGAGC TGGGACTTGC ATTGTGTTAT CAGCAAGGAG AACGCTTTAA CAGCGGTGGG TCAATCAAAA ATTCGGACAA TCGGTCGTAT CGTTCAGAAG ATTGACTGGC TGTTGGTTCT TGGTCTGGTA GTGTTACTAG TGCTTGTCAA GTTTGGACAC AAGACACGGC AACTTGACCG ACCAGGGCCT CAAGTCTTTC ATTGCTTCCT TTATTTGGCT GCAGTTGCGG TATTCAAGAC CTTTCAAACC CATTCTGCAG GCCTTCTATC ACCGAGACCA ATCTGGTACA CTGTAGTCGA AAGTTACCAA ACTTTCAAAA TTGTCCATCC TGTGACATCC CCGGCAATTG CATTATCGCA TTTTGCGACT TACTGGATTT CCGTCAGCGC ATTCTTTTCC AAAAGATTGA CTACAATGTT GATTCTTTGG TGCCTGTATG AAGTCTGTAC AGCTTTTGTA AACGAATACT TTCACTTTGG TGAGGAGGAC GATTCGGTGA GATGCATGCG AGTTTCGTTC ATTCTTTACA CCAAAGAATA CGCTATCAAT GACGTCGTGA GGGTCTACTT GCTGCCGCCT CTCTTTGTGT ATGGGTATCT GTTGCCCAAG ATGATGCTCT ATTGGTTTGG TCCCCATGGT ATGTTTCTGG TTTGTACCGT TTCTGCTGGC TTCGGGACCT TTTTGTCTTT GAGCACTTGC CGAAAACGAA CATATTGTGG AGCTACACTA TGCCATCACA CGTCAACAAG AATTGCAAGA TGAATTAGAC AAAACGGTTT GTTCCAACAT CCAAACAATT TTATATTAGA ACATTGTGTA TGCTACTATG CCATTGAGTA CTCTAAAAGA GTGTACAAAG CGGCAACAAT GTAGTATGCA TGGTGCTGTA GATCTAATTA TGCATGTGCC TGACCAAGTG ACTGTTGAAT AGGGATTGGG TTGCCTGTAA GGCTTGTGTG TGGCTTACAT CACAGGTTGC GCAACCTGGA CACTGACCAG ATTGGGATTG GCTGCAAGAG TGTTGAGTTT GTATGGTGTT CATGCCCCTC ACGCTGACCA CAATGGAGGC GCGCTCGGCA CCGCCAACGC AAATCACGGC GGTGGCAAGT ACAGCGCAGG ATTCTTCCAG ACGCCGAGGG TCACTGACAC GGGCCTGCAT GGCGCCACTA GACGCGACGG CACCGGAGGC AACCAAGGAT ACTTCCCAAC AAACGTCAAC CACAGCAAGA CTTTGTAA
|
Protein sequence | MTLQHSLLPQ LQRSRRLGIL IALVVFQSVT VLLHKQTSVL HPPVPRDLSD PFVLASEPNG QEDLEPMSRI FLEGRATYTS ECRRFKLTNI TWGQSVAPFV DKVGAKVLVE QMGTSVKIVP TIAVYDTANI SDFDAMYMKA LPDSIIKPAH ATGWTAQVQN KSYVCFKGCK QETHKFQSWL GEPSEIEQAH KVAKKVMQYT LSDVPSPEFQ KKEPQYGFVP RRVLIEHRLP VERMKEYHWW IANGQPVFVC IRCDEGRTKR GSYYSSAFQK LEITSILEPC QHLSRPKTWE KMISIVKNMG EHVPGVACID LYADDLDVYF SEITFTRGKC RTYFQPLVAD ALLYAMSNDI LAAKSITADY VEKTVADRSW VHVSFDLGEP LLSTNKVVNA TGFPSGPDLC RNQATANSSV CDHTIDSVAS WDLHCVISKE NALTAVGQSK IRTIGRIVQK IDWLLVLGLV VLLVLVKFGH KTRQLDRPGP QVFHCFLYLA AVAVFKTFQT HSAGLLSPRP IWYTVVESYQ TFKIVHPVTS PAIALSHFAT YWISVSAFFS KRLTTMLILW CLYEVCTAFV NEYFHFGEED DSVRCMRVSF ILYTKEYAIN DVVRVYLLPP LFVYGYLLPK MMLYWFGPHG CATWTLTRLG LAARVLSLYG VHAPHADHNG GALGTANANH GGGKYSAGFF QTPRVTDTGL HGATRRDGTG GNQGYFPTNV NHSKTL
|
| |