Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_33329 |
Symbol | |
ID | 7204197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 620080 |
End bp | 621606 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186095 |
Protein GI | 219113023 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.410276 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTTTGT GGATTCACGC ACTTTTATTG GTCCTTCTTT CTTTGTCAGC TTCTTACAAA CAGGTTTTGG TAAGGGGTCA AGCTCTCGAC ACTAGACCAC AAACATGTGC GGAGTGGACA GGTGCCGAAT GCGCTTGGGA ACCCAATCTC CAGGCCATGA ATGCGGACTT TGGGCGTTTC AATGAGACCT TCTACGCTTA TGTTGTTCCT GATGTCGCTA CCTTTTACAA TAAGACACCA GGAAGTCTAC AACCGACGAT GTCCACATTC AAAGGGATGT TTGGAAAGTT CACAAACCTC TCACCAGACA CTATCCGCGT TTCTTGGATT TCTGGCAAAA AAAATGATGC GCCCGTGTAC ATTTCCGACG TTTCTCCCTT CGATTCTGCA GGAACCGCAA CCTATGTGGG TCACCAGTTT ATAGTAACGG ATCGCAAGTC AAACACTCTA TTAACAAAGT GGACGATGGT GCAAGGAAAT TCGCTGTACT TCTACGACCC TTTTGATTTT GATATTCGGA AAGCTCTGAA GGCGCTGACT GCGGAACAGT TACCGTTGTA TAATATGCAG CTCCAAAACA AAATGTTCGC GGAGCAGTAC AAGACTTTTA CGGGAACTGA CTGGTTGGCT TTGTACAGAC AAAAGTTCGC CCCCCGATTT CACATGTGGC GAGCCGATGC GATCGGACAA ACGTATACAA TCGAAACGAA CGAAATACAT TTCGTCGATT TCCCAGATGA GGCCGAGTTA GCCAGGGGAA CTTCGATATA TGGTCCTCGC CCAGATGAGC GGGACCGAAT GCGTCGTCTG CGAGCTCGGG ATCCGACCAT GAATATGACC ATGACAGTCT TATCGTGTGT CCCACGTGTG TTTGAAGTCA AGGATTTCTT GTCAGATATG GAAGTAGAGC ACCTACTTAA TATTGCGTCC AAAAGAAAGC TAAAACGATC AACAATGCAT GCAGGTGGAT CCTCAGAAGC TACCACCAAC GACGACACTC GTACTTCAAC AAACGATTGG ATTCCACGGC ACCAGGATCT GATAACAGAT ACCATATACC GTCGCGCAGC CGATTTGCTT CAAATGGATG AAGCTTTACT ACGTTGGCGA CGCAAGTCAG AAATCCCAGA GTTTACCGAA TCACATATTA GCATCTCAGA ACGACTGCAG CTCGTCAACT ATCAAGTCGG CCAGCAATAT ACGCCACATC ATGATTTTAC AATGCCGGGT TTGGTCAATA TGCAGCCTTC GCGATTTGCC ACACTTTTGT TCTATCTCAA TGACGATATG GATGGCGGGG AAACTGCTTT TCCGCGATGG CTTCATGCGG ATGAAGAAGG CGGATCACTT AAGGTGAAGC CGGAAAAGGG TAAAGCAATT CTTTTCTACA ATCTGTTACC AGACGGAAAC TACGACGAGA GAAGTGAACA TGCAGCACTC CCAGTGCGTA GAGGGGAAAA ATGGCTCACT AATTTGGTAC GTGCTTCTCG AGCGCTGGAA GTTGATCCGC TTTCTGGCTC ACAATAA
|
Protein sequence | MLLWIHALLL VLLSLSASYK QVLVRGQALD TRPQTCAEWT GAECAWEPNL QAMNADFGRF NETFYAYVVP DVATFYNKTP GSLQPTMSTF KGMFGKFTNL SPDTIRVSWI SGKKNDAPVY ISDVSPFDSA GTATYVGHQF IVTDRKSNTL LTKWTMVQGN SLYFYDPFDF DIRKALKALT AEQLPLYNMQ LQNKMFAEQY KTFTGTDWLA LYRQKFAPRF HMWRADAIGQ TYTIETNEIH FVDFPDEAEL ARGTSIYGPR PDERDRMRRL RARDPTMNMT MTVLSCVPRV FEVKDFLSDM EVEHLLNIAS KRKLKRSTMH AGGSSEATTN DDTRTSTNDW IPRHQDLITD TIYRRAADLL QMDEALLRWR RKSEIPEFTE SHISISERLQ LVNYQVGQQY TPHHDFTMPG LVNMQPSRFA TLLFYLNDDM DGGETAFPRW LHADEEGGSL KVKPEKGKAI LFYNLLPDGN YDERSEHAAL PVRRGEKWLT NLVRASRALE VDPLSGSQ
|
| |