Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_11785 |
Symbol | |
ID | 7200338 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 349841 |
End bp | 351868 |
Gene Length | 2028 bp |
Protein Length | 631 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179192 |
Protein GI | 219116795 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.222713 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAGCGACAAA CGTGGAACGA TCAAAAACGA GTCATTCATG GCACTATTAA TCGTCTCAAT CTCCAAACAA TAAAACCGCT CGTCCATGAC TTATTCGCCA AAGTTAATCT TGTTCGTCTC CGTGGCGTCC TGGCGAAATC CATTCTCCAA GCCGCCGTCA GTGCACCCAG CTACAATGCC GTCTATGCCG CCCTCGTGGC AGTCCTCAAC ACGAAATTGC CAGAAGTGGG TGAACTCGTC ACGATCCGGG CCATTCTGGC CTTTCGAAGG CATTTTCGGC GGCGGGAAAA GACTTCCTGC GTGGCTATGG CTCTCTTTCT TGGCCACCTG TTCCATCAGG CCGTCACCCA CGAATTGCTC ATTTTGCAAA TCTTGACGGT ACTCTTGGAC GGGGATCCGA CGGACGATTC CGTCCAAGTC GCGGTGCAGC TCTTGAGCAC CACGGGTTAT GCCTTGTTGG AAGTCACACC GGCGGGAGTC CGTGCGGTTT TGGAACGTCT CCGTGGACTG TTGCATGAAG GGTCGCTGAG CTCACGAGTC GAGTACCAAA TGGAAACTCT ACTCAAGTTG CGCAAGGAAG GCTTTCGCTC GCATCCACCC ATTCCGAAAG AATTGGATTT AGTCGAACAG GATGATCAGA TCACGTTCGA GATCAGCTTG GATGATGAAG ATCTTCGAAA GCAGGAAGAG TTGGATGTGT TTGCGGTCGA CCCGGAGTAT GCGCAAAATG AAACCGAATG GGGTATCATA AGGGCCGAAA TTTTAGGATT AGGTAGCGAT GATGATGAAG AAGAAGAAGA AGACTCCAGT TCCGGGGAGA GTGGTGACGA AGACGGAGCT GGCGACGATA ATGAAGGGGA GTTCGAGACG GAAGAGGCCA TTGTTTCTGT AGACCAGAAC AGTAAGCATG GCACAGTCGT GGTGCGAGAC TTATCCGAAG CCGATCTGGT TCATTTACGT CGGACTATCT ATTTGACAAT CATGTCGAGT GCTACGTTCG AGGAATGCGC GCACAAACTA GCCAAGGTTG ATATTCCAGA CGGTCGTGAA GAAGAGCTTA TTAATATGCT GATTGAATGC TGTTCTCAAG AGCGTACGTT CTTGCGCTAC TACGGATTAA TCGCGTCGCG ATTTTGCTTG CTACATGACC GCTGGAAGAA TGCATTTATG GATGCATTTG CCCAGCAGTA TACAACAATC CATCGACTGG AGACGAATAA GCTGCGCAAC GTGGCAAAGC TTTTCGCACA TTTATTACAC ACGGATTCGA TGCCTTGGAG CGTCCTAGCC ATTGTTCGTT TGAATGAAGA CGAGACTACT TCATCCAGTC GTATCTTTGT CAAGATTCTT GTTCAGGAAA TGGCGGAAGC TCTGGGTATC GCCAGTCTTA AGGAACGACT AGAAACGGAC GATCTTGAAA TGGCTGAATG GTTCAAAGGA ATGTTTCCTC GTGATAACGT GCGAAATACG CGGTACGCAA TCAATTTTTT TACTTCGATC GGGCTGGGTC CTCTCACCGA TGGGCTGCGC GAGTTTTTAA AGAATGCTCC CAAATTGATC ATGCAGCAGG CCAAAGTCCC GTTGGACGGA CAAAAGGAGG ACGATTCCTC AGTTTCGTCT TCTAGTTCTA GTGATTCCGA GAGCAGCTCG TCGCTTTCCT CTTCTTCAAG CTCTAGCTCG GGAAGCTATT CTTCTTATTC AGATAATCGT CGAGAAGGAC GATCTCGTGG ACGGCGTAGA AGGGATCGGT CTAGGTCCCA TTCTCAGTCT TCTCGAAGTG CATCGTCGCG AAGTAGCTAC TCGAGGTCTT CATCACGCTC GCTGTCGAGG TTTCCTTCGC ACCCGAAAAA TACACGGGGT GCTCAAAGGC GTCGACCATC CGTTTCGTCT CGGGATTCGA GAAGGTACAG CTCTTCCTCA TCAGCAACGA AACAGGTAGA GCAGCAGGGC CGTTCGCGAT CGGTGGATTC TTTTGGTCGT GAACGACGGG GCTCTAATGC CGCTATCGAT CGACGTTTAA GAAAATAG
|
Protein sequence | QRQTWNDQKR VIHGTINRLN LQTIKPLVHD LFAKVNLVRL RGVLAKSILQ AAVSAPSYNA VYAALVAVLN TKLPEVGELV TIRAILAFRR HFRRREKTSC VAMALFLGHL FHQAVTHELL ILQILTVLLD GDPTDDSVQV AVQLLSTTGY ALLEVTPAGV RAVLERLRGL LHEGSLSSRV EYQMETLLKL RKEGFRSHPP IPKELDLVEQ DDQITFEISL DDEDLRKQEE LDVFAVDPEY AQNETEWGII RAEILGLGSD DDEEEEEDSS SGESGDEDGA GDDNEGDKHG TVVVRDLSEA DLVHLRRTIY LTIMSSATFE ECAHKLAKVD IPDGREEELI NMLIECCSQE RTFLRYYGLI ASRFCLLHDR WKNAFMDAFA QQYTTIHRLE TNKLRNVAKL FAHLLHTDSM PWSVLAIVRL NEDETTSSSR IFVKILVQEM AEALGIASLK ERLETDDLEM AEWFKGMFPR DNVRNTRYAI NFFTSIGLGP LTDGLREFLK NAPKLIMQQA KVPLDGQKED DSSVSSSSSS DSESSSRDRS RSHSQSSRSA SSRSSYSRSS SRSLSRFPSH PKNTRGAQRR RPSVSSRDSR RYSSSSSATK QVEQQGRSRS VDSFGRERRG SNAAIDRRLR K
|
| |