Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47224 |
Symbol | |
ID | 7202203 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 857541 |
End bp | 859018 |
Gene Length | 1478 bp |
Protein Length | 463 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181457 |
Protein GI | 219122238 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000362966 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCCATGACG AGAAATGGAA GAATCAAAAA TGCATCAGCC ACTCTGGCGA TTTTGGTTGC TTTTGTGATT CTACAGCTAC AGAAGGACCG AGATATGAAA TTTCGAACAT CCTCACCACC GAATGAACAA CGCATGCACC ACAGGAATGC TACTCACTTG GCGCCCACAA CGAACAGTGT TATCCCTAGA CGTAACGGGA CGCTATGTGA TGATCTTCAT CCTTTACAAG CTTTAGAGCG TTACAAAGCT CAACACTCGC AAGCAATCAT GCTAAGTGAA TCCCCTGCTG ATGCCGTCCA TCGCAGGTAT GCCATTGGCT ACTACTCCTG TCCTTTCCAA GCCGGCAACC GCTTACATCA TTTCTTCAAT GCCATGATTT GGGCCATAGT TACCAATCGC ACTCTCTTGT GGAAGTACTA CGATGCTAAA ACGTGTCGTC TTGTGTCACA GAGAAGAAGC CAGCCACATC ACGACAGACA AATTTGCCTA GTCGCCAACA CCGAAGCAGA ATGCGAAGTG GTGCTTCATC GAAAGGCATC TTGGATTCCT TCGCTGGAAG AATGGGCGCC TAAGCAGCTG GGGAGTAACA CCACTTTAAC GTCACTTTCG TATTGGAGTA CTCATCGTCC TCCATCCGAC CCGACTCGTT CCAAGGTTCG TTGGCGGGAT ATTGATTCGA AACACCAAGG TGTGGACCTT TGGACTGAAT TTCAGATTGT TGACTTTCCA CAAATGTTAG GGAAGGATGC GGGAAATGTT TTAGGAGACG AAAAAGGACG CCTCGACATG TTGGCGACCG ATTCAGCGCG GGATGCAGCG CGGAACTTGT TTGCTGAAGG CTCACATTTC ACTTACGGTA TGGTACGATC ATTCACGATT GTGTGATTGC CTTGAACATA TTACTGTCCA ATACTCACTG AAAGACCCCT TAAACACGAC AAAGCTTTTT CGAGAAGTAT TTGACCTTCG ACCGAGCGTG CTTTCTTTAG ACAGCACGTC TGTGCTCGAC GCCGTCAGTT TAAGTAATCC CTTTTCCATT GCCTTACATT CCCGACATTC CAAGCCTGAG GACAACGGTT CTGACGTTTC TACAGAACTA AAGTGTTTGA CAAGTCTCAC TCTAAATCGT ACACAAGGAG ATAAATGTGT CGTGTACCTC CTATCCGATC GGGTCAGAAC ACTCGAGCAA TTGACGAATC ATGTGAACGA GAATCTGAAC TGTACAGCTG TTGTTGCAAA CCACGACGGT GGTCACCACA TCAGGGGAGA ACACGGTCCT TTCGCGGGAG CTGACTTTTT CCGGGACCTC GACCTCGCCT CTCGTGCACG AAACGGCTTC GCTGGTTCTA CCCGGAGCTC ATCGAGTCTA TTGCAAGAGT GGATCGAGTA CGATCGTACG ATCGAGTCTT GCAGCCCCAG AACGAAAGGG GTCCTGCCGC CCCTACCGAA ACTCAATACG TGTAAATTAC CAAAATAG
|
Protein sequence | MTRNGRIKNA SATLAILVAF VILQLQKDRD MKFRTSSPPN EQRMHHRNAT HLAPTTNSVI PRRNGTLCDD LHPLQALERY KAQHSQAIML SESPADAVHR RYAIGYYSCP FQAGNRLHHF FNAMIWAIVT NRTLLWKYYD AKTCRLVSQR RSQPHHDRQI CLVANTEAEC EVVLHRKASW IPSLEEWAPK QLGSNTTLTS LSYWSTHRPP SDPTRSKVRW RDIDSKHQGV DLWTEFQIVD FPQMLGKDAG NVLGDEKGRL DMLATDSARD AARNLFAEGS HFTYGMLFRE VFDLRPSVLS LDSTSVLDAV SLSNPFSIAL HSRHSKPEDN GSDVSTELKC LTSLTLNRTQ GDKCVVYLLS DRVRTLEQLT NHVNENLNCT AVVANHDGGH HIRGEHGPFA GADFFRDLDL ASRARNGFAG STRSSSSLLQ EWIEYDRTIE SCSPRTKGVL PPLPKLNTCK LPK
|
| |