Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_22666 |
Symbol | |
ID | 7194993 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | + |
Start bp | 48808 |
End bp | 50806 |
Gene Length | 1999 bp |
Protein Length | 546 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183278 |
Protein GI | 219126049 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACTACCAGT TATACCGTCA ACGCCTTTGT CCCATACGAT CCATTACGAT CTAAATTGTT CACTCCCGAT CGTTCTTTCA CTCGCACATT TACAGTTACA GTTAGTTCCT ATTCCACCTT CCGCGCACCA AACTTCCTCT TCGTCCTCGT CATTCTTCCC CGCAGCATGA GCAATTCAAC GCTGCACGAA ATTTCTCCCA ACGCCGAGAT TGTCTCGCGT CAACAGGCAC TCCAGGTCAA CGTCGCCGCT GCTGTCGGAC TCTCCAACGT ACTCAAGTCC AACTTGGGAC CCACGGGAAC TTTGAAACTG CTCGTCGGGG GGACGATTGA GCAACTCAAG TTGACCAAAG ATGGACTCAC CTTGCTGAAA GAAATGCAGA TACAGCATCC CACCGCAGCG CTCATTGCCC GTACCGCTAC GGCACAGGAC GACGTCACCG GTGACGGAAC CACGTCGGTC GTCTTGTTGA CGGGAGAATT GCTCCGACAA GCCGAACTCC TCGTACGGGA AGGATTGCAC CCACGGGTAC TCACCGATGG ACTCGATACG GCACGGGATG CCTGTTTGGA AGTCCTGAAA GCATTTGCCG TAGCTCATCC GGATCTAATC CATAATCGGG ATCTATTGCA GCAAATTGCC CGGACCTCTC TGGCGACCAA ACTCGACGGT CCTCTTGTGG ATCAGGTACG TGCGCTTTTG TTGTTGTGGT AGCGTTAGCT GAGTAGTTAT TTCGTACCGT GCCATACCGG AATTCAAGGA TGCCCCCGTT GTCACTCTGG CGTCTCTTTG CTTACTCACA GTCATACACC ATCAACCCCT CTTTCCTTTC CTCACACTGT CCATTATAAC TTTGTCTACG CTTACGCTTT TATCTAGATG TCTTCGGCGG TCGTCTCGGC CATTCAAACG ATATACGAAC CAGACACGCC GCTCGACTTG CACCGCGTGG AAATACTCAC TTTGGCTCGC CACCGGGCCG TCGATTCCAA ATTCGTTGCC GGTTTAGTGC TCGATCACGG TGCCCGCCAC CCCGACATGC CCACACAACT CCTCAACGTC AAGGTCATGA CCTGCAACAT CTCGCTCGAA TACGAACAAA CCGAAACGCA GGCCGGTTTC GTCTACTCCA CTGCCGAAGA ACGCGAAAAG CTCGTCGAAA GTGAACGTGT TTGGTTGGAC GAGCGTTGTC GGCGCATTGT GGAATTCAAG CGCCAAGCCT GTGCAGACGG CGAGACCTTT TGCATCATCA ATCAAAAAGG TGTGGATCCG TTGAGCTTGG ACATGTTCGC CAAGGAAGGT ATCCTTTGCC TGCGTCGGGC CAAACGTCGC AATATGGAAC GTCTCACGCT CGCAACCGGC GGTAGTATCA TTCTCAGTCT CGAAGATTTG GAAACCAGCA TGCTGGGCTA CGCCGGTAGC GTCAAGCAAG TCACCTACGG CGAAGACAAG TACACGTTCG TGGAAGACTG CCCCAATTCA CAGTCCGGGA CTCTACTTTT ACAGGGACCA AATAAGTTGA CGACCGAACA AATCAAAGAC GCCGCCAAAG ACGGCTTACG GGCCGTCAAA AATGCCGTAG AAGACGGCGC CCTCGTGCCC GGAGGCGGCG CCTTTGAAAT TGCCGCTTCG GAACATTTAC TGCACAAAGT CGTGCCCACG CTCAAAGGCA AAACGAAACT GGGCGTACAA GCGTACGCAC AGGCGCTTTT GGTCATTCCC AAAACGCTCG CCGCTAATTC GGGTTTTGAC GTCCAGGACG TCCTGCTGAA ACTTCAGGAT GAACGCAACT CAACCAACAT GGCGATTGGT TTGGATGTCA AAACGGGGGA ACCCATGTTG AGCGCGGAAC AGGGTGTATG GGACAATGTC CGGGTCAAAC GTCAAGGCTT GCATCTGGCC ACGGTCTTGG CCAACCAGCT ACTGCTCGTG GACGAAGTCA TGCGGGCTGG CAAACAAATG GGGAGAAATG CCCAACCAAA TCCGGAAATG ATGGGATAG
|
Protein sequence | MSNSTLHEIS PNAEIVSRQQ ALQVNVAAAV GLSNVLKSNL GPTGTLKLLV GGTIEQLKLT KDGLTLLKEM QIQHPTAALI ARTATAQDDV TGDGTTSVVL LTGELLRQAE LLVREGLHPR VLTDGLDTAR DACLEVLKAF AVAHPDLIHN RDLLQQIART SLATKLDGPL VDQMSSAVVS AIQTIYEPDT PLDLHRVEIL TLARHRAVDS KFVAGLVLDH GARHPDMPTQ LLNVKVMTCN ISLEYEQTET QAGFVYSTAE EREKLVESER VWLDERCRRI VEFKRQACAD GETFCIINQK GVDPLSLDMF AKEGILCLRR AKRRNMERLT LATGGSIILS LEDLETSMLG YAGSVKQVTY GEDKYTFVED CPNSQSGTLL LQGPNKLTTE QIKDAAKDGL RAVKNAVEDG ALVPGGGAFE IAASEHLLHK VVPTLKGKTK LGVQAYAQAL LVIPKTLAAN SGFDVQDVLL KLQDERNSTN MAIGLDVKTG EPMLSAEQGV WDNVRVKRQG LHLATVLANQ LLLVDEVMRA GKQMGRNAQP NPEMMG
|
| |