Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_30394 |
Symbol | |
ID | 7195728 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | - |
Start bp | 438314 |
End bp | 440483 |
Gene Length | 2170 bp |
Protein Length | 559 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184252 |
Protein GI | 219128084 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.382636 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGGAGTCTTT CCCGCGTGCA CGTCGTCGTG ATCCTGTCAT CCGGACAAGA TCGTCTTCGT CGTCGTGAGA TTGCGTATTC CCACAGCGCT TCGACAGCCG ATTGCCGCAC CCGAGCGCTT TTCAGATCGA CGTTGCTATT CCCATGTCGA AAGAAGCGGC GCAATCACCG AAAAGCGGAA AAACGATCGA TCAGACCGCG GATCGGACGG CCATTGTCGC TGCTCGTCTC AAGAAGCTGT ACAAAAATTC TGTCTATCCC GTCGAAAAGA AGTATCGCTA CGATTATTTC TTTGAAAGTC CACTTTTGAG TGACGTCGAG TTTGATGGTG CGTACTCCAA ACGAAGTTTG AGGTTCTTTT GCTTTTGCCA AATGCTTTTA GTGTTGGCAA AGGACCTCAC ACACTCTTGC ACTCTCTCGT GATAACACAG CCAAGCCTCA AGTGCTTCTG GTTGGACAAT ACAGTGTAGG AAAGACTTCC TTCATTCGTT ACCTACTCGG TAGAGACTTT CCTGGTCAAC GGATCGGTCC CGAGCCTACT ACCGATCGAT TCACCGTATT GCTGAACGGC CCGGAAGAAC GCACTATTCC GGGAAATGCT CTCTCGGTGC ATCCTGATCT ACCCTTTCGG GGTCTCGAGC GCTTTGGAGT CAGTTTTCTG AGTCGCCTCG AAGGCAGTCA ATTACCAAGT AGTGTTTTGA AATCCATCAC ACTCATCGAT ACCCCGGGTA TCCTTTCGGG AGAAAAGCAA CGCACCAACC GTGGGTACGA TTTCACGAAA GTCGTATCCT GGTTTGCCGA AAAGGCGGAT TTGATTATTC TGCTCTTTGA CGCACACAAA CTTGATATCT CGGATGAACT CAAGGGCGCA ATCGATGTCC TTAAGGGGCA TGAAGACAAA ATTCGATGCA TTCTCAACAA GGCTGATCAG ATTGATCGGC AACAATTGAT GCGAGTTTAC GGTGCGCTAC TTTGGTCGCT CGGTAAAACT ATGACCAGTC CAGAAGTAGC CCGAGTTTAC GTCGGCAGCT TTTGGCAGCA ACCGCTGCAG CACATGGACA ACGCCGACTT GTTCGAAATG GAGGAAAAGG ATCTCATGAA GGATTTGGCC GTCCTTCCAC GGCAATCAGC CGTACGAAAA ATCAATGAAC TCGTCAAACG CATTCGCAAG GTCAAGACCT TGGCCTACAT TATTGGCTAC TTGAAATCAC AAATGCCTGC ACTCATGGGC AAGGAAAAGA AACAAAAGAA ACTCATTGCG TACGTTTTGT GGGTGATGCG TGACTCTAAA TATGTGGCTG CTGCGTAATG TTGGTTCATC TTTTCCAGGA GGCGTCGTTT CGGTCACGCA CACACGTTCT CACCATTTTC TCCTTACGTT TCATCCACAG CGACTTACCG ACGGTGTTTC GTACGATTAT GAAAAAGTAC GACTTGGCCC CCGGCGACTT TCCCGAAATT GCCAGCTTTT CTAACAAACT GCACGAAACC AAGTTTGCCG AGTTTAATAC CTTGTCGGAA AAACAAATTG CTGATCTGGA CCGGGTCCTG AATGAGGATA TTCCCAAACT CATGGAAGAA TTGCCCAGTG AAAAGGACTC TCCCGACATT ATCCGATCCA AAATGGGAGC TGCTGGTGGC ATCGCCAAGG TTCCGGTCCC GGTCGCCAAT AATAAATTCG GCAAAAAAGA AACGGCTCAC GAAAGCAATC CTTTTGGCTA CGATGAGGAG AACGAAGATT ACTGGTACGT GTCACAACCA AGAAACGGTA CGCGTTTTGC GTATCTCCAG GTCTAGCGTG TTAACATGCA ATATCTTTAC TCTGTTGTAC AGGGCACTGC AGGATTCAGC AGATCGTCTC CTGCCAAGCT TTGAAGCCTT GGGACCAGAT GGTGGCTATC TCTCGACGGC CAAGGCACGT GATGTGCTGG TCAAAACGGG GCTCGAAAAG GACCAACTTC GCCAAATCTG GAACCTTAGC GACATCGATA AGGACGGTCT CTTTGACCAT GACGAATACG TTGTGGCCAT GTTTTTGTGT GATGCTGTTC TGCAAAAAGG TCGACCCATT CCCTCCGAGC TCCCGGCGAG TGTTATTCCG CCGCGCAAGC GATCACTGTT AGCAGAGAAG AGCAGCGTAT TTTAAAGGGC AATAGCAACA ACATCAAGAG
|
Protein sequence | MSKEAAQSPK SGKTIDQTAD RTAIVAARLK KLYKNSVYPV EKKYRYDYFF ESPLLSDVEF DAKPQVLLVG QYSVGKTSFI RYLLGRDFPG QRIGPEPTTD RFTVLLNGPE ERTIPGNALS VHPDLPFRGL ERFGVSFLSR LEGSQLPSSV LKSITLIDTP GILSGEKQRT NRGYDFTKVV SWFAEKADLI ILLFDAHKLD ISDELKGAID VLKGHEDKIR CILNKADQID RQQLMRVYGA LLWSLGKTMT SPEVARVYVG SFWQQPLQHM DNADLFEMEE KDLMKDLAVL PRQSAVRKIN ELVKRIRKVK TLAYIIGYLK SQMPALMGKE KKQKKLIADL PTVFRTIMKK YDLAPGDFPE IASFSNKLHE TKFAEFNTLS EKQIADLDRV LNEDIPKLME ELPSEKDSPD IIRSKMGAAG GIAKVPVPVA NNKFGKKETA HESNPFGYDE ENEDYWALQD SADRLLPSFE ALGPDGGYLS TAKARDVLVK TGLEKDQLRQ IWNLSDIDKD GLFDHDEYVV AMFLCDAVLQ KGRPIPSELP ASVIPPRKRS LLAEKSSVF
|
| |