Gene Information |
Locus tag | PHATRDRAFT_19552 |
Symbol | |
ID | 7199916 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 19871 |
End bp | 21720 |
Gene Length | 1850 bp |
Protein Length | 529 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179343 |
Protein GI | 219117097 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
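The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are both inclusive positions on the replicon and that the protein is terminated by a single stop codon. The function names `gene_length` and `min_coding_bases` are hypothetical helpers introduced here for illustration; they are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def min_coding_bases(protein_length_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon (assumption)."""
    return 3 * (protein_length_aa + 1)


# Values taken from the record above.
length = gene_length(19871, 21720)
coding = min_coding_bases(529)

print(length)           # 1850
print(coding)           # 1590
print(length - coding)  # 260
```

Under these assumptions the 1850 bp span matches the listed Gene Length, and the 260 bp not needed to encode 529 amino acids is compatible with the record marking the gene as spliced, although the record itself does not say how those bases are distributed.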
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAAGGTCCAA CAGGCATAGG AGTACTCTGG GAAATCGGAT CACCGGAAGC GAAATAAACT AACCACGTTT GAGCGAACTT GGGTCTGGAC GTGCAGATTT CTACCATGCT CGTTTTGTTC GAAACGCCGG CGGGTTATTC GTTGTTCAAG GTACGTCAAC TTGGGGTGGA TTTCGATAGT GAACGATCTC TGTCATCCAC TTTGAACACC TCTGGATGCG GACGTATCGT CCCGCGGAAC GATCATCACT CACACCGTGC CACAAATACA ACGATGTCTA AATCAATGAT GGACCACTCT TCTAGGTCAC GGATGAGAAA AAGTTGAAAA AGACGGACGC CGATGATATT CATGACACGT TTTTTTCCGA TTTTGGCAAG GCTAGCAAAT TCTTGGAAAT GGTGTCGTTC AAGCCTTTTG CCGACACTGC CGATGCCGTA TCAGCCGCCT CCGCTATGGT TGAAGGCAAG GTGAGTAAGT CGCTCACCAG TTTTCTAAAA AAGAAACTAA AAAAGTCCAA CGACTTGAGC GTAGCCGTCG CCGACAAGGC AATCGCGGCC CCACTCAAAG AGTCAGTTCG TGATGACCTC AAAATTGTGC ACGACAGCAA ATCGCAGGAA ATATTTCGTG GCATCCGTGC GCACATGGAT GAATTGCTTA CCAACGACGA TTCAAACGTA ACCAAAGAAG ATTTGCGCGC GATGCAGCTT GGTCTCTCTC ATTCACTGTC GCGGTACAAG CTCAAATTTT CCGCCGACAA GGTGGATACC ATGGTCATCC AAGCCGTGGG CTTGCTGGAC GAACTTGACA AAGAAATTAA CACATATGCC ATGCGTGTCA AGGAATGGTA TGGTTGGCAC TTTCCTGAGT TGCAAGGGCT CGTTGGCGAC AATGCCAAGT ACTCAAAACT AGTTCTTAAG GCCGGTATGC GACCTACTTT CAAAAACTAC GATTTGAGTG ATATTCTGGA AGAAGAAGAC GTTGAAGCTG CAGTAAAGGA GGCTGCTGAA ATTAGCATGG GCACCGAAAT TGCTGACTTT GATATTCTCA ACATTCAGTC TCTGGCTGAT CAAGTACTGA GCATGACGGA GTATCGGTCG CAACTATATG AGTATCTCAA GAATCGTATG AACGCTATTG CACCCAATTT GACCATTCTT GTCGGTGAAT TGGTTGGTGC CCGCTTGATT TCGCATGCTG GATCGTTGAT GAATCTTGCT AAACAACCGG CCAGCACAGT ACAAATCCTT GGTGCCGAAA AGGCACTCTT TCGCGCTTTA AAAACGAAAC ACGATACTCC GAAATACGGC TTGATCTATC ATGCCTCACT GATAGGACAG GCAGCACCAA AGAACAAGGG AAAAATCTCG CGCGTACTGG CTGCTAAGGC GTCTTTGGCC ATTCGGGTTG ATGCGCTGTC AGATGAAACC GCTGATCAGC TTGACACGAC GATTGGTTTC GAAGGCCGCG CCAAAGTAGA AGCCCGGCTT CGACAATTGG AAGGGGGAGT CTTCGTGACC AACGGTAATG TATCAGCATC CAAGACAGCC AGGTACGATC CTGTGGCGGC CAAGACGGCC GCTGCTGCTC CTGCCTACAA CGATTCTAGT GACATGGTAT TGGACGTGGG GACAAATGGA AGTAAGACGG ACGAGAATGC GACAAAGAAA AAAAAGAAAG ATAAGAAAAA ATCAGACAAT GGAGATGAAG AATCACCGAA GAAAAAGTCA AAAAAGGACA AGAAGCGGAA GGCTGAAGCT GTCGACGACG AAGAACAGGA TGATGACGAA GCGAAAAAGT 
CAGCGAAAAA GGCCAAGAAA GACAAGAAAA AGAGGAAGTC ACAAGAATGA
|
Protein sequence | MLVLFETPAG YSLFKVTDEK KLKKTDADDI HDTFFSDFGK ASKFLEMVSF KPFADTADAV SAASAMVEGK VSKSLTSFLK KKLKKSNDLS VAVADKAIAA PLKESVRDDL KIVHDSKSQE IFRGIRAHMD ELLTNDDSNV TKEDLRAMQL GLSHSLSRYK LKFSADKVDT MVIQAVGLLD ELDKEINTYA MRVKEWYGWH FPELQGLVGD NAKYSKLVLK AGMRPTFKNY DLSDILEEED VEAAVKEAAE ISMGTEIADF DILNIQSLAD QVLSMTEYRS QLYEYLKNRM NAIAPNLTIL VGELVGARLI SHAGSLMNLA KQPASTVQIL GAEKALFRAL KTKHDTPKYG LIYHASLIGQ AAPKNKGKIS RVLAAKASLA IRVDALSDET ADQLDTTIGF EGRAKVEARL RQLEGGVFVT NGNVSASKTA RYDPVAAKTA AAAPAYNDSS DMVLDVGTNG SKTDENATKK KKKDKKKSDN GDEESPKKKS KKDKKRKAEA VDDEEQDDDE AKKSAKKAKK DKKKRKSQE
|
| |
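The sequences above are printed in space-separated blocks of ten characters, so any length or GC content check has to strip the whitespace first. The following is a minimal sketch of that calculation, assuming GC content is defined as the percentage of G and C bases over all bases. The helper names `clean_sequence` and `gc_content` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence shown above.
first_blocks = "CAAGGTCCAA CAGGCATAGG"

print(len(clean_sequence(first_blocks)))  # 20
print(gc_content(first_blocks))           # 55.0
```

Passing the full Gene sequence field to the same functions would allow the listed Gene Length and GC content values to be compared against the sequence itself.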