Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_18029 |
Symbol | |
ID | 7197080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 97353 |
End bp | 98609 |
Gene Length | 1257 bp |
Protein Length | 401 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177558 |
Protein GI | 219111613 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.187612 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATTACACAC AAGCACACAC ACGTCCATTC ACACACGTAT ACGCAAAAAC TATGTGGAGT TCATCGTCGT CGCTCTGTCG GAATCCCTCG TTTCGTCGTG CGTGGTTGTC GACCGTCACG GTCACGCAAA CCGCAGCGCC CACTTCGTCG CGATTGGCTG CCCTACGGAC GCAACTCGCT ACGGAAGAGG CGTCGATTGA CGATTTTTCA TCCACCAACG CACCAACAAC AACCACACAC TACACGAGCA GCAACGGCAG TCCGATTGTG CGACAAAAGG CGGCACCACG GAGTGCCAAG ATTCTGCCCA AACCACGTTG GCTCAAAGCC GCACCAGCCA CGTCGGACAA TTACCGCAAA CTGCGGGACA CGGTCCGCGA ATTGGGACTC GCCACGGTCT GTGAAGAAGC GCGTTGTCCC AACATTGGCG AGTGCTGGGG CGGCGGGGAG GACCAAACCG CCACGGCCAC CATTATGATC ATGGGAGATA CCTGTACCCG GGGATGTCGC TTCTGCAGTG TCAAAACCTC GCGAGCACCA CCCCCGCTAG ATCCGCATGA ACCGGAAAAG GTTGCTACCG CCATAGCCCA GTGGGGACTC GATTACGTCG TACTCACTTC CGTCGATCGG GATGATCTGC CTGATCAAGG CGCCGACCAT TTTCGACAAG TTGTCACGCA ACTCAAGTTG AAGAAACCGT CACTCCTCGT GGAAGCATTG ACTCCGGACT TTCAGGGCAA TATGGATCTT GTACACGCCG TGGCCACGTC CGGGTTGGAC GTGTACGCGC ACAATATGGA AACCGTCGAA GCCTTGACAC CCAAGGTGCG TGATCGACGC GCCACCTACC GACAAAGCCT CGAAGTACTC CGGTACGTCA AGACTATTCA GTCCGACCCG ATCGGTACCA CCAACAACCA CAACAACAAC AATGGTTGCC TCACCAAAAC ATCCCTCATG CTGGGACTCG GCGAAACGGA TGACCAAGTG CTCACCACCC TCCGTGATTT ACGCGACGCC GACGTGGACG TGGTCACCTT TGGACAGTAC CTGCAACCCA CCAAAAAGCA CTTGCCCGTA CAGGAGTACG TTACGCCGGA AAAATTCGAT TTCTGGCAGG AAACCGCCAT GGGTATGGGA TTCGCCTACG TTGCATCGGG ACCGCTCGTG CGCTCCAGTT ACAAGGCCGG TGAACTATTT CTCCAAAAGT ACATTGCCCA AAAGAAGCAA CGCAACGCCG AGGTTGCCGC GGCGTAA
|
Protein sequence | MWSSSSSLCR NPSFRRAWLS TVTVTQTAAP TSSRLAALRT QLATEEASID DFSSTNAPTT TTHYTSSNGS PIVRQKAAPR SAKILPKPRW LKAAPATSDN YRKLRDTVRE LGLATVCEEA RCPNIGECWG GGEDQTATAT IMIMGDTCTR GCRFCSVKTS RAPPPLDPHE PEKVATAIAQ WGLDYVVLTS VDRDDLPDQG ADHFRQVVTQ LKLKKPSLLV EALTPDFQGN MDLVHAVATS GLDVYAHNME TVEALTPKVR DRRATYRQSL EVLRYVKTIQ SDPIGTTNNH NNNNGCLTKT SLMLGLGETD DQVLTTLRDL RDADVDVVTF GQYLQPTKKH LPVQEYVTPE KFDFWQETAM GMGFAYVASG PLVRSSYKAG ELFLQKYIAQ KKQRNAEVAA A
|
| |