Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39894 |
Symbol | |
ID | 7195530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 298740 |
End bp | 299891 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | |
GC content | 63% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183850 |
Protein GI | 219127246 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.462205 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAGTA CTAGTCACCA GGGTCCCGGT AGTTTTCCGC ATATACCGCG GGCTCCGTTG CCGACGACGG TCTTGTTGAC CACGGTGGCC CCGTGTTTGC GCACGTCCGG GGCCTTGCGC GAATGGCTCG GCCACGGTGG GGGCTACGTC CCGCGGTCCC TCCAGTGTGT GGGCGATCCC AATGGTCCCG GGACGGCCCT CGTCACACTC CCTCACGCCG AAGCCGCCGC CAAATTGGTC GCGGCAATCC GACACAACCA CACGGACAGT CACAACCACA TCGGAGCACA CTTGGTTCCC GTCAATCCGG ATATTCCCTT GCCACCCGCA TTGGTGGATC CGGAAACGAC ACAAACATTG GCACAGGCCC TCCGCGACAG TTTGCACAAA ATCACTCGCG ACGGACCTTC CACCGACGTG ACAACCCCGG CACTCGCTCC CACTTTGCCC GAGCGCACCC CGGACGCATC TTTACGTCCC GCCGTCGAGC CCCACGCCGA CCCCGAGGAC GAAGATCCCT TGACGTCTCC AGCCGTCTTG AACGCCGTCA AGGCCTTTCG GGATCAACTC GAAGTACAAC AGGGCACCAA AGCCACCCGC CGTAAAGCCT TGGTGGCACA GACGCTGGCA CAAGTCTTGC CCGCGATGCG ACTCCGGAGG CAACAGGAAC CCTCGGTGAC GCACGCACCC ACACCCACCC CACTACCACC AGCAGCAATA GTGCCAACAC CCACCACCAC AACCCTGCCG GTACCGCCGC CACCACCGCT TCCCGCCCAG GCGCCTCCGG CACCCCGGGG TGTTTCCAAC TTACCCGCCT GGATGACGCA AGCCAATCTA TCCGCGGAAC CACCAACCGC GTCGGCCACG GAACCACCCC CCAGCAAACG ACCCAAACTG GATACTGCGC AACCCTTCCC GGCCCTCCCT CCCGCGGCTC ACGCACCACT CCGGGACTTT GTCACGGCAC AAATTCAGCA TTATCTAGGC GAAGCCGAAA CGAGTCTCAT TGAATTGATT GTACAGTTTG TCCTTCGACC CGACGGGGAG CCAGCCCAAG GCCTCCTACC CGAACTCGAT GTCCTCGAAG ACGATGCCCA CGCGTTGCTC CAGGCACTCT GGGATCACAC CCAACACCTG GCCACGGCGT AA
|
Protein sequence | MSSTSHQGPG SFPHIPRAPL PTTVLLTTVA PCLRTSGALR EWLGHGGGYV PRSLQCVGDP NGPGTALVTL PHAEAAAKLV AAIRHNHTDS HNHIGAHLVP VNPDIPLPPA LVDPETTQTL AQALRDSLHK ITRDGPSTDV TTPALAPTLP ERTPDASLRP AVEPHADPED EDPLTSPAVL NAVKAFRDQL EVQQGTKATR RKALVAQTLA QVLPAMRLRR QQEPSVTHAP TPTPLPPAAI VPTPTTTTLP VPPPPPLPAQ APPAPRGVSN LPAWMTQANL SAEPPTASAT EPPPSKRPKL DTAQPFPALP PAAHAPLRDF VTAQIQHYLG EAETSLIELI VQFVLRPDGE PAQGLLPELD VLEDDAHALL QALWDHTQHL ATA
|
| |