Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_37027 |
Symbol | |
ID | 7204473 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 919755 |
End bp | 920833 |
Gene Length | 1079 bp |
Protein Length | 352 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185980 |
Protein GI | 219121515 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0567529 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGGAA GAAGAGGTGC AACTGCCAAT GGGGAAGGTA GTCGCCGAGA AAAGTTGTAC GAGAGCATCA TGGGTATCGC GATATGCACT ACTAACGCTA CTGCCATTCC GGGAAACGCC GATGAAACTG CCGCTGCCGA GGCGGCCGCC AAAGCGCTTT CCTCCCGACT CGTTACCTTT GTCACTGGCT CACCAAAAAA GCCCGAGTGG ACGTCCACCC GGAATTGTGT ATTGTCAATG GGGAAGCCAC GGACGGGACC CGAAACGCCC CTGTGCTGCT TTTTGAAAAG CATCCAACGG TAAGGGCCCG ACACACAAGG GATCGCACCA TGGGATGTCA AGTACAAGCA GCCCGGGAAC TCAAACCGGA AGAAGTACTC ATGACCATGC CACAATCCGC CATGATTTCA CCCAATCTGG TGACCGCGAG TGATGCCGGA AAGGTGGTCT TGGCGTGCAT CTCACACGCA ACCGGCACGG AAGGATTCTG GGACTTAGTG GAAAACACAA CCTTGTGCAA AGCAACCTTT TTGCCCAAAG TGGTGGGGAA TACGGGGCCA CAGTTGTTGG TCAAGATTTG GCAGGAACGC AAAAAGGTGG AAGCCCTATT TAACCATCGG AAAAATAGTC CCACTACGGC TACCATTGGG GCGTACTCAT TAGCGGCATC CAAAGGTGTG TCCACACAAG CGCCGGTATT GGCATTGCTT ATTCATCTGC AATTTTCCAA CACCAGTCAA CCCGGGGTCT CATCCGGCAT ACAAAAATTG CAAATGGCGC TGGAGAGTAA CGACGGCAAC GCCTTGCGGT CAGCAACCAG TGTGCAAGTC CCTTCCGGGG CCCCGGAGAC CTTTGCTCCG TGCGCATGGA CTCTACCATC CTCTGTGTCG ATTCCGCTGT GCTGGAAACG TAATGAGCTT GCACTCTTGG CAGGATGCAT TCCGGGAGTA TCCTTGTTGA AAGAGGTTGT CACCAGCACA TTACAGCTTG ACCCGGAGTT CACCGCCTTG CTGGAGGCCG GCATTCTGGA ACACTTCCCG GAAACATTTC CACCGAGACT CTTGACCTGG GAGCATTGA
|
Protein sequence | MSGRRGATAN GEGSRREKLY ESIMGIAICT TNATAIPGNA DETAAAEAAA KALSSRLVTF VTGSPKKPEW TSTRNCVLSM GKPRTGPETP LCCFLKSIQR DRTMGCQVQA ARELKPEEVL MTMPQSAMIS PNLVTASDAG KVVLACISHA TGTEGFWDLV ENTTLCKATF LPKVVGNTGP QLLVKIWQER KKVEALFNHR KNSPTTATIG AYSLAASKGV STQAPVLALL IHLQFSNTSQ PGVSSGIQKL QMALESNDGN ALRSATSVQV PSGAPETFAP CAWTLPSSVS IPLCWKRNEL ALLAGCIPGV SLLKEVVTST LQLDPEFTAL LEAGILEHFP ETFPPRLLTW EH
|
| |