Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_16421 |
Symbol | |
ID | 7198799 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | - |
Start bp | 241256 |
End bp | 242542 |
Gene Length | 1287 bp |
Protein Length | 398 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184926 |
Protein GI | 219129501 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TATCTGCTGG TGTACCTGCT CGCCACGCTC AGCGACTGGT TGCAGGGACC GTACGTCTAC GCCCTTTACT CCGATTATGG CTACTCACAG CACGATATTG CCGTCCTCTT TGTGGCCGGC TTTGGATCCT CCATGGTCTT TGGATCGTTC GTGGGCGGCA TGGCCGATTG GGGCGGACGC AGGACCTTTG CCGTTCTCTT TGCCGTCGTC TACGCCTGCT CCTGTCTAAC CAAACGTACG TAGTATGCAC GCACGTGCAT TGTGATGGTC ACACACTCCA TGCTCGTCCA TAGATACTCA CGTTACCTCC ATTGCTGTGG GAATACAGAC TTTAAAAACT TTAACGTGCT CCTCCTGGGT CGTTTGTTGG GTGGTGTCTC CACCAGTTTG CTCTTTTCCG TCTTTGAAGC CTGGTTGATT CGGGCCCACA ACGATGCCGG CCTTAAGGCC TGGTTAGGCA AATCTTTTTC CTGGGCCGCC TACGGAAATT CCGTCGTTGC CATTACGGCG GGACTCGTCG CCAATAAGGC CGCCAGCGCC GTGCCCATGA CCGCTATCCA AACCGGCGGA CAAGTGTACA TGGGTGGCTA CCTGAATCCG TTCGATATTG CGCTCGTGGC CTTGCTGGGA TGCGGCATAG CTGCACTGTC TCTGTGGGAA GAAAACTACG GAGATACGGA CGGATCCAAC GATAGCAGTC GTGGCCAAGC CCACTGGTAC GATGGTCTCC AGACCGCCTT TACCACCACC ATTCGCTCGC AAGATGTCTT GCTCTGTGGC ATTATTTCTT CGCTCTTTGA AGGTAGCATG TACATCTTTG TCTTTATGTG GACACCAGCC TTAACGGAAG GATCCGATGA AGCCTTGCCC TTTGGACTCA TCTTTTCAAC CTTTATGGTC TCCTGCATGG CCGGCTCGAG CTTGTTCTCT ATACAGATTG AAAAAATGCG GGGCGAGCGT CTCGCCGTCA TTGTGTTTGC CACGGCCTCC GCCGCCATGG CCGGAATCGC CCTGTCCTAT TCCAATACCG TCAAGTTTTT GCTCATGAAC GTCTTTGAAG TCACCGTCGG CATGTACTGG CCGATTTACG GAACCCTCAA GGGCGTCATC GTGCCGGAGT CCAAGCGGGC CGCCATTTAC AATCTCTACC GTATCCCACT CAACTTTATC GTCCTGTTCT CCCTCTTGAC CGATTTAACC CCCACGACGA GCTTTTTGTT GAACGCCACC ATGCTGGGAA CCGCTGCGGT GTTGCAGATT ATCCTCATGA AGCGCCGCGA AATGCAC
|
Protein sequence | YLLVYLLATL SDWLQGPYVY ALYSDYGYSQ HDIAVLFVAG FGSSMVFGSF VGGMADWGGR RTFAVLFAVV YACSCLTKHF KNFNVLLLGR LLGGVSTSLL FSVFEAWLIR AHNDAGLKAW LGKSFSWAAY GNSVVAITAG LVANKAASAV PMTAIQTGGQ VYMGGYLNPF DIALVALLGC GIAALSLWEE NYGDTDGSND SSRGQAHWYD GLQTAFTTTI RSQDVLLCGI ISSLFEGSMY IFVFMWTPAL TEGSDEALPF GLIFSTFMVS CMAGSSLFSI QIEKMRGERL AVIVFATASA AMAGIALSYS NTVKFLLMNV FEVTVGMYWP IYGTLKGVIV PESKRAAIYN LYRIPLNFIV LFSLLTDLTP TTSFLLNATM LGTAAVLQII LMKRREMH
|
| |