Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49158 |
Symbol | |
ID | 7195655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | - |
Start bp | 95076 |
End bp | 96559 |
Gene Length | 1484 bp |
Protein Length | 462 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183926 |
Protein GI | 219127404 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.472791 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCTAGCTACT ACTGATTTAG TTACGACTAC TACTACGCTT GTCAGTTCCC TCCATTGCTT CATCGACTCA TTCAAATCAA AGGAGTAAAC CATAGAATGA GCAGAAATCG TTCGCGAATC CCGCGGAGAC GGTACACGTG GCCAAAGGCA GCGGTAGTGA TGACGGTAAT GGGTTGGGGA CTCTTGCTCC TCGTATGCAC CGAAGCGGCC CAGTCGTCGT CGTCTCCTTC GGGCACCAAC ACTACCCGTA GTCGGAGTAC GGCTACCGGT ACTACCGACG AATCGTCCAC GAGCGTTACG CGGGTGGGGA ATCTGGATTA TCTCGACGCC GCCACCATGG CGTACTACTT GGATGCCCCG CGGTGGACCG CCGATCATCC GGAACACGAC GTCGTCGTTC TCTTTTACGC ACAGTGGTGT CGCAACTGTC ACGCCTTTGC CCCTCTGTAC GATCAAATGT CGAAATTACT ACACGCCGGT ACCAAAGATT CGCAACTAGT AATGGGATTG TTCGATTGTG AACAGGACAA GGCGCATTCT CGAGTTTGCA GTGACGCTGG CGTTACACAC TATCCCACCA TCATGTTCCT CTCCTCCAGC GGACAAGTCC TCCAACGCGG ACGACGGTCT CCGAAAGTCC CACTGCCCAA ACACATCACC ACCTTTCGTG GTAATTGGCA GTACGGCGAC GCCGTCATGG ATTGGATCAA GACACTGCGA GGGTTGAGTC ACTGGCATCG CGCTGGATGG GGGAAAACGC TCCGGAATCT TCTCTTTGGA CGACGCCACC GTGATCCCGC CCGCGAACGA CTCCCCACCG GAATTCCGTC GGGTACAAAC CGGCGGGACG GAACCACGAC GGCCGGGAAC GGGGCCACCC ACGCCTCGCA CGAACAACAA GAATTGCGAG ACGAAATCCG ATCCTTGTCC GATCTCGTCA TTCGCTCCAG TACCATTGTG GACGCCCTAT TGTTCCCCGT CACCGCTGCC GCAAACAAGA CTCTATCCAC CAATGTGATA CGCGACGAAA ACGCCAAGAA CTACACCGAC GTCTTTGCCT TTTTGCGAGA CGCCTGGCAC AGCAACCGCA CCTCCCACCA AGTAATACGA ACCTGTGCCA TGGAAGTGGC GTTAGACTAC TGTGGACGCT TGAATACGCA CGTGACGGAG GATTGGTTGA CGGCATTTCC CTCCATCGAC CGCATTACGG AAGCCGATTT GAATCTGTTC CGGAACGAAT TACCGAAACT TGTGGCCAAG CAGGAACCCT ACTGTGCGGT AGTGGAAGAC TGCATCGTTG GTGATTTTGC CGAAGAGCAT TGTCGCCCCG CTGCTTGTCC CTTTGTCGAT CCCGCCGCGT GCCGGTATCT GACGTGCTGT CTCACCGAGC AAGTCTACGA AGAATACGCC GTGGCCATGG ATTTGGTTGA AAACGTCACG GCGGGCACTT CCGCAAACAT TGACGCAGCC GACAAAGATA CGCC
|
Protein sequence | MSRNRSRIPR RRYTWPKAAV VMTVMGWGLL LLVCTEAAQS SSSPSGTNTT RSRSTATGTT DESSTSVTRV GNLDYLDAAT MAYYLDAPRW TADHPEHDVV VLFYAQWCRN CHAFAPLYDQ MSKLLHAGTK DSQLVMGLFD CEQDKAHSRV CSDAGVTHYP TIMFLSSSGQ VLQRGRRSPK VPLPKHITTF RGNWQYGDAV MDWIKTLRGL SHWHRAGWGK TLRNLLFGRR HRDPARERLP TGIPSGTNRR DGTTTAGNGA THASHEQQEL RDEIRSLSDL VIRSSTIVDA LLFPVTAAAN KTLSTNVIRD ENAKNYTDVF AFLRDAWHSN RTSHQVIRTC AMEVALDYCG RLNTHVTEDW LTAFPSIDRI TEADLNLFRN ELPKLVAKQE PYCAVVEDCI VGDFAEEHCR PAACPFVDPA ACRYLTCCLT EQVYEEYAVA MDLVENVTAG TSANIDAADK DT
|
| |