Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_28694 |
Symbol | |
ID | 7202430 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 474967 |
End bp | 476085 |
Gene Length | 1119 bp |
Protein Length | 316 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181732 |
Protein GI | 219122811 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACGCAAATA CTTCGTTGTA GCAACCATGT CTTCCGACAC TTTGACGCTT CGTGGAACCC TCAAGGGTCA CGGCGACTGG ATCACCAGCC TCGCGACGAC GCCTGAAGAC CCCAACCTTC TCCTCTCTTC TTCTCGTGAC AAGTCCGTCA TCGCTTGGCA CTTGACGCAC TCGTCCTCTG GCGAAGACGA CTCGTATGGT TACGCCCGTC GTGCCTTGCG TGGACATTCT CACTTTGTTT CGGACGTTGT CATTTCGTCC GACGGTGCCT TTGCCCTTTC TGCCAGTTGG GATTCCGAAC TCCGCTTGTG GGATATCGCC ACTGGCAAAA CCACTCGCCG CTTCGTTGGC CACGAGAAGG ATGTGCTCTC CGTCGCCTTT TCCGCTGACA ACCGTCAAAT TGTTTCTGGT ACGCTTGCTC GCGAATGCTG CACGTCTTGA AACGTCGAAT AGCCTAGTTC TCACAGGAAG TGTACCTTTT GCCTTTCCAG GATCTCGCGA TGCCTCTATC CGTCTGTGGA ACACTCTGGG AGAGTGCAAG TACACCATCA GTGGAGACTC CGAAGGCCAT TCTGAATGGG TCTCTTGTGT TCGCTTTTCT CCTTCGCAGT CCGTGCCACT CATTGTCTCT GCTGGATGGG ACCGTCTCGT CAAAGTGTGG AACTTGACCA ACTGCAAGCT CCGCAACGAC TTGGTCGGAC ACACCGGCTA CCTCAACACG GTCTGCGTTT CGCCCGACGG TTCTCTGGCC GCATCCGGTG GTAAAGATAG CACTGCTATG CTTTGGGACT TGAACGAGGG CAAGCGCCTG TACTCTCTGG ATGCTGGTGA AATCATCAAT GGCCTCGTGT TCTCACCCAA CCGCTACTGG TTGTGCGCCG CCACGGATGA TTCCATCAAA ATTTGGGACC TCGAATCCAA AATTGTTGTG GATACCCTCC GCCCCGAAGA ATCCGAAAGC GGCAAGATTC CCTCCTGCAC GTGCTTGGCC TGGTCCGCTG ACGGATCAAC CCTCTTTGCC GGATTCACGG ACAATGTCAT TCGTGTGTAT GCTGTCTAAC TGTAAGTCGT ATACTACAAA GGAATAGCTT AATGTTAGCG AAACGTTAAG TAATCTCGT
|
Protein sequence | MSSDTLTLRG TLKGHGDWIT SLATTPEDPN LLLSSSRDKS VIAWHLTHSS SGEDDSYGYA RRALRGHSHF VSDVVISSDG AFALSASWDS ELRLWDIATG KTTRRFVGHE KDVLSVAFSA DNRQIVSGSR DASIRLWNTL GECKYTISGD SEGHSEWVSC VRFSPSQSVP LIVSAGWDRL VKVWNLTNCK LRNDLVGHTG YLNTVCVSPD GSLAASGGKD STAMLWDLNE GKRLYSLDAG EIINGLVFSP NRYWLCAATD DSIKIWDLES KIVVDTLRPE ESESGKIPSC TCLAWSADGS TLFAGFTDNV IRVYAV
|
| |