Gene Information |
Locus tag | PHATRDRAFT_47820 |
Symbol | |
ID | 7203061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 155707 |
End bp | 156949 |
Gene Length | 1243 bp |
Protein Length | 329 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182166 |
Protein GI | 219123718 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
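The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates, which is consistent with the listed gene length, and that the coding region is one uninterrupted reading frame ending in a single stop codon (the record lists the gene as not spliced). The function names are illustrative and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def cds_length_bp(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
gene_len = feature_length(155707, 156949)   # matches "Gene Length | 1243 bp"
cds_len = cds_length_bp(329)                # 329 codons plus one stop codon
flank_len = gene_len - cds_len              # bases outside the coding region

print(gene_len, cds_len, flank_len)
```

Under these assumptions, the listed gene sequence is longer than the coding region alone, so part of it would lie outside the reading frame that encodes the 329 aa protein.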
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00872972 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCCACATCCT TCGTTCCATT CTCTCACCAA TCTCGAACGT TACACATTCA CACTCTGTAC TGTGTGTATT GGCAACATTA TTAACAATAC AGTCTTGTAC TGATTGACTG CGCCAGCATC GATTGATTAA TTGATTGATT GAGACGAACG AAGGAACGAA TTGATCGTAT TGATTTTCTC CGCGAAAGCG CACAACGACA TGGCTTCGAG TGGTGCGGCG GCGCGGGGTC CCCTGGGTTG GTTGCTGCGA TCCACTGGAT CTCAATGGCT AGTACTGGGA GCCGGAACGA TTTCCTTCTT CCCGGAGCAA GTCCGGGAAG TCGCCTGGCC GACGGTGAAG AACGCCTTGG GTTTGACGGG GTCCGATGCG GATTGGCCTT CCTTCTCGTT GTTGTCGTCC CAACGCAGTC TCCGCGCGGA CAGCGAGAGT CAACGATCCG TGCCCAGTTC CATCGTCATT CACACCAGCA ACGGCAGTAG CACGCAGACG TGGATGACGA CTATCGTCAC TTGGACGGTG GGGGCTACCG CCGTCTGGGT CGCCTACTCG GTCTTTAGCA ACTATCTTCC GGATCAGATC AAAACTATGC TGCCCGTTAC CCGCCGCGTC TTTGACAGGG CCACGCAGAC GCTGGCCGAT CACGTCTTCC ACGTCAAGGA TGTGCTTGGC AAGCAATTAC TCAGACTGAC AGCACAGCAG GATGAACTCG CCTCGCAACA ACAGGAAACC CACAAGGATG TCCGGGTAGT CCAACAGGAT ATGAAACAGG CACGCCTCGA ACTCATGCAA CTGCTGTCCG GTATGGATCG CTGCGAAGTG CGTTTAGAAG ATTCGGCTGC CGTACAGTCC TACACGGCCC GGGGAGTCAA GTTGCTAGCC AAGTGCGTCG CCTCTATTAT GCCCGGCAAC GAACGGATTG GCCACGAACT CGAACGTTTC CAACGGGATG ATTATCCGCT GTTGGACAAT ACCACGGCCA ACCATCCCCA CGGTAGGGAC GCGGGTCCCG AAAAGGAACT CTCCTCCAGC CGAACTCCGA AACGAAGCAG CAGTTCCAGT ATCCCCTTGC TCAAACGCGA AAGTATGGAT CGGTCACTTT CGGACGAGAC GGTCGCGAGC TTGGATAGTA TGACGACGAA CGATTTGGAG ACGCTGCTTA GCACCGGTCG TATCGTTCTC GAAACGTAAA ATTTGGAAAA AACATCTTAA TACATACACA TAGTAAAGTG ACAATTACAG TCG
|
Protein sequence | MASSGAAARG PLGWLLRSTG SQWLVLGAGT ISFFPEQVRE VAWPTVKNAL GLTGSDADWP SFSLLSSQRS LRADSESQRS VPSSIVIHTS NGSSTQTWMT TIVTWTVGAT AVWVAYSVFS NYLPDQIKTM LPVTRRVFDR ATQTLADHVF HVKDVLGKQL LRLTAQQDEL ASQQQETHKD VRVVQQDMKQ ARLELMQLLS GMDRCEVRLE DSAAVQSYTA RGVKLLAKCV ASIMPGNERI GHELERFQRD DYPLLDNTTA NHPHGRDAGP EKELSSSRTP KRSSSSSIPL LKRESMDRSL SDETVASLDS MTTNDLETLL STGRIVLET
|
| |
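As a rough illustration of how the sequence fields relate, the sketch below computes the GC fraction of a nucleotide string and translates a short stretch of the gene sequence. Because the Translation table field is empty in this record, the standard genetic code (NCBI table 1) is assumed here. The helper names are hypothetical, and the 30-base fragment is the part of the gene sequence above that begins at the ATG encoding the first residue of the protein.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the record does not state which table applies.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in blocks of ten and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # X marks codons with unexpected symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 coding bases, copied from the gene sequence in the record.
fragment = "ATGGCTTCGAG TGGTGCGGCG GCGCGGGGT"
print(translate(fragment))
```

The translated fragment can be compared with the first ten residues of the protein sequence field. Applying `gc_fraction` to the full gene sequence would give a value to compare with the GC content field.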