Gene Information |
Locus tag | PHATRDRAFT_47818 |
Symbol | |
ID | 7203059 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 144358 |
End bp | 146170 |
Gene Length | 1813 bp |
Protein Length | 406 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182165 |
Protein GI | 219123715 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
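The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based and inclusive (which is consistent with the listed Gene Length) and that the coding sequence ends with one stop codon. The helper names `gene_length` and `min_coding_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def min_coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the Gene Information table.
start_bp, end_bp = 144358, 146170
protein_aa = 406

length = gene_length(start_bp, end_bp)
coding = min_coding_length(protein_aa)

print(length)           # 1813, matches the Gene Length field
print(coding)           # 1221
print(length - coding)  # 592 positions not accounted for by codons
```

Under these assumptions, the 592 positions outside the codons would be introns and untranslated sequence, which agrees with the record marking the gene as spliced.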
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.382636 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGGGAGGACA AAAAGTATCG TGGTCCCCCA GAAAATCTCT GCAATAGCCC CCCCCAGGCC CCTCCAGGTG TCTGCGGTCG GCCCGAGCAA CCATTCTCGC TCTTTCCTAC GCGGTGAGCG AGACGTATCC CGTCTTCGGC CACGGTAGAC CCCGACCCTC GTGCCCTCCT CCTTTGGGTA AAACTTACTC TCCGCTTCGA CACAGCGTTT CCGATCTCCA CCCAGGTCCA AGGAACACCG TACACTAGTA CTGCACAAGG GCATGTCCGC GAGTGCCGTC TTTTCTTGTT TTACTTGTGG CTACGGAGAT AATACTAACA ACGACGGAGC CCAGCGGACC GGAAACGATC TAGTAGCTAC TACGTCCTGG ACACCGGCAC ACGCTCCCGG AAACAATTGC GACCGCAGCA GCATGCGTCG GGTTTCGACT TCCCGACGTT CCAGCGCCAA TCAATACCCC GCTTCCATAC GCGCGCGATC GTCCGAACGA GCGCACTATA GCGACAAAAG TGACAACGAC GGCAACGACG AAACGAACAC TTTCCGTGGC AACACGCTCG ACGAAGATTT CAAGGATAGT ATGACGTTTG AAGATTCGGA ACGCACGGAT CGTCCACGTG GCGGATCCTT CCATCGTCGG GCCATGAGCG ATCCCTACGA TGCACAAGAA AACGAAAAGG ACAGCGACGA AGACGGTCTC GCCAACGATT TGAACGAATT TCTCGAAGCC CAGAGCGCCA CGCACTTGCC AACCTTGGCG CGATATCCGG TTGCCGAAAC GCGCAATCAG AATTGCTGGA GTGAACCGCC GGTCGACATT TTCAGTGTCC GGGGACCGGA TTACTTTGCT TCCAAAAAGA AAATCCAGTC GGGGCCGTAC TTATTGCAAG CTAGAGGATG TGACTTGTTT CTCAACAACA AGGAAGATAG CTCGGTCAGA TTGGAATCCA AGTAAGTAGC TGTGGGTGCG TGTGTGTGTG TGTGTAATTG TAGCTGTGAG TTCGGCAAGA CGATTCCGCA CAATCCAAAG GGAAGCAGCA ACCTTTCTAT TCACTAGCTG ATCCGATATT GCGATCGTTT TTTAGGAGCG ATATTATTCT AGGCGGACGT CTCCATTCCA CCCCCACCAT TTTAGTGAAT TTTCGGTTTC CGTGGGGCTA CATGGTTCTG TACTTCGAAG TGCCAGCCAA GCTTGCCCCC TATTTGAAGC GTGAGAAAGG CAAGGTAGAT ACAGCTTTAA GCACCGCGGA ACAAACCCTG GCCCGCTGGT TGTTGGGGGA TACGAATTAC AAGAATGAGC GTCTCAAACT GATTCCTTAC GTCGCGCAAG GACCATGGGT TGTCCGCAAC ATGGTCACCG GCCGACCCGC TATTATCGGT AAGAAGCTTC CGGTTACGTA CCGAATAGAG CAGAACGCCC TGTTTTGCAC TCTCGACATT GGCTCGTCTA GTGCCACGGC CAAACGCATC GTATCGGTCT GCCGGCGCTA CATGAGTGCC TTGACGGTCG ATATTGGCTT TGTCATTCAA GGTGAGACTC CCCTAGAACT GCCCGAACAA ATGATGGGGT CCGTTCGTAT CCATGGCGTC GATCCACTGA AAGCACCACG CGTGTAGCCA CCGTGTAGAT AGTAGTGACT CCCATCGTGG CTACCCTTTA CAAGGAAGAC ACCATCCCAG CTATATCCTA CACGCTGAGA GAGTTGGACA TGTGTGTAAA GTCGGTCGCA ATACTCTTTT GTTTCTACGG TTCTAAAAAG CAGTGGTTAC AAGGCAGGCT TGTTAAAAAT AAATTGTTGT 
GATGAGCGCC GCT
|
Protein sequence | MSASAVFSCF TCGYGDNTNN DGAQRTGNDL VATTSWTPAH APGNNCDRSS MRRVSTSRRS SANQYPASIR ARSSERAHYS DKSDNDGNDE TNTFRGNTLD EDFKDSMTFE DSERTDRPRG GSFHRRAMSD PYDAQENEKD SDEDGLANDL NEFLEAQSAT HLPTLARYPV AETRNQNCWS EPPVDIFSVR GPDYFASKKK IQSGPYLLQA RGCDLFLNNK EDSSVRLESK SDIILGGRLH STPTILVNFR FPWGYMVLYF EVPAKLAPYL KREKGKVDTA LSTAEQTLAR WLLGDTNYKN ERLKLIPYVA QGPWVVRNMV TGRPAIIGKK LPVTYRIEQN ALFCTLDIGS SSATAKRIVS VCRRYMSALT VDIGFVIQGE TPLELPEQMM GSVRIHGVDP LKAPRV
|
| |
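A GC content figure such as the one listed in the Gene Information table can be computed directly from a sequence written in 10-base blocks like the one above. The following is a minimal sketch. The function name `gc_content` is hypothetical, and the record does not state how its own value was calculated or rounded, so this is one plausible definition and not the database's method.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    # Remove the spaces and line breaks used to group bases into blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy input, not the gene above: 4 of 8 bases are G or C.
print(gc_content("GGCC AATT"))  # 50.0
```

To apply it to this record, paste the full Gene sequence field, including any continuation line, as the argument.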