Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45227 |
Symbol | |
ID | 7200106 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 543287 |
End bp | 544726 |
Gene Length | 1440 bp |
Protein Length | 431 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179453 |
Protein GI | 219117317 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTGTAGGTA TTTCACACCC ATCATTGATT GTCCTACACC CCCTTTGACT GTGCATACCA GCAGTAAGCA CAAATAACAT CAGTAGATTG ATCGCTCGAT CCGTAGCACT AGAAAGCAGT AATGGTGTTT ATTCCCTACC AAATCGATCA CAACGTCAAA AACACGGCGA AAACATTGGG ATTTTCCAAA AAGCGCGTGA GCTTCAAGTT TGGTCTCGCC GATCGCCAAT CCGTCGATAA AGGTGAGACG GGCGTCGGCT GTCGCGGCAG CGAACACGAA GTCACCTTTT TATGGAGTCT GAAAACGGGC AAGCGCCAAT TGTTCCTGGA CGGCAAGGAC GTCCACTTTT CCAATCGCGG CAGAACGGAT GGACCACCGA TCGAGCGTGG CAGCATTCCT TTACTCTACA CGATCGCAAG GACGGGGCCT TTAAGGTTCA TTTCATCTCG CAACCCGTCA ACCGCGACAT GCCCGACGTC AAGCCGTTCG ATTTGCGTCT CAACGGCGTT TCCTACTTTG CCTTTAACCA AATTTATCAG TTGGGGACGC CGTCCATGAC GGTGCGGGAG AGTCATTCGC GAGGGCATCA CGAAAGTGGT AGGGACTCTC CCATGGGAGC AGAAGAACGA CGGGCCCTGG CACAGGCCAA GGTGGAAAGT ATGCGGGATT TTCAAGCCCA GCATGCAAGA CCGCGCATGG ATTCAGCCAA CAGTGGAGCC GCCTCGGCAA TGCGAAGGGA AGAAGAATCA CTCCTTAGCT TTGACGACGA TCCCCCAGTT CAGCAAGTGG CGGCCACGCA CAGTGCCAGC GCTAACAGCA TGACCAGTAG TCAAGGAAAT GGTATGTACG CATCCAATAT TACGCTGGAT ACGGCCATTC AAGACCGATC CAATTCGTAC GGGAATATGC AGGGCTCCTT TGGTAGCGGC GGCGGATACA ACTCAAATCC TCCCGGAGCG GCAGCGTATC AGTATCAAGC TCCTCCACCG GTCAATGTGG CGGCGGCTTC GACAACGACC TTGGCACCCT ACCAAATGCC CGGTGGTGTC GCTCCGACTC CCTATGTTGA CTCGACGGGA CGTCTCGCCA TGGGTACCCA ACCGCAACCG CAGGGTTACA ATTATGCCAC ACCCACCAAC TCCGCTGGCA ACGCGTCGTT TCAGAACCAG TTTCAATCGC CGTCCAATCA GTCGTATGCG AGCTACGGAT CTGCGCCGTC CTTCGCGCAA CCACCACGCC AGCAGGGGGC TCCGGATGGC TTTGCGAATG GCCCACCAGC GAGCAGTAAT CCGTACGGCG CTCCGCCGCA GCAACAAGCC TACGGTCTAC CTCCGCAGCA ATCGTACGCA CCACCGCAGC AGCAACCCAA TGCGGGCAAT TATTACGCAC CTCCACCCCA ACAAGGTGGA TACCCCGCTC CGTCATACCC CGGGTACTAG
|
Protein sequence | MVFIPYQIDH NVKNTAKTLG FSKKRVSFKF GLADRQSVDK GETGVGCRGS EHEVTFLWSL KTGKRQLFLD GKDNGWTTDR AWQHSFTLHD RKDGAFKVHF ISQPVNRDMP DVKPFDLRLN GVSYFAFNQI YQLGTPSMTV RESHSRGHHE SGRDSPMGAE ERRALAQAKV ESMRDFQAQH ARPRMDSANS GAASAMRREE ESLLSFDDDP PVQQVAATHS ASANSMTSSQ GNGMYASNIT LDTAIQDRSN SYGNMQGSFG SGGGYNSNPP GAAAYQYQAP PPVNVAAAST TTLAPYQMPG GVAPTPYVDS TGRLAMGTQP QPQGYNYATP TNSAGNASFQ NQFQSPSNQS YASYGSAPSF AQPPRQQGAP DGFANGPPAS SNPYGAPPQQ QAYGLPPQQS YAPPQQQPNA GNYYAPPPQQ GGYPAPSYPG Y
|
| |