Gene Information |
Locus tag | PHATRDRAFT_45054 |
Symbol | |
ID | 7200071 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 68284 |
End bp | 70002 |
Gene Length | 1719 bp |
Protein Length | 429 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179132 |
Protein GI | 219116675 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
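The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, which is consistent with the reported Gene Length. The function name `gene_span_length` is hypothetical and not part of any database API. The comparison with the protein length further assumes that the span runs from start codon to stop codon, in which case the remainder would be non-coding (intronic) sequence, in line with "Is gene spliced | Yes".

```python
def gene_span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
span = gene_span_length(68284, 70002)

# 429 residues plus one stop codon, three nucleotides each.
coding_nt = (429 + 1) * 3

# Bases inside the span that are not accounted for by the coding sequence
# (assumption: the span is exactly start codon through stop codon).
non_coding_nt = span - coding_nt

print(span, coding_nt, non_coding_nt)  # 1719 1290 429
```

The span matches the reported Gene Length of 1719 bp. Under the stated assumption, the coding sequence would account for 1290 bp of it.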
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.597228 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATTTCCAGCG TCGGATTGCA CACGACCTCT TTTGCTATTC GCCGTTCCCA CAGCACCCAG AGAGCCGTTG GTAGAACGAC CTTTCCAACT GACCACGTTC ATCTCTGCTC TTCTCCCTCT AATTTGCAAT GGTAATCGTC GTTCGGCACA CCGTAATCTG GCTTTTGCTG CTCTCGCTAC ACGCGGCGGT CAACGTCGAA TCCTTCTCAA GCGTTGCGTC ACAGTTTCGA TCCGGTTATA AAAGTGCGAG GACATTGTCT TCGCTGGACG CCAAGCCTCA ACGATTAAAA GAAAATGTAG ATGGAGTTGT GTATGTCAAC GATAAGGTGT GTAGTACGCT TGCAGAAACA GTCGGTCTAC CGAATTGCAT CGTTCCTCGA AATCATCTTC TAACTCTCTA CTTCACGTTT ATTTTCGATA TTTTGAAAGT GCATCAATTG CGCCGCATGT GCCGCTTTTG CTCCGGCGTC TTTTTCCCGC AGCAATAGCG ATAATGCCCA CTTTGTTCAT CACCAGCCGG AAACCCTGGA TGAGATTGAA CATGCCCGGG CTGCCTTAGC GGCCTGTCCG GTAGCAGCCA TTCGAGTCGA AACGTTGGCC GAACGCCGCC ATCGGGCATC CACTCCGGAA CAGAAACAGC TTATCGAAGA CGATTGGACT GAGCAGGAGG AGTCTCTGGT CCGCAAAATG GCAATAAATC CAGCAAGAAA CGGACTACCC AAACCTTTTC CTAAACCGTT GTTAGGTCTT CCCAACGCGT ACTGGGTGGG ACACCATAAC GAAAGATCCT TTGGAGGTGT TCCGTACTTG TTTCAGGCTT CAGTCGCTGG GAAATCCAAA TGGATCTTGG TGGACACACC CAAGTATAGC AAGTCATCGC TGGACGCAGT TGTTTCCGTA ACGGGACCTG AAGGGCCGGA CTATCTCTTT TTAACACACG TGGACGATAC GGCCGATCAT GGGAAATGGG CCTCCCACTT TCCGTCACTG CAACGGATTT TTCATGACGG TGATTTAGGA GAGCACAATT GGTTGGGCGA CGAGACACTG GCAGACGTAG AAATTCTACT ATCAAAGCAG CCAAGGTCCG ATGAGACGGT CTTGACGGCC TATCAAATTG ACGGAACTAT CTTGTCAGAG AAATGGCAAG ATACATTGGA GGATGATGTT GTCATTTTGC ACACTCCTGG ACATTCGCCA GGGAGTATAT GCTTGTACTG GCGATCCAAC AAGCTTTCTA AGGATCAGCA CGACGGTGTT GTTTTTACTG GAGATACATA CGGCTACACG ACTCGGAACG GTGGACAAAT GACCGCATTT CCTCGTTACG GAAATAATTT ACGAACTCTG GCGAATTCGC TGACGGGACT TTTGTCCTTG GATTGGCATA CTATTGCGCC AGGACATGGG GAGGTGAGAG ATTATTCTAA TCACAATGGC TCGGCAGACG ATGTGAAGGA ACTACAGCGG GCGGAAATGC AAACTGCCGT GGAAGAAATG ATGAGTTCCG GTCGATGGTG AATGCTCGAG ACCGAAACTT TTGTATCGTA ACTTGGTGCG GTTTTTTCAC TTTACCACAG GGCGAAACTC AATTGTCATC TCTTAACCGA AAGCCAACAC ATATTTACAT TGATTGACAG CTACTTGTTG CGTAAATACA GAGTGACTTC ACAAACAGTA ACACAAGCAC ATAAATGATG CACATTTGCA CTCACAGTG
|
Protein sequence | MVIVVRHTVI WLLLLSLHAA VNVESFSSVA SQFRSGYKSA RTLSSLDAKP QRLKENVDGV VYVNDKCINC AACAAFAPAS FSRSNSDNAH FVHHQPETLD EIEHARAALA ACPVAAIRVE TLAERRHRAS TPEQKQLIED DWTEQEESLV RKMAINPARN GLPKPFPKPL LGLPNAYWVG HHNERSFGGV PYLFQASVAG KSKWILVDTP KYSKSSLDAV VSVTGPEGPD YLFLTHVDDT ADHGKWASHF PSLQRIFHDG DLGEHNWLGD ETLADVEILL SKQPRSDETV LTAYQIDGTI LSEKWQDTLE DDVVILHTPG HSPGSICLYW RSNKLSKDQH DGVVFTGDTY GYTTRNGGQM TAFPRYGNNL RTLANSLTGL LSLDWHTIAP GHGEVRDYSN HNGSADDVKE LQRAEMQTAV EEMMSSGRW
|
| |
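The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. A minimal sketch of that clean-up follows, with a simple GC fraction helper. The helper names `normalize_sequence` and `gc_fraction` are hypothetical, and the inputs are short excerpts from the record rather than the full sequences. The displayed gene sequence appears to include flanking bases around the coding region, so a GC fraction computed over it need not reproduce the GC content field exactly.

```python
def normalize_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for empty input)."""
    seq = normalize_sequence(dna)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three blocks of the protein sequence from the record.
protein_excerpt = normalize_sequence("MVIVVRHTVI WLLLLSLHAA VNVESFSSVA")
print(len(protein_excerpt))  # 30

# First block of the gene sequence from the record.
print(gc_fraction("ATTTCCAGCG"))  # 0.5
```

Applying `normalize_sequence` to the complete protein sequence and taking its length gives a quick check against the Protein Length field.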