Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43603 |
Symbol | |
ID | 7197478 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 945381 |
End bp | 946819 |
Gene Length | 1439 bp |
Protein Length | 411 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177724 |
Protein GI | 219111945 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00179449 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTACGGA CAATTGCGAG AGACACACAC GGACAAGGAC GCCTAAAATC AGGACACGGA TCTGTCGTCT TCACAATTCT TAACAGTAAG ACATGTCGAT AACTGAACCA GACAACGGTC GGGTTCGGAT GTGACCAAAT GGCAACATGT TAAAATTCCC TTTACCGCCC TGATTCGAAG TCTTTACTTT TATGTTAGCA CGCACTGTTC TTAGGATTTG CTACTTCCGA TCTGACCGGG ATGTCGCTTT CACCGTTGCA ATCCCCTGTC TCTGGTCCAC CTCAGCACCG CCGAGCGTCA GCCTGGCTAC GTTCTGGAGC CCCTCTCACG ATTGGGTTGC TACTGTACAT TGTTGTGTTC CAATATTCTG TATATCGTGC CGATGTGAAA CCAGTAGCAT CGCTACGCGA GGATCGGCAT TTTTTATATT TGCAAAACAA TCTAAAATGC AAAAAGGCAA GCAAATTAAG CCGCCCACTT TCAAAAGAAA ATCCTGGTCT CGATTTTACC CGACAAGACA AGCTGTATTC TCATCTTAAA AACGATACAG TGGTTTTGCC CGTAACAACC TGGGAGCGTT TCAGCATTAG ACCACTCAAG ATACCATACC CAATATTTAT GACAAGTTTG CCGAAAAGTG GCACCACAAG TCTGTGGCGA TATTTTCGGT GCGGAGAAGT CAATGCGTCC CATCAATACG TGACTAAGCA AGGCGAAAGG AAAGCGACAC TAGCGGGTGT GTGTATTCAA GACAATATTA AGCAAGACAA GCCTCCCTTT GAAGGATGTG GTGAATATGA TTTGTTTTCT GATACGGGTG TAAGTGGCGC ACTGTAATCG TGGTGCGATC GTTGCTTGTA GGTCTCTCAC AGTCTTTCCA CCCTCTAAAC TAGTATCTTT TCTATGACCA TGATGAAGGG GAACAGTGCT TCTTCCCTTC GGTTGATGCC TTGGATCGTG TCTACGCTCA CTATCCGAAC GCAACCTTCA TTAATGTTAT TCGCGATACT GCGTCATGGT TTACATCGTT AAAGAACTTC GCGCAATCGT CTTTGTTTGT GAGGCTTCGA CTTTGCAATG GGACACATTT CCCTGACGGC CAGTCAAAGG TGCAGGATTG GTACAAGTTC TACAATTGGC ACAATAAAAT GGTACAGCAA TTCGCTGTAG ACCATCCGTC GCTCACATAC ATCGAAGTCG AACTAGAAAG CAACACAACT TCTAAGGTAC TAGAAGACTC GACCGGCATC CCATCACAAT GTTGGAAAAA ATGTCGACCA AACAAGATGC TGTGCGACGA AGAACTTGAG GCACGAGATC GCAAGACAGT TGAGGACTCA AGAAAGGGGG CAAAGCGCGA GGTGGTGAAA AAGCTGGAGA TCAAAAAATC GCGTATCCTC AAGTCAGGAA AGTTAATCGA AATGACTACA AAGCATTGA
|
Protein sequence | MVRTIARDTH GQGRLKSGHG SVVFTILNRF ATSDLTGMSL SPLQSPVSGP PQHRRASAWL RSGAPLTIGL LLYIVVFQYS VYRADVKPVA SLREDRHFLY LQNNLKCKKA SKLSRPLSKE NPGLDFTRQD KLYSHLKNDT VVLPVTTWER FSIRPLKIPY PIFMTSLPKS GTTSLWRYFR CGEVNASHQY VTKQGERKAT LAGVCIQDNI KQDKPPFEGC GEYDLFSDTG YLFYDHDEGE QCFFPSVDAL DRVYAHYPNA TFINVIRDTA SWFTSLKNFA QSSLFVRLRL CNGTHFPDGQ SKVQDWYKFY NWHNKMVQQF AVDHPSLTYI EVELESNTTS KVLEDSTGIP SQCWKKCRPN KMLCDEELEA RDRKTVEDSR KGAKREVVKK LEIKKSRILK SGKLIEMTTK H
|
| |