Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47096 |
Symbol | |
ID | 7202171 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 388440 |
End bp | 389901 |
Gene Length | 1462 bp |
Protein Length | 324 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181199 |
Protein GI | 219121700 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCGACATATC GACATTATCA TTCTCGATGT ACCCTGGGTA GATACAATGT GGACTTTCTC GAGACTGCTG TCCGCATGGA AGTCAGCCGA GCTTTCGTTG TGTGTAGCAG GCGGTGGAGC CCTTATTTAC AAGGCCATCT TATCTCAATA CGGAGGAACC AAGGCGGAAC CGCAAACTCC GAAATTCTTT TCGGATAACG ATACGAGCAC TGATGACGCA TATGAAGGTC AATGTCTTCA TCGCCAGCTA TACCAACCCT GCGGACCCTA TCCGAATTGG GATTACAATT GGGATGGTCG CATGCGGGAT GATTCTACGT TGGAATATCT CTCTACCGAA CAAGGATTGA AACGTTCCAA GTCTGCCGGC AAAACGCGTC ATCTTCTGTT GATTCGACAC GGATAGTACG ATGAGACGAG CAAGGATGAC GACGAGAGGC ACTTGACTCC TTTAGGACGC CGTCAAGCCG AGCTAACCGG ACGACGGCTG AGCTCACTGG CTGCAGGTGG ACTCCGTGGC GCCGAAACAC GCTTTACCAG GCCATGCTCT TTCAAAGCGA TACACCAGTC GGATATGACG CGAGCAAAAG AAACTGGTAC GCGTCAAAAA ACCCTTTTGT CTTGAAGTAG TACTCTGACT TAATCTCTTT CTTATTTTTC GCCGCAGCTG CGATAATAGC ATCCTATTTA CCTCAAGCTC GCTTGAGGAA ACCCGATCCG GCGTTGAACG AAGCTCTCCC CTGCCCGTGA GTTTTGCAAC GTTACAGACT CTTTTCATTT TATTATGCGC TCTCTGACAC TGAAATGTCT GGCCGGGGTA GCATGATTCC TATACGTCCC GATGTCCCCG ACGCCGAGAA GGAAATCGAC GACAATCACG TGCGTATCGA GACTGCTTTT CGCAATTACA TCCACAGAGC CGATCCTGAT GATCTTGGTG ATCCCATTGA GCACGAATTT GAAGTTTTTG TTGGTCACGG GAATATTATT CGGCTTGCCG GTACGTTGAA GGATTTGCCC TGCCTGTCCT TATCGATGTT GACGGATTGG GCTCATCACT TAAAATATGA AAGGGCACTT CAACTTCCTC CAAAAGCTTG GCTGAGGCTG AGCATCTTCA ACTGTTCGAC AACATACATT GTGATACATC CCAATGGTTA TTGCTCAGTT CGAATGCTGG GAGGTGAGTA AAATACTTGT GTACCACAAC CTTGCACCGC GAATTTCCAA CCCATTTTCA CTCCTCACAT TCATGCTATT CTGATGTAGA CATTGGACAC TTGAGCTATG ATGACACGAC CTTCTCCGGA AACCACGGAT TCAACTGGTA AAGTGAGTTA CTGTTCTTGT CGTACGTTGA GGCTGAACGA TCTAGAAGAA AGTAAGGGAA TGCTAGCTAT TGCGTCAGAG CTTTACCAGA GGTTTCGGCG CACGTAACTG GCTTCGGTAG TATTCTCGAT AA
|
Protein sequence | MWTFSRLLSA WKSAELSLCV AGGGALIYKA ILSQYGGTKA EPQTPKFFSD NDTSTDDAYE GQCLHRQLYQ PCGPYPNWDY NWDGRMRDDS TLEYLSTEQG LKRSKSAGKT RRRQAELTGR RLSSLAAGGL RGAETRFTRP CSFKAIHQSD MTRAKETAAI IASYLPQARL RKPDPALNEA LPCPMIPIRP DVPDAEKEID DNHVRIETAF RNYIHRADPD DLGDPIEHEF EVFVGHGNII RLAGTLKDLP CLSLSMLTDW AHHLKYERAL QLPPKAWLRL SIFNCSTTYI VIHPNGYCSV RMLGDIGHLS YDDTTFSGNH GFNW
|
| |