Gene Information |
Locus tag | PHATRDRAFT_19805 |
Symbol | |
ID | 7200026 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 860378 |
End bp | 861578 |
Gene Length | 1201 bp |
Protein Length | 279 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179523 |
Protein GI | 219117457 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage Information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000518484 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ACGCTTGCCA TTGACCATTC CTGTTGATTT CATTCCGTAA ACGTTTTACC ATGTGCTTGT GTTGTTTTAC AATCTCCACC GCCGAAGTCG GCGTCATTGA ACGCTGGGGC AAATACAGTC GCCTCGTACA GCCCGGACTC AACGTGATTT GCTGTCCCAT GGAGTCGCTC GTGGGCAAAC TCAGTTTCCG CGTGCAGCAG CTCAACGTAC GCGTCGAAAC CAAGACTCTC GATAACGTCT TTATCACTTC CGTAGTATCC GTACAGTACC AAGTCCTCCG CGACAAGGTG TACGAGGCCT TCTACGCGCT TTCCAACCCC GCCAGACAAA TCACCGCACA CGTCTACGAC GTGATGCGTT CGCAACTACC CACACTCGAA CTGGACGCCG TCTTTGAAGC CAAGGAAGAC CTCGCGCTAG CCGTCAAAAA CGCACTTTCC GAAATCATGA CTACGTACGG ATATCAGATT GTGCAAACTC TCATTACCGA TTTGGATCCG GATCAGGTAC GTCCCGTGTA ATGTTGGGGC GTCTCTCTAT GTGTTAGTGT GTGTGTGTGC ACGATACTTT GCGCGCCGTG GTTCGGAAGG AGGGTAGTTC ACGTCGCGCA CTTAATCTTT GGTCAACATT GCGAATGTGC TCATACTCTT TTTTTTCGTT CGCCTTGCTC CACAGCGCGT GAAAAACGCC ATGAACGAAA TCAACAGTTC CAAGCGACTC AAATACGCCG TGGCGGAGCG TGCCGAAGGA GACAAGATCC TCAAGGTCAA GGGCGCCGAA GCCGAGGCCG AAGCCAAGTA CCTTAGTGGT GTGGGTGTCG CCAAGCAGCG CAAAGCCATT GTCGATGGCC TGCGTACGTC AATCGTCGAC TTTTCCGATC ACGTGGAAGG ATCCAGTACC AAGGAAGTTA TGGATTTGTT GCTCTTGACA CAGTACTTTG ACATGATTCG CGACGTGGGA GCCGAGAGCC ACTGTAAGAC AACCTTTGTC CCATCGTCTC GGGGTGCACC CGACGACATG CGCAACGCAC TCCTGCAATC GGCCGCCGGA AGACTCTAAA AAGTTTGCGT CCGAACGATG ACTACCTATT TCTGTCGTTC TCCTCGTTTT ATTGCCTTTT GCTCCTTGTT TTTCTTCGCG GCAATCCGCG ATTCTAATCT GTTTCGTGTC TCAATAATCG TACTACGTTT CTTTTATACA T
Protein sequence | MCLCCFTIST AEVGVIERWG KYSRLVQPGL NVICCPMESL VGKLSFRVQQ LNVRVETKTL DNVFITSVVS VQYQVLRDKV YEAFYALSNP ARQITAHVYD VMRSQLPTLE LDAVFEAKED LALAVKNALS EIMTTYGYQI VQTLITDLDP DQRVKNAMNE INSSKRLKYA VAERAEGDKI LKVKGAEAEA EAKYLSGVGV AKQRKAIVDG LRTSIVDFSD HVEGSSTKEV MDLLLLTQYF DMIRDVGAES HCKTTFVPSS RGAPDDMRNA LLQSAAGRL