Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_16401 |
Symbol | |
ID | 7198614 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 279605 |
End bp | 280733 |
Gene Length | 1129 bp |
Protein Length | 342 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184768 |
Protein GI | 219129169 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGCATT CCCATTCTGG ACTCGGAAAG CTCACCCTTT GCACAGGTGG TATCTGCATT TGCTACTTGT ACTATGGCAT CTTACAGGAA CGCCTCTTTA CGGGTGAAGA ACGCCTCGGA GCGACTTTTG TTTTGTTAAC GCAATGTATC ACCAACACGG TGGTGGCGTT CATATGGCAT GCGATTCAGG ATATCTTTTT GCAGCAAAAC CAGCAAACAC CATCGAGTAA CATTGTTGGC AATCGACCGT TGCCACAAAT TACGTTATGG ACGACTTCAT TGTGTTACGT CACGGCCATG ACGTGCAGCA ATGAGGCAAT TGCGTACGTA TCATATCCTG TCGCTGTCTT GGCCAAATCA TGTAAGCTCA TTCCGACTAT GCTAGTTGGA CAATTCGTCG AAAAGCGACT GTATTCTACC ATGGAATGGA TGGCTGCCTT GTGTATATCG GCCGGTATTG TCCTTTTTAA TGTCAATCGT ATGCAGCAGC AATTGCGGCA CGACATTCTG CACGACGGTA GCGCTGCGCA ATACGGTACG ATTTTACTGT TGATCAGTTT GAGCATGGAC GGACTGTTGA GTTCATGTCA GAACCTCTTG AAAAACTGTG GAGATCGTTA CCAACCGCCC AACGCGATGG AAACTATGCT CTATGTTAAC GGATACGCTG CCGTCTTACT CATACCCTTG AGTATGTACA GTCAACAGTG GGAAGTAGGA ATTGACTCTC TGTTTCGGCA ACACGGTCCC ATGGCTTCCA ATATTGCTAT TCTCAATGCG ACTGCCGCAA TCGGTCAGAT ATTTGTCTTT TTAACTATCA CATGGTGCGT CGAGTTGTGA ATAAGAGCTG AGGTCTTACA GACTCTTGGT TTTGAAATTT GACGGATTTC TACTAACTGC GCCGTTTGGA TACGCCTTTG TCAGGTTTTC GCCGATCATT ACAACAACGA TTACTACAAC CCGCAAATTC TTTACCATAC TTTTATCAGT GTGGACATTT GGACACGCAT TCAATGCTTC GCAATGGACC GCAATCGGTC TTGTCTTTGC GGGCCTTTTC CTGGTCATTT ATGTACAGCG CCAAAAAAGT AGGGTAGATA CCGCCCCCGC AAAATCTAAA CATTCTTAA
|
Protein sequence | MEHSHSGLGK LTLCTGGICI CYLYYGILQE RLFTGEERLG ATFVLLTQCI TNTVVAFIWH AIQDIFLQQN QQTPSSNIVG NRPLPQITLW TTSLCYVTAM TCSNEAIAYV SYPVAVLAKS CKLIPTMLVG QFVEKRLYST MEWMAALCIS AGIVLFNVNR MQQQLRHDIL HDGSAAQYGT ILLLISLSMD GLLSSCQNLL KNCGDRYQPP NAMETMLYVN GYAAVLLIPL SMYSQQWEVG IDSLFRQHGP MASNIAILNA TAAIGQIFVF LTITWFSPII TTTITTTRKF FTILLSVWTF GHAFNASQWT AIGLVFAGLF LVIYVQRQKS RVDTAPAKSK HS
|
| |