Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_16073 |
Symbol | |
ID | 7198316 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 399655 |
End bp | 401076 |
Gene Length | 1422 bp |
Protein Length | 474 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184362 |
Protein GI | 219128317 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.760064 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGATTCG ATCGCATGAC GGAAATTCAA GCCAAAACCT TCGATGCCGC GTCTTTGGGA AAGGACGTGC TCGGGCGCGC CCGCACCGGA TCCGGCAAGA CCGTCGCCTT TCTCCTACCC GCACTGGAAC GACTCTTGCA GGACAACAAC AACAAAAGCA ACAAAAAGTC GACGCGCATG CTCGTCTTGT CGCCCACACG AGAATTGGCG CAACAGATTG CCGAACAAAC CCGGCTCTTG ACGGCCCATA TGCCCAATAT GTCGCACCAA GTCATGGTGG GGGGAACCCC CAAGCCCAAG GACGTCTCGG CCATGAAGCG CAAGGTACCC ACCATTATCA TTGCGACTCC TGGACGACTA CAGGATCATT TGGAATCCAC CGTCGTACAC AACACGCCTT TTAAGGATCT CTTCCGGGAA CTCGATGTGC TCGTTTTGGA CGAGACGGAT CGACTCCTCG ATATGGGCTT TCGTCGAGAA ATCGACAAGA TTATCAAATA CCTCCCGCGC AACAAGCAGA CGCTCTTGTT CAGCGCCACC ATACCGGAAG ACGTCAAGCA CGTCATTCGA CAAACCATGC GCGACCCCTA CATCACGGTG GATTGCATAC ACGACGATCA GGCCGAATCC TCCTCCCACA CCAACGCACA GGTATCGCAA GCTCACGTCA TTCTCCCGAC CAACACCCGC ATGGCATCCG GCACGGTAGA CATTATCCGG AACATTCTCG AAAAACAACC CCACTCGAAA ATTGTCGCCT TTTTCCCCAC CGCCAATCTT GTCGCCTTTT ACGCCTCGCT CCTACGGGAC GTCCTCGAAA TCCCCCGCAT TCTCGAAATA CACTCGCGCA AATCACAGTC CCAACGCGAA AAGGCCTCGG AGAGCTTCCG CAAAACCAAC CACGGCTGTT TGCTCACTTC CGATGTGAGT GCCCGTGGAG TAGACTACCC CGACGTTACG CACGTCTTGC AGTTCGGCGT GGCCGATTCC CGCGAATCAT ACATTCATCG CCTCGGACGG ACCGGACGCG CCGGTAAACT CGGACAGGGC ATCCTCGTCC TCACGGACGT CGAACGCGGC TTTCTACGGC ACCTGAAGGG ACTCGATATT CCTGTCCACC CGGAACTACA AGCCATTGTG GACGGGCCCA CGGTCGAGTC GCAGCAGGAC CTTGCGCCCG TCTGGGCATC GATCGGTTCG GGACGGAACG CGGATTTAGC CCTCAAGGCC ACCAAGGCCT ACGTTTCCGC ACTGGGATTC TACAACACCC ACCTCAAGGC TCGCTGTGGC GTCAAGGGTA CCGACGCTTT GGTCGCTTTT TGCAACGCCT TTGCCTACCA GGTGGGATTC ACGACGCTGC CCCCGATTGA GAAGAAAACA ATTGGCAAAA TGGGACTCAA GGGTATTCAA GGTTTGAACG TG
|
Protein sequence | MGFDRMTEIQ AKTFDAASLG KDVLGRARTG SGKTVAFLLP ALERLLQDNN NKSNKKSTRM LVLSPTRELA QQIAEQTRLL TAHMPNMSHQ VMVGGTPKPK DVSAMKRKVP TIIIATPGRL QDHLESTVVH NTPFKDLFRE LDVLVLDETD RLLDMGFRRE IDKIIKYLPR NKQTLLFSAT IPEDVKHVIR QTMRDPYITV DCIHDDQAES SSHTNAQVSQ AHVILPTNTR MASGTVDIIR NILEKQPHSK IVAFFPTANL VAFYASLLRD VLEIPRILEI HSRKSQSQRE KASESFRKTN HGCLLTSDVS ARGVDYPDVT HVLQFGVADS RESYIHRLGR TGRAGKLGQG ILVLTDVERG FLRHLKGLDI PVHPELQAIV DGPTVESQQD LAPVWASIGS GRNADLALKA TKAYVSALGF YNTHLKARCG VKGTDALVAF CNAFAYQVGF TTLPPIEKKT IGKMGLKGIQ GLNV
|
| |