Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45839 |
Symbol | |
ID | 7200951 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 441739 |
End bp | 443778 |
Gene Length | 2040 bp |
Protein Length | 563 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180047 |
Protein GI | 219118554 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACCGGGCGAA GCGATGGACA TACATTATTT CGCCTCGTAA AGATAGCACT CCCGTTTTTC ATTCACATTC GCATCGCGTC CACACGATCA CGAGTCGAAT TTGAAAGTGT TGGAAAAGCA GCACTGTACG TCACGCAGTT CCTCGAGGCA TGGTTTCCCA GGAATATGAG ACATTCGCCG ACAAGCTGCC TCAGGCGAAC GAGAACATAT GTGACTCTGC GGATCCTGCG GCGTCTCGTC GAAATTCATC AGAGGAAGAA TTGTCCGACG ATTTGCCTGC GGATGACCGA ATTGAAAGGA GAGTACGGCA AGGAGCAATT TTGCCCTTGT CCATGAGACT CGTGATGCAA ATCTCTGGTA TTGGCTGTTG CCAGGAATGC ACCGGAAGCA TGGCTCTTCC TGGCAGCAAT TCCATCAACA GAAACGGAGA TGACGCTAAT CACGAAGAGA CTGCTCTACT GGGCGACTAC GATTTGCGTG CCCTGCGCCG TCAAAAGTCG GCCATCTCCC CCCTGGTAGT TGGACCCATG ATGATGCCAG GGAATATCGA CAAAGAAAAT TCACTTTTGG AAAGAATTAT TTCAGAGTAT ATTGCCTCCT GCCGCTTTTA TGGCTGTACG GACCGGGTCA ACGCAGGTGT GCTGACTACT TTACGGTTCT CCTTACCCAG TTTACGGGTC AGCGGAAGTT TCCACGATGC TGATATGCTA GCTTTGGCCG AAATCATGAT TAAGTACGGG AACGGTGCTT TGCGCTATAT CAAACGATTA GACTTTTCGC TGAGCTCCAA AGAAGGAAAG CTGAACGGCA AGGCAGGATT TCGTTCTCAC GGGGCCTTTA CGCTGGCAAA AATACTTCAA ATTTCTGACT ACATTGAAGA AGTTTTCGTT GACAGGAATC GTCTGGGTCC GTACGGTGCC TCTGCTTTAT TCATCGCCTG TTCTTCAAAT TCATCCCTCA AAAGGTTGCT GATGAGACGT TGTCGCGTTC GAGAACGAGG GGCTTTGGCC TTTGCAGAGC TAATATCGAC GAGCTCGGAA TGCGGCTTGG CGGAAGTAGA CCTTAGTGCG AATGGAATCG GTTTCAAAGG AAGTGTGTCC ATTGAGCGTG CAATTGTGGA ACGCAACCAG AACTCTGATT TACCAGCTTT AATTGTGAAC ATGGAAGGCA ATCTAGTACT CCAAGAAGTA CTCAATGGCG TGACTCATGG ATTGGGAATC ATTCTGGCTT TGATGGGCTC GTCACTTCTG TCCAGCAGCG TCCGACACCA ACCATCCCGT CATGTCGTCA GCTGTGCAGT ATACTCGACC AGCCTGATTG TTCTGTATAC CAGTTCTACT CTGTATCATT CGTTTTTTAT GTGAGTATTG CCAAGGTTGC CGAGCTTCTG CTTGTTTGAG AAATCCCGAT TGTCTGAGGC TGGATTCTCT CATACCTTGC TTTGCTTATA GCTTGCAAAA CACCAAGTAC ATTTTCGAGG TCATGGACAA GTGCGCCATT TATATACTCA TTGCTGGTAG CTATACCCCA TTCCTCCAGA TCGTCCTGCA GCACGACCCA TTGTGGGCCA ATGGGCTCTT GGCCTTCATT TGGTGTTGCT GTCTGCTGGG AATCTCAGTC GAAGCATTCT TTCCAACCTA CAAACACAAG GGTCTGTTTT CGCTCGCTAT GTATCTGGGC ATGGGATGGT GCTGTATTGT TTGCTTACCA GAATTTTCCC GCATCGTTCC TACTCGCTTG ATCCACCTCG TGATTCTGGG TGGCGCCGCC TACACTCTGG GTGTCCCCTT CTTCGTTCGC AACAACCATC TCGACCACTG TATTTGGCAC ATTTTCGTCA TGGTGGGCAG TATATTCCAC TGGTGTGGCA TTTACTTTTA CGTCGCCACA TTTGATGACG AGTTGGCGTT GGCAAATGCT TCTTCGTCCT AACAGCAAGA TGAAACACAG GCCTGCCGGT TTCTCACTCG GAGCAAGAGC TGCACGATGC AGGCAACATA TGTTCGAGCA AAGATAAAAT CTAGTCCAAA AAATAAGGTT
|
Protein sequence | MVSQEYETFA DKLPQANENI CDSADPAASR RNSSEEELSD DLPADDRIER RVRQGAILPL SMRLVMQISG IGCCQECTGS MALPGSNSIN RNGDDANHEE TALLGDYDLR ALRRQKSAIS PLVVGPMMMP GNIDKENSLL ERIISEYIAS CRFYGCTDRV NAGVLTTLRF SLPSLRVSGS FHDADMLALA EIMIKYGNGA LRYIKRLDFS LSSKEGKLNG KAGFRSHGAF TLAKILQISD YIEEVFVDRN RLGPYGASAL FIACSSNSSL KRLLMRRCRV RERGALAFAE LISTSSECGL AEVDLSANGI GFKGSVSIER AIVERNQNSD LPALIVNMEG NLVLQEVLNG VTHGLGIILA LMGSSLLSSS VRHQPSRHVV SCAVYSTSLI VLYTSSTLYH SFFILQNTKY IFEVMDKCAI YILIAGSYTP FLQIVLQHDP LWANGLLAFI WCCCLLGISV EAFFPTYKHK GLFSLAMYLG MGWCCIVCLP EFSRIVPTRL IHLVILGGAA YTLGVPFFVR NNHLDHCIWH IFVMVGSIFH WCGIYFYVAT FDDELALANA SSS
|
| |