Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47886 |
Symbol | |
ID | 7203154 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 343064 |
End bp | 345099 |
Gene Length | 2036 bp |
Protein Length | 528 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182376 |
Protein GI | 219124155 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0086312 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATGTTTGGA CTTGATGGTT TTGGTGGCTC CCAAAGTGGG TTCGCGGTGG ACGGAGACAA AACTGGAGTA GTGATTGTAG AAGATTGTGT TATGTCTAGA ACAATTGCTG GCCTGTTAGT GGCCTATACT ATGCTCTTTT CCAATTGGTG AATACCAACC TGAAGCAGAC AAAGAGCTCC AGACAGTAAT TCTTAGACTA TTTTTTCTTT CCATGTTACT AGAACCATGG AATGGCTACC ATGGTTGCAC TGGCAAAGGA AAGTAGCGAC GGCTGACTTA CGTACCATTT GTGGCATCTG ATGACATGCC AAAGTGTCGC TTGTGGCTTT GAACACAATT CTAACAACAA TCGCTCTGAC TAGGCCATAG CTCTTGTGGC TATTGTCACC GTACTTTCAC AATTGGTGCC AGAAAATCTC AAACTGTCGT CGAAGTGAGA TTAACAGTAA TGGACATGAA CAATGATAGT TTTCAACCGT TGTCCAAGCG CCAGAAGACA TCTGATGTCG AGAAGAGCGA CGAAGACCTT TCAAATTTAG TTCATGGTGA TGAGTTTGCT CTTGTAGGCC GAGAACAAGA GCTGGATGCT CGGCGAAGAC GCGCAGACAG AAAACTACGG CTTCACAGTG CTCGGCAAAA AGAATCAGGT GAAGGAACAG ACAGAAACGA CGACACGGAC AAGGATTTCT TATTGCCTGT GAGATATGCT CAAAATGATC AGCTCCAGGC AACAGCAACT TTAAGCAACA AGGACATGGA AACAGTTTAT CAGCCGGTCG ACAAGGACAA TTTTGACATG TTTTCCAGCT CTCCGTCTCC GCAGGACAAT ATGGATTACC AAACGAACTC AGCGACATCC AAGTCGAAGC GAGGAAATGA ACAAGGGGAT TGGGACGACA GCGAAGGATA TTACAAAGCT GTAATTGGTG AAATTATTCG TTTAGAACAA ACGATAGACT CCGCTTCTGG CAATTCCAGC AGATCAGAGA TAAGCTTCCG AATTTCAGGA GTTGTTGGGA AAGGGGTCTT CTCCACCGTT CTCAAATGTA CAACTGTAAG CAATAGTAGC AGTATCCAGC TTCCTCCTAC AGTTGCCTTG AAGTTCATCC GGCACAACGA CACGATGGCG AAGGCCGCAT TGAACGAGGT GCAAACTTTA CAGCGCTTAA AGGGATGCGA CGGTATTGTT CCATTGCTGT TACCACTTAC AGAAACTCCG ATGGAACACC GAGGACATGT TGTCTTGGCG TTTTCATGTA TGGAATACAA TCTCCGAGAC GTGCTTCAGA AATTCGGGAA AGGTGTCGGT CTTTCACTAC AAGCAGTTCG ATCATATTTC GGCCAGCTTC TGGCTGCCGC AACGCATCTA AAGAAGAACA ACATAATACA CGCAGATTTG AAGCCGGATA ACATTCTCGT AAGCGCCGAT TTTTCTTTTG TTCAGCTTGC GGATTTCGGT TCAGCTATTG ATGCATCGGA GTCCCAACAA AACCAGCCGA CGCCGTACCT GGTATCACGT TTTTACCGCG CGCCCGAAAT AATTCTTGGT TTGACTCCGA CATTTGCGGT TGATTTGTGG AGCTTGTCGG TCACGGCAGC CGAGTTGTTT CTAGGAGAAG TTTTGTTCAA GGGGTCTTCA AACAATGACA TGCTCTACAG CTTTATGCAG CACATGGGGC CAATTTCCAA TCGCATTATC CGTCAGCATT TGGCAGGATG CCAGCGCTTT CCAATTTCGA AACAATTTTC TCAGGAAGGA GCAAGCTTCC TTTTTAAGCA GCAGACAACA GATCCCGTGA CTGGTCGGCA TGTACACAGG ATGTTGTCGC TTGCAACCTC AAGTAACGGA GGGAGGTTTC CGTCGGCCAC TCCTTTACAT CGCGTGTTGT TGAGGGCAAA GTCCACAAAA GACAATCGCA TTGTGGTCAA TCGATTTTCA GATCTTCTAG TGGGATGCCT CAGTCTGGAT CCGTCCAGAA GGATGAGTTT AAAAGAAACT TTGCAGCACT CTTTCTTCCA GCTTGAAAAC TCGTAA
|
Protein sequence | MDMNNDSFQP LSKRQKTSDV EKSDEDLSNL VHGDEFALVG REQELDARRR RADRKLRLHS ARQKESGEGT DRNDDTDKDF LLPVRYAQND QLQATATLSN KDMETVYQPV DKDNFDMFSS SPSPQDNMDY QTNSATSKSK RGNEQGDWDD SEGYYKAVIG EIIRLEQTID SASGNSSRSE ISFRISGVVG KGVFSTVLKC TTVSNSSSIQ LPPTVALKFI RHNDTMAKAA LNEVQTLQRL KGCDGIVPLL LPLTETPMEH RGHVVLAFSC MEYNLRDVLQ KFGKGVGLSL QAVRSYFGQL LAAATHLKKN NIIHADLKPD NILVSADFSF VQLADFGSAI DASESQQNQP TPYLVSRFYR APEIILGLTP TFAVDLWSLS VTAAELFLGE VLFKGSSNND MLYSFMQHMG PISNRIIRQH LAGCQRFPIS KQFSQEGASF LFKQQTTDPV TGRHVHRMLS LATSSNGGRF PSATPLHRVL LRAKSTKDNR IVVNRFSDLL VGCLSLDPSR RMSLKETLQH SFFQLENS
|
| |