Gene Information |
Locus tag | PHATRDRAFT_26565 |
Symbol | |
ID | 7199825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 626185 |
End bp | 628039 |
Gene Length | 1855 bp |
Protein Length | 346 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178814 |
Protein GI | 219116038 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
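The Start bp, End bp and Gene Length fields above are consistent with 1-based, inclusive coordinates. A minimal sketch of that arithmetic follows; the coordinate convention is an assumption inferred from the listed values, and the helper name `span_length` is hypothetical and not part of this record.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Values copied from the Start bp and End bp fields of this record.
gene_length = span_length(626185, 628039)
print(gene_length)  # 1855, matching the Gene Length field
```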
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.460392 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCCAGACAGA GATGAAAGTA AGCACAAGAA CAAAACGAGC TGTCGACCAT GGCGCCCTCC GCGGAGAGCA TTGAAATGGT TGTTGAAGGA TATACTTCAA CAGACAAAAA TTACTCCATA ATTTCCCTAT ATACCTTTTT GGAAAAAAGG ATTTCCGAAG AGCAGCTTCA AAATATTCAG AGTGAACTCG AGGCTGTGCT ACAGCGATAC GATGCGCGAG GTACAGTGCT GGTAGCTTGC GAAGGAATCA ATGGTACCAT CAGCGTCCCG CATTGTTACC AACGAGACGT TTTATTGGAA CTCGACAATC ATTTTCCAAC TCTCCGAAGA CGGTTGTCCT ATCACGACGA ACACGTTTTC TTTCGGTTGC GCGTCCGCGT CAAACCCGAG ATCGTCACGA TGGGCTCACT TCCCTCCGAC ATGATTTTGG ATGTAGCAAA TGATGTTGGC GCATACGTCC CACCTGGACC GGAATGGGAT GCCCTTCTAC TAGATCCCGA GTGTGCGGTG ATCGATACCC GCAACGATTA CGAAATCAAA ATTGGTACAT TTCACAACGC CATCAATCCT CATACAAATT CTTTTACGGA GTTTCCAGCG CAGCTGAAAA TCATTATCCA GGAGAGAAAA CCCAAAAAGA TAGCGATGTT TTGTACGGGT GGAATCCGGT GCGAAAAGGC AACATCTTAT GCCTTGCAAC TTATTGCAAC GGATCCAGAT ATTCCCAATA TTCCCGTGTA CCATCTTGAG GGAGGTATCT TGGCATATCT GCAAATCGTG CCACCTCTCC ATTCCTCTTT TCGAGGCGAG TGCTATGTAT TCGACCGACG GACCGCCCTC TCTCACGGAC TGAATCCATC TAGCTGCTAC CAGACTTGCC ACGTTTGTCG ACATCCCTTG ACGACGGAGG ATCGTGCTGG TGTCACTTTT CAGGAGGGAA TCTCTTGTAT GTATTGTGCA GGTGACTCTT CGCGCAACCG AGACCGTCAT GCTGAGCGTC AGCGTCAGTT GGACCTTTCA AAAGATAAGA TGCATCTTCA GCAGCAACGC GGTAGACCCC GCAGCAAGAA CGAATCCATC ACAGTCTAAA GACGTAAGGA AAACAGGACG CAGTAAACAT ATAGACTGCA TACTATAAAT CATTACCGTT TGTCCATTAC AATTCGCGGG TTCCTGTCAC TCCCGATATA CCTTGTGGCA GGTTACGCGA AAGCCGATCA CGAATTTCCT GGGCCGGAAA GGCGCTAAAT CGTGAGGCCG AGCTCTGAAA AGCTTTACCG TCCACATCCG TGCCAGTCCA AACCACATAA AGTATGATAT CACATTCGGT ACCTGAGTTT CCCACGATAG TGTTGCATGT TTCCTCCGGG ACGAAACAAC CTCCGGTCGG CTGCCCGGAC GTCGCCCGGG TCTGCGGAAT TCCATTGAAA TCAAAAGTTA TCTCGTCACA CTCGCCGCCA CCGCAAAACA CGCAAGCATC GTCCCATGCA ATACCAGTCA CACGTCCCGC GTCGACATTA ATAATTGCCG TCAAAAAGGG ATAGATGCGC GTCATTTGAC GATCGTAGTA GACGCGCAAA AATGACAGGG CGTTTTGAGA GACGACACCT AGAGTTATGT TGGCTTCGTC AAAAATAGAG TCATCCGAAA ATGGAACGAG ATTCTGGTAC GCTGATAAGG CCAATAAGAA AGTTCCGTTA GGTAAGTTGT TTCGATCACT GGAAGCACAA TATTCTTAGC CTTCCAACTA GCTTCCGGAT GACTCGTCAA GTTACCTACA GTTTGGCACG TGAAAGCGTG 
AATACTTATC TGCTTCGGGA GAAAATACGA CACGTGTGTA CTGAACTCCT GCAGT
|
Protein sequence | MAPSAESIEM VVEGYTSTDK NYSIISLYTF LEKRISEEQL QNIQSELEAV LQRYDARGTV LVACEGINGT ISVPHCYQRD VLLELDNHFP TLRRRLSYHD EHVFFRLRVR VKPEIVTMGS LPSDMILDVA NDVGAYVPPG PEWDALLLDP ECAVIDTRND YEIKIGTFHN AINPHTNSFT EFPAQLKIII QERKPKKIAM FCTGGIRCEK ATSYALQLIA TDPDIPNIPV YHLEGGILAY LQIVPPLHSS FRGECYVFDR RTALSHGLNP SSCYQTCHVC RHPLTTEDRA GVTFQEGISC MYCAGDSSRN RDRHAERQRQ LDLSKDKMHL QQQRGRPRSK NESITV
|
| |
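The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of stripping that grouping before counting residues or estimating GC content follows; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only short prefixes of the listed sequences are used here for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First three blocks of the protein sequence in this record.
protein_prefix = clean_sequence("MAPSAESIEM VVEGYTSTDK NYSIISLYTF")
print(len(protein_prefix))  # 30

# First two blocks of the gene sequence in this record.
dna_prefix = clean_sequence("TCCAGACAGA GATGAAAGTA")
print(gc_fraction(dna_prefix))  # 0.4
```

Applied to the full sequences, the same helpers could be used to compare against the Protein Length and GC content fields.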