Gene Information |
Locus tag | PHATRDRAFT_47970 |
Symbol | |
ID | 7203221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 590423 |
End bp | 592327 |
Gene Length | 1905 bp |
Protein Length | 322 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182257 |
Protein GI | 219123907 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
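The Gene Length entry agrees with the Start bp and End bp entries if both coordinates are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption. The minimal Python sketch below checks the arithmetic and compares the gene span with the number of nucleotides needed to encode the listed protein. The function names `span_length` and `coding_length` are hypothetical helpers, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


# Values copied from the Gene Information table above.
gene_len = span_length(590423, 592327)
cds_len = coding_length(322)

print(gene_len, cds_len, gene_len - cds_len)  # prints: 1905 969 936
```

The 936 bp difference suggests that the listed gene span covers more than the open reading frame alone, for example flanking non-coding sequence.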
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000220023 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTAATCTTTC ATTGTGTTTC ACCATGACGA CCCCCCGTTT GTGTTTGACG CTCGTCCTTA CGTACGGCTA TATTATTCCC TTGACGACAG CATGGACGAG CCCGTTGCCA CTTTTTGGAA GAGATACACG CCACGGCCTT CGGTGTTTGG ATGCGCTGCC CCCAGCCGCC GTTGAGGAGC TCGGTGAGGC ACCGACGGCT CTTTCGCGCC GGGACTGGTG CCGACGCGCA TTCGGGGCCG CCAGCGCACT CGGGATTACC GCTATTGTCC AGTACCCCAA TCCTTCCCAC GCGTTGGTCA AGGGGTCCGA ACCGCCACCG CGTACCAAAA TGAACGACGG CAAACCCAAG TGTACCAACG TGGACGAGTG CCAAGCCCAA GCCGAGCTGC GCGAACAGCA GCTCCGAGAG GAGGCCGCCG CCAACGCTGT CCCCATTAGC ACGACTTCCG GAGGTATCCG CTTTCGGGAT CTTATAGTTG GCGACGGGAC AACGGCGAAA GCCGGCGACG AAGTTGTTCT GCACTACAAA GTATTGAAAC TCGGTAAACG CAGTTACGAC GGCATTTCGG GAGAAGGCAC CGTCGTTTTT TCGAGAGGGT ACGGGCTAGA AGACGACGAA GCTAAACCCG GTGACAAGAA TTTTGTCACG ACGCTAGGCT CCCTCAGCAA CATTGGTGCC GTTAACGATG CCGTTCCCGG CATGCAGACG GGTGGGACCC GTCGCTTTGC GATTCTACCA CCCCAGGGAT GGCGCAAACC GTCCAAGATG TGTGACGGTG GACCGGGTGG TAGTGGTTCG GGTGGAGACT TGCGGACGGA CTACGTTATT GTACCTACCG CCACCATGGT GGACGCCGAA GTCTGTTTTG ATCAATCGAA ACAACCCTTT CCGACATCGT ACGGACAACA ACGCCGAATG GCGCAACGCT TCGATCAGAG TCTCATTATG GAAGTACAGC TAGTTGCGGT CAAGGCGGGA GGGAGCCTAT AAATTGCTAA GTGTAAACAG TTTATAGACG CTACTGTAAG CAAAGCGAAA AGCTGTGTGT TCGCCGGACC GTGTTCGAGG GGTAGCTTGG AGATCCACAG TCAAAAACTA CGAGGTGGAC TCCAATACCC GGAGAAACAG GTCCGTCTGT CCTCCGAGGG ATTCAAAACC CAATCAGGTC GCCGGAAACA GAGTGACAAC TGGAAAGCCA GACACACAAC CGTGGTTTTG CGCCATTGAC AGTGAGGAAT ATTACTGTTA CTGTTAGTGC AGTACCGACC GACCGAAACA CAAAATCGAA AACTACCCGA GTACAGCACA TCGTCTATGA TAAACCTAGG TGCTGCTAGC ACACATACTT AACTTACTCA CACACCTATA CCGAGCGGCA AACACGTTGA AATTCAGCCG ACGCTTTAAG CCGAGACTCA AGCATGTACA AACTGCCTAT AATGGCCGAC GAGAGGAGGA ATATCCGAGG CCGACAACGG TACAGCCGTC GTCGTCCATC TTTCTCATCG TGGCTTACGA ATCCGTCTCT TGCCGTATTA TTCTTGTTTG GAATGATGTT CTTATTCACT GGCAATGGAA TGCTTTGCGC GACGGCGATA GAGGCAACGT GGACCCCCAA CGAGGATGGA GACGCTCCAC TGCCCTTGTC CAACAATCAA CGACAACAGT TACTTCAACT GGAAGACACC ATTCGGAGTG CTCCCGACCC GCAGGCTACT TTGATGCAAG TAGCTCAATC CAACGAAATG GACCCACAAG AATTGGTAAA TTTGCTCAAT CAGAATCGAA GGGAAATGCA 
AGCCGCGGGA GCGTTACAAC CGCCCGGAAC GTCGGGACGG AATATTTGGA AGCTTGTTTC TAGTATGGGA GTGGTGCTCG TACAGTCGGC ATCCAAACAT CCGAG
|
Protein sequence | MTTPRLCLTL VLTYGYIIPL TTAWTSPLPL FGRDTRHGLR CLDALPPAAV EELGEAPTAL SRRDWCRRAF GAASALGITA IVQYPNPSHA LVKGSEPPPR TKMNDGKPKC TNVDECQAQA ELREQQLREE AAANAVPIST TSGGIRFRDL IVGDGTTAKA GDEVVLHYKV LKLGKRSYDG ISGEGTVVFS RGYGLEDDEA KPGDKNFVTT LGSLSNIGAV NDAVPGMQTG GTRRFAILPP QGWRKPSKMC DGGPGGSGSG GDLRTDYVIV PTATMVDAEV CFDQSKQPFP TSYGQQRRMA QRFDQSLIME VQLVAVKAGG SL
|
| |
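The sequences above are grouped in blocks of ten characters, so the spaces must be stripped before any computation. The sketch below does that, then computes GC content and translates from the first ATG to the first in-frame stop codon. The Translation table field in this record is empty, so the standard genetic code is an assumption here. Taking the first ATG as the start is also a simplification for illustration, not the method used to annotate this gene. All function names are hypothetical.

```python
from itertools import product

# Standard genetic code (assumed; the record leaves "Translation table" blank).
# Codons are enumerated in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace used for ten-character grouping and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate_first_orf(seq: str) -> str:
    """Translate from the first ATG until a stop codon or the end of the sequence."""
    s = clean(seq)
    start = s.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five blocks of the gene sequence listed above.
prefix = "TTAATCTTTC ATTGTGTTTC ACCATGACGA CCCCCCGTTT GTGTTTGACG"
print(translate_first_orf(prefix))  # prints: MTTPRLCLT
```

The translated prefix matches the first nine residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` and `translate_first_orf` would allow the GC content and Protein Length fields to be checked the same way.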