Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37220 |
Symbol | |
ID | 7202178 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 420468 |
End bp | 421526 |
Gene Length | 1059 bp |
Protein Length | 330 aa |
Translation table | |
GC content | 59% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181205 |
Protein GI | 219121712 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00536264 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCCGT CTGCCAATTT TACCATCTCC GACTTTCCTC ACAAAGTCCT CAATCCAATC GCCACCAACA CCATTGCTCC CTCCTATGCG TCGCTTCTCC TGGCCCAATG CCAGCTCAGC GCCAATGCAT CTGCCATTCC CAGCCTCAAC GGCGGCGGCG CCCATGGTCA CATGGCTCTG ACGCTCACCG CCGCCGCATA CGCCGAACTG TCCGACGTCC CCTTCGTCAT CCCCATTGCT CCCCCTGCCG ACCCCGAACC GGGTACCATG CAACCCCAAA TTACGGAGAA CAATCAACTC CACAAACGCG CTGTGGCCAT CCACAGCCTC TACGTGGCCG TCAACAATGC CCTCTGACAC CAAATCCTCG ACGCCGTTCC TCGCGTCTAC ATCCGCGACT TAGAGCACCC CCAGTTCGCG TACAGCCACG TCACCTGCCT TGACCTCCTC GACCATCTCT GGCGCAACTT TGGTACCATC ACCGCTTCTG ACTTAAAAAG CAACATACAA TCCATGTATA CCCCCTGGAA CCCGGTTGAC CCCATCGAGA CCATTTTTCA CCGGCTCAAT GACGCAATTA CGTTTTCGAC GGCTGGCCGC GACCCTATCT CCGAACCAGC TGCCGTTCGC GCCGGCTATG ATGTTTTCGA GCATTCGGGC CTGTTCCCTC GCGCCTGCGA AACCTGGCGC ACAGCCTCGC CCGACACACA CACCCTCGCC AACCTCCGTA CGCTCTTCAA GGTCGCCAAT ACCGACCACA AGCGTACGCT TACCACCGGC TCCCTCGGCT ATGCCAACGT CCTTGCCGCA ACACCATCGG TTCTCCCGTC GCTTGCGCCA GACTCGCTCA GCCTTCCTTT TTCAGCCCTC TTGGTGTCCA ATTCCTCTGC TACTCTCTCG GAGAAAACTT ATTGCTGGAC CCATGGGTCC AGCAATAACC GTCGACACAC TAGTGCCACA TGCAAAAATA AGGCCCCCGG ACACAGCGAC GACGCGACGG CCACCAACAC CCTTGGCGGC TCCACCAAAG TTTGGACTGC CCCCAAGCCT CCCGAATAG
|
Protein sequence | MSPSANFTIS DFPHKVLNPI ATNTIAPSYA SLLLAQCQLS ANASAIPSLN GGGAHGHMAL TLTAAAYAEL SDVPFVIPIA PPADPEPGTM QPQITENNQL HKRAVAIHSL YVAHPQFAYS HVTCLDLLDH LWRNFGTITA SDLKSNIQSM YTPWNPVDPI ETIFHRLNDA ITFSTAGRDP ISEPAAVRAG YDVFEHSGLF PRACETWRTA SPDTHTLANL RTLFKVANTD HKRTLTTGSL GYANVLAATP SVLPSLAPDS LSLPFSALLV SNSSATLSEK TYCWTHGSSN NRRHTSATCK NKAPGHSDDA TATNTLGGST KVWTAPKPPE
|
| |