Gene Information |
Locus tag | PHATRDRAFT_35856 |
Symbol | |
ID | 7201059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 911112 |
End bp | 912242 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180344 |
Protein GI | 219119155 |
COG category | |
COG ID | |
TIGRFAM ID | |
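
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon; the record does not state either convention. The helper names `gene_length` and `protein_length` are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length(911112, 912242)
print(length)                   # 1131, matching "Gene Length"
print(protein_length(length))   # 376, matching "Protein Length"
```

Because the record lists the gene as unspliced, no intron positions need to be subtracted before dividing by three.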
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00746929 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGTAC GCTCGTTTGG ATCGCTGGTG CGGTTGGTAG GTATAAGCTT GTCGGCGTTG GTCTTGGCGA CTCTCGGAAA GTTTCAGAGC GAGTACGCGC TTCTAAATGT TGATGAGATG CTCGCCAATG TGTTGCAGCG ACAGTCGTCG TCGTCGGAAA CGTGGAGCAA TGTAGATACA GAGGCCGAAG TCAATCTGAC AGTAGGCGCT GCCCTACCTC CTTCGACAGA AAATGTCGTA GAGAAAGCAG CGCCAGCAAA CAAGTCCATC CAAGTCTCGC CATCGTCCGT ACCAGACCAC GCATTTCCAT CTACGGTCCC GGCCAGTATC AACAATGACG ACTCCATGCA GAAGCGGAAG AAAGGCATGT ACTCCAATGC TCGAACGGAC AGATCAGGGT CGGTAATCCA AGACATGTTG GCTGCGCATT CGTATGCTTT CCACCATAAT ATGACGTATC CCGGCGCTTG TTGGACTGAT TCCAAGGCTC CCAACCAAGC TCGCATCAAG ACAAATAAGC AGCTGTTTGC TGCTATTGGC CTCGAAGACG AACTCACCTA CGCCTGCCCC ACGAACATCA GTGACATAGT CAAACGAGGT CGTTACGCAA ACGTTGATCG TCGGTGGTCT CGAGAATGGC TAGCATTTAT TCGGTCCAGG GTAAAGTATC CCGAGAAAAA TGTTTCCGCT GGTCACCAGA CGGCAGTGCA TATTCGCCGT GGGGATGTCA TACCGTGCCC CAAAAATGGA TTGCTGAAAC GATACCGATA CTTGCCGAAC TCGTATTACC ATGCTGTGAT TGATACGTAT GTTCCCTCCA ATAGCACCGT CACAATTTAC TCGGAAGAGG AGTCTTACGA GCCCTGGGAC AATTTCAGGC AATACAACTT GCGCCTGAGT GCAAGTTTGG TGGACACGTG GCGAGATATG ATGATGGCGG GCACTCTCAT CCTTTCCAGG AGTACTTTTT CCCTTGTCCC TGCTCTTTTG AATAGGCACG GGACTGTATG GTATGCCCCG TTTGGGACGC CCAAGGTTGA TGGCTGGGAA GTAGTCCCAG ATAATATCAC AGCAATGGCT GACCGTGACT CTGCCAAACT GAGAAAAAAG GACGCCTGTG TCGCAAAATG A
|
Protein sequence | MGVRSFGSLV RLVGISLSAL VLATLGKFQS EYALLNVDEM LANVLQRQSS SSETWSNVDT EAEVNLTVGA ALPPSTENVV EKAAPANKSI QVSPSSVPDH AFPSTVPASI NNDDSMQKRK KGMYSNARTD RSGSVIQDML AAHSYAFHHN MTYPGACWTD SKAPNQARIK TNKQLFAAIG LEDELTYACP TNISDIVKRG RYANVDRRWS REWLAFIRSR VKYPEKNVSA GHQTAVHIRR GDVIPCPKNG LLKRYRYLPN SYYHAVIDTY VPSNSTVTIY SEEESYEPWD NFRQYNLRLS ASLVDTWRDM MMAGTLILSR STFSLVPALL NRHGTVWYAP FGTPKVDGWE VVPDNITAMA DRDSAKLRKK DACVAK
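
The sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch of how such a field might be cleaned before analysis, the code below joins the blocks and computes a GC percentage. The function names `clean_sequence` and `gc_percent` are hypothetical, and the demonstration uses only the first two blocks of the gene sequence, so its result is not expected to match the 51% reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence in this record.
prefix = clean_sequence("ATGGGTGTAC GCTCGTTTGG")
print(len(prefix))               # 20
print(prefix.startswith("ATG"))  # True: the CDS begins with a start codon
print(gc_percent(prefix))        # 55.0 for this 20-base prefix only
```

Applying `gc_percent` to the full cleaned gene sequence would be the way to compare against the GC content field; the record does not say how that figure was rounded.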
|
| |