Gene Information |
Locus tag | PHATRDRAFT_46031 |
Symbol | |
ID | 7201521 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | + |
Start bp | 39001 |
End bp | 40155 |
Gene Length | 1155 bp |
Protein Length | 359 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180364 |
Protein GI | 219119198 |
COG category | |
COG ID | |
TIGRFAM ID | |
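The coordinate fields above agree with each other if Start bp and End bp are read as 1-based, inclusive positions on the replicon. That convention is an assumption here, since the record does not state it. A minimal sketch of the check, using a hypothetical helper named `span_length`:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Start bp and End bp copied from the record above.
gene_length = span_length(39001, 40155)
print(gene_length)  # 1155, the value in the Gene Length field
```

Under a 0-based, half-open convention the same coordinates would give 1154, so the Gene Length field fits the inclusive reading.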
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00169986 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTAGCCCAAC ATATTACTCC CTTCGAAAAA CAAACCGTGT GTCTAAATTC AATCTATGCG ATCTCGGGGC TTCCGATGAA AAGCTTTGTG TCCAAGGCAG TTTTCTTGTT TGCTTCACTC CGACGACAGC GCGCGGGTGC CTGGACTTCG CCACTTTCGG TGCGCCGACG ATGCCCACGA GGCCACGGAT CCTATCCTAC GCAGTGTCTG TCACACGCCA GTCTACTGTC GGTAGCGGAA TGTCTAGAAT TGTACCATAA TTCCACACGT AGCGGGGAAC GTAGCATCCG CTTTGTCGAC GGTTCTTGGT ATCACAAGGG AAATCGAAAT GGCTTGTTCG AATTCCTGAA CGGACCCCGT CTACCCGACA GTGTTTACAT GGACATGGAC GACATTAGTT GCCAGACCGA CCTTTTTCCA ACTCTCAATC CCTCCGATCT GTACCTGATG CAGCCACCGC GAGCCTTGTT GAGTGCGTGG ATGGACTTTT ACAAAATTCG TCGTACAGAT CAAGTGATTG TGTACGGACG TTCCGGCAGT GTCTTTTTGC CTCGCACTTG GTTTACCTTG CACGCTGTGT TAAGTCACGT CAGGGTAAGT ATAATGCAAG GCAGTTTAGA AGACTGGATG CGCGCGGGTG GTCCACTAGA TGAAGGAGTA TTGGAACAGT CCAGCAGCGT AGTTAGGGCG GCTGATCTCG ACTGGGAACA ACCGACCAGG TACGACAGCA ACAGCCAGTC ACCGAGCGAA CAGGCCGTGG TCAGCATCGT GGACGCGAAC TACATGCTTT ATGTAATAGG GGACAACAAG TGCAGTACGA AGATCTTGGA CGCGCGGGGT TCCAGTTTTG CAGCGGGTCA CATGCCCGGT GCGGTCCACA TTCCGTACAG TAGTTTGTTG GTAGATCCGA CCAGCGGAAG TCAATACAAA CCGGCTGAAG AAATGCGGAA GATATTTCTT GCGCAAGGTG TGGATCCCAC AGCGAATACT CCTCTTGTGT GTTCGTGTGG TAGCGGCGTT TCGGCGTGTA GTCTCTATTT GGCGCTACAC GTTTGTGGAC GTTCTCCGGA GCAAAGCACC AAGGTGTACG ACGGCAGCTG GAATGAGTGG AAAACGCTGC CTTACACACC GAAAGAACAA GTGCCGAAAA AGTGA
Protein sequence | MKSFVSKAVF LFASLRRQRA GAWTSPLSVR RRCPRGHGSY PTQCLSHASL LSVAECLELY HNSTRSGERS IRFVDGSWYH KGNRNGLFEF LNGPRLPDSV YMDMDDISCQ TDLFPTLNPS DLYLMQPPRA LLSAWMDFYK IRRTDQVIVY GRSGSVFLPR TWFTLHAVLS HVRVSIMQGS LEDWMRAGGP LDEGVLEQSS SVVRAADLDW EQPTRYDSNS QSPSEQAVVS IVDANYMLYV IGDNKCSTKI LDARGSSFAA GHMPGAVHIP YSSLLVDPTS GSQYKPAEEM RKIFLAQGVD PTANTPLVCS CGSGVSACSL YLALHVCGRS PEQSTKVYDG SWNEWKTLPY TPKEQVPKK
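Both sequences are printed in space-separated blocks of ten characters, so the spaces have to be removed before the strings can be used. The sketch below does that and then shows how the 1155 bp gene length relates to the 359 aa protein length. The helper name `ungroup` and the constant `CDS_OFFSET` are hypothetical. The offset of 75 bases is read off the listed gene sequence, where the ATG that encodes the leading M of MKSFVSK begins at base 76. It is not a field in the record.

```python
def ungroup(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


# First nine blocks of the gene sequence, copied from the record.
gene_prefix = ungroup(
    "GTAGCCCAAC ATATTACTCC CTTCGAAAAA CAAACCGTGT GTCTAAATTC "
    "AATCTATGCG ATCTCGGGGC TTCCGATGAA AAGCTTTGTG"
)

# Assumption: translation starts at the ATG located 75 bases into the listing
# (0-based index 75), an observation from the sequence, not a database field.
CDS_OFFSET = 75
start_codon = gene_prefix[CDS_OFFSET:CDS_OFFSET + 3]

GENE_LENGTH = 1155                       # Gene Length field
coding_bases = GENE_LENGTH - CDS_OFFSET  # bases from the ATG to the end
codons = coding_bases // 3
protein_length = codons - 1              # the final codon (TGA) is a stop

print(start_codon, coding_bases % 3, protein_length)  # ATG 0 359
```

Note that a plain search for the first ATG would land on an earlier, out-of-frame ATG inside the block AATCTATGCG, which is why the offset is fixed from the protein's first residues and not searched for.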