Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45207 |
Symbol | |
ID | 7200236 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 482729 |
End bp | 484201 |
Gene Length | 1473 bp |
Protein Length | 448 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179223 |
Protein GI | 219116857 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.1729 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTCCAAGAA TGAGTCTCTG CGTTTGTTTC CAGATTAAAT AGATTTGCAC CTTCAAGCAT GAGCATTTGG AAGGGAGCTG CAGGTTTGAC GCTTTCATGC TTGCTTATTT CGACCTATGG GTTTACAATA CCATTGCCAA AGGCGGGCCT GCATTGGACC ATGCTTCGTA CCGATCTGAA TGTTCCACGT GGGAAGACCA TAAGCGAACG AATGCATGGA CGCAAATCAT CCATCGATGG ATTTCATAGT TTATTTAGTT CGAAGAAGCT CGAGGATTGG GAATCCGTCG ATCAAAGACA TGAGACGTCA CAATCTGAAG ATTCTCCAAA AACAGAAGGC CAAAATTCAG CTTTCGATGA ATACAATACG TGGTGCCGGT GTATCGATGA GACAGTGGCG TCATTGGTCA AAAAGCAGAA CGCTCTCGAG AAAGAGCTGG AAAAAGCAAA CGACGTTGAG CAGACAGTTA TTCGAGCTCA GCTTCTCACA GCCAACTTGT TCAAGTTTAA AGGTGGAGTC ACGTGCGCTG CAGTTCAAGA CTGGCATAAC AATGGCACTA AATTGGAATT GATTCTAAAC CCTCGGTACA AAACTGCGGC CCTTGAAGCT GATGCTCTCT TCAAGCGGGC ACGCAAATTG AAACGAGGAT CAGCTGTGAT TTCCGACTTA CTCGACGAAA CACACAACGC CTTGACTTTT TTATCAGAGA TCAGAACGGA TTTCGGCACC ATAGAACACG GTGATAGCTC AAAGATTGAC GTTGACATGC TTAAATTGTT TCAGGATCGA TTGCTTCAGT CCTCGAGAAG AACTGGCTTC AAAGCCCCCA TGGAAGACTC GTCTACCGTC TCCACAGCAG GCACCATGAA CCGTGCGAGG AAAGCTGCGG TTGGAACGCC TGCTTCGAAT CTGCGAAAGC TGACTTCACC AGCAGGCTGC ACAATTGTTG TTGGGAGAAA TCGAAGAGGG AATGAATATT TGACTTTCAG CATGGCTCGA GGCAATGATA TTTGGTTGCA GTAAGTTCCG TTTTGATGCT ATTATTTTTC ATTCATTGAC TAATTGAGGT GATTGTCTCT CGATTTAGCT CCCGAGGTTG TCCCGGTGCC CATGTACTCA TTCAACAGCG TCGCGGGAGT GCACAAGTGA CAGAAGAATG CCTCCAGCTA GCTGCAGACT TGGCTGTATT TTATAGTGAT GCGCGTAGGG AGCAGCGAGC GGATGTAATG GCCGCGGAAC CCAAGCACAT ACTAAAGCCC CGCGGGTCTC CATTAGGTAC AGTCAAGGTT CGGGAAGAAT GGAAGGTGCT AGCTGGATTT CCCGACAATG TCCCACGCGA GCTCAAAGAA GCCCGCGAAG AGTCAGGCCA GCTGGAAGAA TACCGTGCGG CCAACAAGGC AAAGCATCGC AAGCGGAACA AGGAAGCTAC GAAACAGCAG AAGGCGAAAG AACGGTCAAA AACAACCAAC TGA
|
Protein sequence | MSIWKGAAGL TLSCLLISTY GFTIPLPKAG LHWTMLRTDL NVPRGKTISE RMHGRKSSID GFHSLFSSKK LEDWESVDQR HETSQSEDSP KTEGQNSAFD EYNTWCRCID ETVASLVKKQ NALEKELEKA NDVEQTVIRA QLLTANLFKF KGGVTCAAVQ DWHNNGTKLE LILNPRYKTA ALEADALFKR ARKLKRGSAV ISDLLDETHN ALTFLSEIRT DFGTIEHGDS SKIDVDMLKL FQDRLLQSSR RTGFKAPMED SSTVSTAGTM NRARKAAVGT PASNLRKLTS PAGCTIVVGR NRRGNEYLTF SMARGNDIWL HSRGCPGAHV LIQQRRGSAQ VTEECLQLAA DLAVFYSDAR REQRADVMAA EPKHILKPRG SPLGTVKVRE EWKVLAGFPD NVPRELKEAR EESGQLEEYR AANKAKHRKR NKEATKQQKA KERSKTTN
|
| |