Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42917 |
Symbol | |
ID | 7196180 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1514249 |
End bp | 1515988 |
Gene Length | 1740 bp |
Protein Length | 509 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177307 |
Protein GI | 219111111 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCACTGATCG GTCCAGCGCC CACGTTAGCT TATAAGATAA GCAGCCCTGG CATTCTACAC ACCCAAAATT TGGGAGCTGT TCTAAGATAA CGTGGCTTCC GTGCGACTGC TATCTACCTT TGCTTGTGTA ATGGTGCTTT GGGGAGGGCT CCGGCCAATT CTCTCTCGCG TTCTACTTGA CTCGGCCGCC CTGTTTGGCC TAAGCGTATC GGCAGGAACT TCGGAGTGTA CACGAAAAAA CACCGTCGTT CTGGGACTGA CATACATTGG AACTTTTGTT TCGGCCGGGC CCACAAATCG TTGGACTTTG ACACTACCAA TGGTACCATT TCTGTTTTTA CAGCTAACGA GACGATTATC TCTCTCCAAG AAAGGCGGAT GCTGGTTTCA CCGAATTGTC GATGGGTTGA GTCTATTACT GATCGTGTCA GCCGCTACTT TATCGATACT CTTTCCAGCT GTAGAGCTTC CTGCAATCAA AGGTCCCTAC AATGTAGGCG TGGTAGATTT TTTTATGCCA ATAGAATCGT CCAATATCGC AGGGAGTGTT TCAAGAGAGA CCTGTGCTTC ATCTTCTCAC GTTTCGGTGC GATTACTTTA TCCTACCAAC GAAAAACCCG TACGGATACC ATTTTTAAGA CCTGACATTG CAGCTGATTA CTGTCAACAA TTTATGAGCT TTGGAGCCCC TCCGCCTTTG AAAACGTTTG GTTGGCTGCT ACACACGTGG CGTCTAGCAC GAATGCAAGC CAAACCCCAC GCTTCGCTTT CAGATCATCC TGACGCCTTC CCTCTAGTTT TATTCTCTCA CGGTCTTGGA GGCACAGCAG AAATTTACAG TTACCAAACC ATGTCTCTTG TGGCGCACGG TCACATTGTG TTGGCTGTGA ACCACCAAGA TGGAAGTGCT CCAGTCATAA GGCAAAGAGA CGGGAACATT AAGCTGTACG ATCACGAACT TCCGAAACTC TGGGCTGGAG GCAACCATGT CGAGTATGTT CGTGAACGTC GTGCACGAAC TGATCTTCGT GTGGACGAGT TGGTGGCAGC TGCGGAAGGT ATGCATAGAT TGAATGAGAG TGACCTGGCC GAACTCCTAC TGTTCGGTCT CTCCTTCCGT GATCGCATCC AGATAGATCA AACATTTTTT ATGGGGCATT CCTTTGGAGG AGCAACGGCG CTGTCAGTGG CAAAGCGGCG ACCTGATCTT GTAAAGTCCG TGATTGCACA CGAGCCAGCC GTTGACTGGA TGCCTGATGA TGCTCGTCGC TCGTTGTTCG ATCTAAAACG ATTAGAAGGA CTTTCAACTA ACTTCACGGG AGGGACCGGA GGTTTTCTTG TCGAATCTTC AGACTCAGAA TCATCTATTC ACGATGTGGA CCTGCTTATA CTTTTCTCGG GCGAATGGCG ATCGAAGAAA TGGGGTTGGA ACCATGTTCT AGAAGAGATG CATCAGGAAG CTCGGTTGGG TCGGGAAGGA GGCTACTCTT CCTTTGCTTT TATAGATGAT GCACACCATA CAGAGTTTTC AGACACTTCC ATGATGTTGC CGTTATGGCT GGCACGTCTT ACCAATATAA CAGGGCCAAG GAGTCCCTTG TCGACGGCGA AAGAGATTCA TGAGCGAACC TTGGTGTTCA TGCAAAAAGT CAATCTATAG GGACACGTTG ACTGTAAGAA ACCATCAAAG CTCTAAGTTT GAAACTGTTT ACAATACAGA ATGACGGTGA ACTGTTTCCT
|
Protein sequence | MVLWGGLRPI LSRVLLDSAA LFGLSVSAGT SECTRKNTVV LGLTYIGTFV SAGPTNRWTL TLPMVPFLFL QLTRRLSLSK KGGCWFHRIV DGLSLLLIVS AATLSILFPA VELPAIKGPY NVGVVDFFMP IESSNIAGSV SRETCASSSH VSVRLLYPTN EKPVRIPFLR PDIAADYCQQ FMSFGAPPPL KTFGWLLHTW RLARMQAKPH ASLSDHPDAF PLVLFSHGLG GTAEIYSYQT MSLVAHGHIV LAVNHQDGSA PVIRQRDGNI KLYDHELPKL WAGGNHVEYV RERRARTDLR VDELVAAAEG MHRLNESDLA ELLLFGLSFR DRIQIDQTFF MGHSFGGATA LSVAKRRPDL VKSVIAHEPA VDWMPDDARR SLFDLKRLEG LSTNFTGGTG GFLVESSDSE SSIHDVDLLI LFSGEWRSKK WGWNHVLEEM HQEARLGREG GYSSFAFIDD AHHTEFSDTS MMLPLWLARL TNITGPRSPL STAKEIHERT LVFMQKVNL
|
| |