Gene Information |
Locus tag | PHATRDRAFT_49693 |
Symbol | |
ID | 7198322 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | - |
Start bp | 448066 |
End bp | 450075 |
Gene Length | 2010 bp |
Protein Length | 525 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184464 |
Protein GI | 219128530 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
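Note on coordinates: the Gene Length of 2010 bp follows from the Start bp and End bp values if both positions are counted (inclusive coordinates). The short sketch below reproduces that arithmetic and compares it with the number of bases needed to encode the 525 aa protein. The variable names are illustrative only. Counting a stop codon among the coding bases, and attributing the remainder to introns or other non-coding sequence, are assumptions that this record does not state.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 448066
end_bp = 450075
protein_length_aa = 525

# Inclusive coordinates: both the start and the end position are counted.
gene_length_bp = end_bp - start_bp + 1

# Bases needed to encode the protein: three per residue,
# plus one stop codon (assumption: the stop codon is counted).
coding_bp = protein_length_aa * 3 + 3

# Bases in the span not accounted for by coding codons
# (assumption: introns and possibly untranslated sequence).
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, non_coding_bp)
```

A non-zero remainder is consistent with the "Is gene spliced | Yes" field, since a spliced gene spans more bases than its coding sequence alone.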
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAACT TCTCCTACTT GGCCTCTGCC TTTATGCTTG GCAGTCCCTT CGCTTCTGGG GATCACACTT GGGTACCCAT TCGTAGTCTT AAAGGTAGCA AAGAGAAAAA AGCAGGTGGA GCTCGCGGGA ATGAAGGTAC GCCTACGAAA GCCAAAAGCG GTCGAGACGT CAACGTTGTC AATCCAGCCG AAAGCTATCA GCAACATTAT CTGGTACGTG TCTCCGGTCT CGGCAAAGAA GATTTCAAAT GCCCAACCAC AGCCTTTCTT TGCTGAGTGT TCAAGCCAAA GATTATATAT CTCTTACACA AGGTTCTGTT CCCTGTTCGT TCATCCTTCC AGGTCGGACT TTTAAATTAT CCCAATTCTA ATATTCCCGA TATTCTCAAG GATGTTACGA GTGGAACCAA GGACATCCCT GGCGGTCCAT ACAACTACGT TCCAAGGTAT GTAACTCTTC TTCCATTATT TTGGGCTCGC ACCGTGTGAT CTCAAATTTC GCTGTTCCTT TGCTTTGATT GTGCAGTGTT GGCTTTTCCG ACATTGAGTT CTATTACGAC AAAGAAGGCA GCGCTGTTCG GGGCGAATAC TTGTGTCTGT CCGACAATGG TTTCGGATCG TCGGACAATT CGGAAGATTA CGCACTCAAT ATTGTGCATA TGAAGATCCA AAAGCCATTT ACTTACCGAC ACGGAGAATC AACGTTCGAA ACGTACACCG AGACAGAGAA TCTCGAGACG CTGCTGATTC ACGATCCTCA CGCTTTCATC AAGTGGGAAA ACGGCGCCGA TATCCAGGTC ACCTACGCAG TTCCCGACTC TACTTGGGAC GAATACAAGA GACTTCGTGT ACTGACTGGT CGCGATTTCG ACGTGGAGGG TATGGCGGTG ATTAACCACT CCTGCGCTAT TGTTGGTGAC GAACTCATGC CGGCCATTTT TGCCATAGAC CCGAGCACCG GTATTGTCCA ATCCAACTTT GTACGTACTC CGGATATCCA CCGTCACGGT CGTTTCAACG GAAAGTTCCT TTCAACGCGT GGCGACAAAG TTCACTGCAG CCGAGAGGAT TTGGAGGCCA ACACGTGCTC CTCGGTAGCT TCGGAAGTGG TGGAGGCGTC GGAATATCGC AAACACGATC CTTCCGGTGG ATTTGAAGGC TTTTCGGTCT TGGCCGACGG CTCCATTGCG GCATTTTTAG AAAAGACTAC AGGAGACTCG ACTCTGGGGG ATGAACCCGG CGTCCGCGTT TACCGCGTAG ACCCTGGAGA TTGTACCAAG GATCACCCGC CCCGGTTTGA TGAATTTCTG GGATACTATC CGTTTGAACT CAACGGAGAA AATATTGCCG ACGTTTCGGC CATTCCCGGA TCGAGCTCAA AGGTCGTCGT GATTGAGCGG AACGGCTTTC CGTCGGGATA CTTCTTTCCT TCTCCGGTCA TGCCGGCCAA CAAGGTGTGC GTGGTGGACC TGTTGGACAC TGACAACGAT ATGGTGATGC GCAACAAGAA ATGTGTGCTC AACTATCACG ACATTAGTGA CCCGTGGGAT GTGGACGGCA ACGGCATCTT CAAGTACGCT CAGACGCAGG TGACCAATGA GCAACTCATT GTGGTGGACG ACTACTGTGT TGTAGCGGGC ACGGACACCA ATTTTCCTTT TACCGACCAG TTTGGAGTGG AAGGCGTCGT GGAGGTCCCG TTCCAGCAAG AAGTTACAGA CACTCGATTT ATGGTGGTGT GCTTTCTCGA GCCAATTTTC CACGCGGACT ACCCAATCTT GGAGTAGGTA GAAAGGTAGG 
CAGTGAGGGC GACGGGCTAC TATTGATTGG CAAAAGTGGA AAGTCTTTTT TGTACAGTGC TGGCAGGTAA CGTCTACTGT AGTGATTCAG GTGAATAATA CTTCAATTTG TGTGAGCATT TTTCGGTTCG TGTACCATGG CGTGAAAGTA AAAGTCGTCG GGTTGTTCCG TCACGTTTTC CGACTGGCGC TGGCCTTGAT GATTGGGTCG
|
Protein sequence | MTNFSYLASA FMLGSPFASG DHTWVPIRSL KGSKEKKAGG ARGNEGTPTK AKSGRDVNVV NPAESYQQHY LVGLLNYPNS NIPDILKDVT SGTKDIPGGP YNYVPSVGFS DIEFYYDKEG SAVRGEYLCL SDNGFGSSDN SEDYALNIVH MKIQKPFTYR HGESTFETYT ETENLETLLI HDPHAFIKWE NGADIQVTYA VPDSTWDEYK RLRVLTGRDF DVEGMAVINH SCAIVGDELM PAIFAIDPST GIVQSNFVRT PDIHRHGRFN GKFLSTRGDK VHCSREDLEA NTCSSVASEV VEASEYRKHD PSGGFEGFSV LADGSIAAFL EKTTGDSTLG DEPGVRVYRV DPGDCTKDHP PRFDEFLGYY PFELNGENIA DVSAIPGSSS KVVVIERNGF PSGYFFPSPV MPANKVCVVD LLDTDNDMVM RNKKCVLNYH DISDPWDVDG NGIFKYAQTQ VTNEQLIVVD DYCVVAGTDT NFPFTDQFGV EGVVEVPFQQ EVTDTRFMVV CFLEPIFHAD YPILE
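Note on sequence layout: both sequences are displayed in space-separated blocks of ten characters, so the spaces need to be removed before the text is used elsewhere. A minimal sketch follows, using only the first three blocks of the protein sequence as a sample. The helper name `ungroup` is hypothetical and is not part of any database tool.

```python
def ungroup(seq_field: str) -> str:
    """Join display blocks by removing all spaces and line breaks."""
    return "".join(seq_field.split())


# First three blocks of the protein sequence from this record (sample only).
sample = "MTNFSYLASA FMLGSPFASG DHTWVPIRSL"

residues = ungroup(sample)
print(residues, len(residues))
```

Applied to the complete Protein sequence field, the length of the result can be compared with the Protein Length value listed under Gene Information.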