Gene Information |
Locus tag | PHATRDRAFT_37845 |
Symbol | |
ID | 7202650 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 205786 |
End bp | 208649 |
Gene Length | 2864 bp |
Protein Length | 845 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181864 |
Protein GI | 219123090 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACCTT TGATCATTAA TCTCGTCGGT TCGTCGGATG CCGAAGCAGC TTTGTTTGCT GAAGCCGGTA TCACCGATCG GACTACCATC GCGTCGTTGG ACCTTCCCGC GTTCACGCAA GTGTTGGGAG ACCAAGTTGG GACGCTGGCA AAGCGATACA AACTGAAGCG TGTGGCTGAG TACCTCAGCA GCGGTAAGGA AATCGGTCCC GATACGACGA TGGGAGATAT CAACCTCGTG TTGTCATCTA ACCATCAGAA TGTTGGAGCG AATGCTCCGA ACCCGGCTGC TGGACTCGAC CCTCGCCACG GAACGATGAA AGCGAGTATT AACATGATCT CAAAGTTTTC AGGTGATCTG GAAGATTTCG AAGACTGGTC GACGAAAACC GCAGCGGCTC TCGGAATCAC CGTTTATCGT AAACTTCTCA ACGGTCCTCC GGAAGTCGGA AATGATTTCG ATGCCGCCAG AAACAATGAG CTCTATCAAA TGTTTGTGAT TGCTCTTGTG GATGGTGCCG CAATGCATAT CATGGAAGGC GTGAAGGACC AGAACGGATA CGCCGAGTGG ATGGCTATCA AGGAATGGTA TGGTATGTCG GATACCGGTC GGACGATCAT TGACAAGTAT CGTAGCAAGC TGGATGCGCT ACTGCTTGAC AATGCAACCC CAGCCGGAAC CTTTGTCAAC CATTTCAAGA GATTCAGCCA GAAGCTCAAA GAAAACGGTG AGGGCTACAC GGCGGATACG AAACGACGTC AGTTTCTCGA CAAGATCATT GACAAAGACT ACGATGTGGT CAAACAGCAG TTGGAAGGTG ATTCAACTGC GGATTTCAAT GAATGTGTTG CGCGCATTCG TATGCGCGAG CAAGTCCTCA TGAAGGACTC CACGATGTCG GCCAAGAAAG CTAGACGCTT CAAGTCGAAC GAGGGCGGCA AGTCGAACGG TGGAGGTCCG TCAAGTGGGA AAATTCCTTC GATTCCAAAC TCTATACTTA ACCTGGTCAA GCCGGCAAGT GCTCGCAAGA ATCTAATCAA GTGGAGAGGA GTTTGAACTC CGAAGGGCGC ATTCTTCGAT CGGACGAACT GGCAAGCACT GAGTATGACG GGAAAGGTAA AACTCCGTCG AAAAGAGAGC ACGACAATGA TAGTTCCGAC GAATCCGTCA AGACCCGCAA CACCCAAGGC AAGGGGGGTT CCTTGTCCAA GAAGGGCAAG AGAGGCAAGG GGAGGCGAAT CACTGGTGTT GTCCGTCGAA CGGAAACGAA GACATCCGGC ACGCCTGACT CCTCTATCCG GATCAGTTTG AAGGATCCGG ACGACCACGT CGAAGTTGAT GAGTATGACG ATGTCGAAAT TGAGTCCGGT CAAGACGAAG ATTCTACTGC ACCTGTGAAG AAAGAACGGG CGAAAAAGCA GAAGAATCCG TCGAAGCGGA AGCACAAGCG TGCCAAGTCT CGTCGAAGCC CTATCTCCCG CCACGGACGC GTAGGCAACG AGAAACCAAG AGCTATCCTA GACCCAGGGA CTGAGTGCGA TATTGTTGGC GGGGACGTAT GGACAGTTCT GGAAAAGGTG ATTGGTGTAG AAGCCCAGCT AGGCGGAGCT TTAGCAGGGA TGGGCAGATG CAGTCTGCCA CTGGTTAACG CGGTGGCTGT GTACAATCAT TCTAAAGGAG GAACAATCCT AATTGGTGCC GGTAACGTGG GATACGATGA ACGAAGCACC CAGACGGAGT CGTTGTTTAA CACACACGAG TTACGAAAAC ACGGTGTCAT TGTTTCGGAC ACGACCCTCT 
GAGATGGAGG GCTTCAAAGC ATTGAAGTCG ACGGAATTTC CATTGCATTG GACTTTGTCG ACGAGAAAAC GCTTTCGTTT TACCTATGCA AACCGACCGA AAAGGAGCTA GAGAACTTGG AAATTCACTG GTTAAGTCCT CGAAGGCTAG TTAGATCTAG CATCCATCCT ATTCGATGCA CGCCGGTTGC TATAGTTCCT GAGCGGGCTC CGTGGGCCGA ACGGCTTGGA AACTGCCCGG AGTTTACTTT ATCGAAGACT CTCCTGGCGA TGACGCAACT ATGTGCTGCC CCGGTCGAAA TGGATAAACG AGAAGCTCCG CGTCAGCATC GCAAGTCTCG TATTCATGCT TTGCATCCTC GTCAAATCGA GGGTCGCACT GATTCGGATA CATTCTTCTC GTCCGTCGAG TCTATTGAAG GGTTTCTGTG TGTGCAGATT TTCTCTTGCC ATGAATCTAA CTATACTTAT CTTAGAGGTA TGAGAAAGGA GTCGCAGTCG CACGGAGCGT ATCAAGATTT TTTACGGAAT GTGGGAGCAC CTAATGTTCT ATTAACCGAC AATGCTAGAA CGCAAATCGG TAAAAAGTGG ACTAAGACTA GTCGGGAAAA TGTAACTCGA CAGATCAAGT CCGTTCCGAA CAATCAAAAC CAAAACCAAG CCGAACGCAA AATTCAAGAT GTGAAAAAAC GAACTATTCT CACTTTGCGA TATGGAAAAA CACCGCTCAC ATTTTGGTGT TTTTGCCAAC AATTTATTGT CGACTGTTTG AATCATTCGG CTCACAAGGA TTTAAATTTT CGCACCCCAA TGGAAAAAAT GTACGGTCAC ACACCTGATA TTTCCATGTT TCGATTCCGA TTCTGGGAAC CCGTTTGGTA CTATGAACCG ATGGCCAAGT ACCCAGCTCC TAATTTTCTC CCTGGTTGTT TCGTTGGAAT TGCCTGGGAC CATGGCGACG ATTCAGCACT TCAGTCTTTC GACCCGCTGA ACAACGACGC GGAGGTCGAG ATGACAAGCG AGATCAACGA CTACCTTGAC ACAAATGAGT CCGCGGCTTC ATGA
|
Protein sequence | MEPLIINLVG SSDAEAALFA EAGITDRTTI ASLDLPAFTQ VLGDQVGTLA KRYKLKRVAE YLSSGKEIGP DTTMGDINLV LSSNHQNVGA NAPNPAAGLD PRHGTMKASI NMISKFSGDL EDFEDWSTKT AAALGITVYR KLLNGPPEVG NDFDAARNNE LYQMFVIALV DGAAMHIMEG VKDQNGYAEW MAIKEWYGMS DTGRTIIDKY RSKLDALLLD NATPAGTFVN HFKRFSQKLK ENGEGYTADT KRRQFLDKII DKDYDVVKQQ LEGDSTADFN ECVARIRMRE QVLMKDSTMS AKKARRFKSN EGGKSNGGAG KCSQESNQVE RSLNSEGRIL RSDELASTEY DGKGKTPSKR EHDNDSSDES VKTRNTQGKG GSLSKKGKRG KGRRITGVVR RTETKTSGTP DSSIRISLKD PDDHVEVDEY DDVEIESGQD EDSTAPVKKE RAKKQKNPSK RKHKRAKSRR SPISRHGRVG NEKPRAILDP GTECDIVGGD VWTVLEKVIG VEAQLGGALA GMGRCSLPLV NAVAVYNHSK GGTILIGAGN VGYDERSTQT ESLFNTHELR KHVPERAPWA ERLGNCPEFT LSKTLLAMTQ LCAAPVEMDK REAPRQHRKS RIHALHPRQI EGRTDSDTFF SSVESIEGFL GMRKESQSHG AYQDFLRNVG APNVLLTDNA RTQIGKKWTK TSRENVTRQI KSVPNNQNQN QAERKIQDVK KRTILTLRYG KTPLTFWCFC QQFIVDCLNH SAHKDLNFRT PMEKMYGHTP DISMFRFRFW EPVWYYEPMA KYPAPNFLPG CFVGIAWDHG DDSALQSFDP LNNDAEVEMT SEINDYLDTN ESAAS
|
| |