Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43176 |
Symbol | |
ID | 7196769 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 2234459 |
End bp | 2236609 |
Gene Length | 2151 bp |
Protein Length | 464 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176938 |
Protein GI | 219110373 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTGCGTGTA GGTGAGCATC GGACCATGGT GGGATATTGT GACATTTTCG GGCAACCTGA AAGCGTGATT TAGTGTTTGC GTGTCGATAA TGCTTCTAAG CGTCGCGGAG CGTCTTTGGT CCCGTCCAAA ATGGTCTTGG ATACGTCGAG GGTTGCCTCT GGTTGCGACG TACGTTTTAC GTGAACGCAG TACTACGCGC CGCTAGTCTG CGCCCGAAAA ATGAACCCGT TGCTTGCTGC GGACCCAGCC GCTTCGATCG TCGGTATCCT TTGGTTGAAT CGTTTCGACT TCAATCCTAT GACTTGAATA TGTCAAGTCA TCGCACCGTT GATAAATATG CACTTGATCC CGTAGAGAGT CGAGGACGGA AACACATGAT TGCCTGATTC TGACTGACCA TTATTGTGAA TTCTGTTTTT GCCATTTTGT AGAAGTGGTA CATTGATTGA TCGACGGGGG TTCTTGACGC GTGGGTACTA GAACGCTCAC ATGGAGGGGG CTAGTACAAC ATCCTCGGCG ACGTCTTCTG TCGACGGACG AAGCGCCGGG GGCGGTACCT CCCACAACAC TACCGCATCC GCGTCGGATT ATTTGAACAT CGGAATGTTT GCCTCGGGAA ACACGCCCGG GGGCAGCGAC TTACTCGGAA ACAGCAACGC GGTCTTCTTG GGGAATGGCT TTGGAAATAT TGATTCTACC AATACAGCAG CAGTATCAGC ACTAGTGAAT TCGACAGGGA CAAGCTTTGA TTTGACAGAT TTTCCTAGTT TAGCGGGAGG AATAGGTGGG GCAAATGCGA GCGTAGCCGG GAATGGCCTA GCTGCCGCGT TACGACAGCA GCAACAACAA CAACAACAAC AGCTATTGGC GCACCAACAA ATGCTGCAAA GCAAGGGAGG TGTAAGCAAC GCTTCAAACT TATACCGACT GGCTATGTCG GGAGCCAACG GAAATTTTAA CATGGCAACC GAAGACTTCC CGGCACTAGG GGCCAACGCA CCACAGCCAG CACCGAGTGG ATCCTCAGCG CTGAATCCGT CATCGCTGTT GTCGGGAAGC ATGCCTGTAT CGAGAGGCGG CAATGCCAAC GGAAACGTTG GTGGCTTGTA CGCCGATATC GATACCAACA AGAACAATAC TGCTAGCAGT GGCTCCCAGT TAGATGTTTC AGGTGGTCTG TTAGGTGGTA CAGGTCTTGG TGGACTAGGA GGTATTCGAG GTCTTCAGCA AGCCGGTATG ACCGGGGCTG GAAATGCAAT GGGACGAGCG CCGTCATCGA CGGTACCGGG TGCGGGAGCA ATAGGTTCTT CGAGCTCCGG AGGGGCAGCG GCTGGTGGTG CACTGACTGG TGATTATGGT TTGCTGGGAT TACTAGGAGT CATCCGAATG ACCGATGATG ATAGAAATAC TCTGGCACTG GGGTCAGATC TGACAATGTT GGGTCTGAAC TTGGGATCGA CCGAACAGAT TTACAGTACA TTTTCTAGTC CATGGTCGGA CAATGTCGCA ACAAAAGAAC CGCATTATCA GGTACGTTAG TGAAGTCGCA CTTTGATTTC TACCGGAAAC TATGGCTGGC CCTAAACCGA CGATTTACGC ACCTTTACTA GCTTCCTGTG TGTTACTATA TGCAACCACC AGCACTGAAA ACAGGCCACC TGTCAAAGTT CCAACTCGAA ACCTTGTTTT ATATCTTTTA TGCTTTGCCA AAAGATGTTT TACAAGCGTA CGCAGCACAG GAACTATATT CACGGGAGTG GAGGTATCAC GGAGAGCTTA AGTTGTGGTT CAAGCGAGCA AGTCCTTCGG ACGGCGTGTC TAGCAGTTCA AGTGGATCAC CGCAGTACCT CTACTTCGAC ATTAACTCAT GGGAGCGACG CCTTTTTAAT GGCAGCATGA ACCAGAACAT TACTAGCGGC TTCATTACGG AAGACGAGGT ACAAGTCAAG TTCCCAAGCT CATGAGTTCA TTGTCTCAGT TAAGTTGGTT TCATGGCTAA ATTAGCTAAG GCGTCGTAGT CAGGCAAGCT ATGGAACTAA TGATATACTA GTTGCAAATG AAAAAACGCA TTCTGCTCTT TGCTGACTTA CTGCCAATAA ATTGTTAGTC GATAGGTATA ATTCCAAGTA ATTTAACAAC CAATAAGAGC T
|
Protein sequence | MEGASTTSSA TSSVDGRSAG GGTSHNTTAS ASDYLNIGMF ASGNTPGGSD LLGNSNAVFL GNGFGNIDST NTAAVSALVN STGTSFDLTD FPSLAGGIGG ANASVAGNGL AAALRQQQQQ QQQQLLAHQQ MLQSKGGVSN ASNLYRLAMS GANGNFNMAT EDFPALGANA PQPAPSGSSA LNPSSLLSGS MPVSRGGNAN GNVGGLYADI DTNKNNTASS GSQLDVSGGL LGGTGLGGLG GIRGLQQAGM TGAGNAMGRA PSSTVPGAGA IGSSSSGGAA AGGALTGDYG LLGLLGVIRM TDDDRNTLAL GSDLTMLGLN LGSTEQIYST FSSPWSDNVA TKEPHYQLPV CYYMQPPALK TGHLSKFQLE TLFYIFYALP KDVLQAYAAQ ELYSREWRYH GELKLWFKRA SPSDGVSSSS SGSPQYLYFD INSWERRLFN GSMNQNITSG FITEDEVQVK FPSS
|
| |