Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43144 |
Symbol | |
ID | 7196751 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 2148174 |
End bp | 2149932 |
Gene Length | 1759 bp |
Protein Length | 409 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177457 |
Protein GI | 219111411 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.471661 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCGACACCGG TTGCCATGCA ATACGGCGGA GCAGTATCTT GTCGATGCGC TTCATTGAAT TCTACCAACA TAGATTGAGG TCGATAGGGG GCACGCCGTT TACATTTCTC GTTGCAAGTT TTACTTCGGT ATAAGTGATA CTGGAGTGCG GCAGCTTACT ATTAGATTGG GGAGAATTTG AACGCCGTCA AAAACGATGG TAGTCCCACA TATCCTTGGC CCGTCAGCTT TCCCTACTGT GCCAATGGCA CCCCCGGCGG CATGCTTGTC GGTATTGAGC TACAATATCC TGCTTCCAAA TAGTATGGAT GGCTGGTGGA ATTACAAAAT GTATTCGCCT CCTTTGCCGG AATCGAAGCA ACACGTTTCG TCTTGGAATT TTCGTAAGGA CTTGTTACGT GAACGAATCG CCACAGTCGG TAGGTTCTTG TAGTTTGCGT CGGAATTCCT CTAATGCTCG CAACGCTGAC TCTTTGTTGT GGTATAGATG CCGACATTGT ATGTTTACAG GAAGTATCGC CGGTTTCGTT CGATACCGAT TTTGATTTTA TGCGAGAGCT AGGCTACGAT GGAAAAGAAA TGTTCAAGAA GGGTCGATTT CGCCCCGCGA CGTTCTGGAA AACGTCGCGA TGTGAGATTG TGACCCCTCC GGTTCACAAG GATCGTACAT TACTAACGGC CTTTCGGGTA TTGCCGCCGC CAACGGTGTC AGATCCAGCA GAGACCCATG TGTGGTACAT TTTAAACTGC CACCTACAGG CTGGCAAGGA AGGGGGTCGA AGAGTTCGAC AAATCCACGA AGGAGCTCGC TCGGTTCTGA CCTTGGCAAG AAAATTAAAA CGTACGTTTG AGCCTTTTCC GTTGTGAACA TATGCTGCTC TGGTGTCTCT GATGCGATGT AATAAATGTT TTTTTTATTT CGCAGAACCT AATCCCGAAC AATGTACAGC ATTTATAGTT TGTGGGGACT TCAATGGAGG CCCCGAATGT GGCGCTGTAC GTTACTTGGA AGACGGGTTC GTGGATGAGT CTTTTATCGA AGATGGAGAT AGGGTCACCT CCAAACGCAA AGACTTTCCG TTCGAAAAAC CACTGACAGA TGTTATGGCT GCTTCTGACC GATCGCCGCC GCCAACGCTA GTTGTAAGCG AGCTTATATC TACCATGGTC CGCAGTAATG CGTACGAAAA TCCAGAGTTT TCGGAAGATA TGATGGAACG TCTTGTCCGT ATTTATGAAA GATTAGCGAC AAAGTCGCAA GAATCTGGTT GTAAGATGAT GGACGAGGAA GATGTCGAAC GCTGGTTGAT CACTACTAAC GGGCAAGTTG GCCGGGGTAG CGAATTTCGT AACGCAGCGA AAGAGATGGG ATGGACCGAG GGATGCAGCG CAAAACGCCA AGACGGCAAA CCGCACGTTG AACTTCCCAA AAGAGGGATT CTTTCACTGG AAGGTTTCGT AAACGTGTAT CAAGCAGAGC TTCGGCAAGG CAAATTCTGG GGCATTGCGC ACGACATGGC CGTTTTGGGG GAGCCCCTAC CTGATGCAGG CGTGTTTCAG TCAAGGTTCG ATCGTATGTA CTGCTCCAAA GCTCTACAGC CAACCGCCGT AATGGACTTT TTGTGCTTGG ACCCTTGCCC AAACGAGATT GAACCGTCAG ACCATCTCCC CGTAGCAGCT TCATTTACAC TTTTTAGCTA AGGTCGTAAG GCTACTGGAC ATAGCAGTAA ATTCGAAAAT ATAACACTGT TTACATTCC
|
Protein sequence | MVVPHILGPS AFPTVPMAPP AACLSVLSYN ILLPNSMDGW WNYKMYSPPL PESKQHVSSW NFRKDLLRER IATVDADIVC LQEVSPVSFD TDFDFMRELG YDGKEMFKKG RFRPATFWKT SRCEIVTPPV HKDRTLLTAF RVLPPPTVSD PAETHVWYIL NCHLQAGKEG GRRVRQIHEG ARSVLTLARK LKQPNPEQCT AFIVCGDFNG GPECGAVRYL EDGFVDESFI EDGDRVTSKR KDFPFEKPLT DVMAASDRSP PPTLVVSELI STMVRSNAYE NPEFSEDMME RLVRIYERLA TKSQESGCKM MDEEDVERWL ITTNGQVGRG SEFRNAAKEM GWTEGCSAKR QDGKPHVELP KRGILSLEGF VNVYQAELRQ GKFWGIAHDM AVLGEPLPDA GVFQSSQPP
|
| |