Gene Information |
Locus tag | PHATRDRAFT_43530 |
Symbol | |
ID | 7197210 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 726897 |
End bp | 728126 |
Gene Length | 1230 bp |
Protein Length | 356 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177992 |
Protein GI | 219112481 |
COG category | |
COG ID | |
TIGRFAM ID | |
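
The length fields above can be cross-checked against the coordinates with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence consists of three bases per residue plus one stop codon. The variable names are hypothetical, and reading the leftover bases as intronic is an inference from "Is gene spliced | Yes", not something the record states.

```python
# Values copied from the record above.
start_bp = 726897
end_bp = 728126
protein_len_aa = 356

# Assumption: coordinates are 1-based and inclusive.
gene_len_bp = end_bp - start_bp + 1

# Assumption: coding length is three bases per residue plus one stop codon.
coding_len_bp = 3 * protein_len_aa + 3

# Bases in the gene span that the coding sequence does not account for
# (presumably intronic, since the record marks the gene as spliced).
noncoding_len_bp = gene_len_bp - coding_len_bp

print(gene_len_bp, coding_len_bp, noncoding_len_bp)  # 1230 1071 159
```

The first value matches the "Gene Length" field of 1230 bp; the difference of 159 bp is consistent with a spliced gene.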
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.760626 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACATCA TCAACGCCTT TTTCAGCCTT ACCCACGTGA CATCGCTTCG CTTTCGATCC TTGTTCCGCG ATTCCGAAGG ACAATACGAC GAGATTCCGT CCAACGACGA GGAGCATCCC GGTCTACATG CGTCTTCCTC ACGTGACCAC AGCCTGACTC GGGAGTATGA AGATTTTTCG CATTCCTGGA GTGCCTTGCG CTTGTGCCTG TACCACGCCA TTGTCTACTA TGCGATTGCG GTCTTCGCTC TTTCTTTCAT TCTCTACAAA TGGCCAATCA TCGATTCTTT GTACTTTGCG ACAGTGGTGT TCACTACCAT CGGATACGGA GACCTGCATC CCACCGATCG ATCCGGACGA GTGTTCACTA TATTTTTGTC GCTTTACGGG ATCGTAATTT TGGGCCTCTT CTTGGGAATT TTGGGTGATG CCGTAGTCGA AGGTCACAAC CGTGTGGTAG AAACACGACG GCGCAAGCTG AACAAAAAGG TTTTGGACGC GTTGGCACAA GATCAAGGGG CGAAGAAAAA TGTAGCAGAG TCCAACGGAG ACAATGGCTC CAGCAGTAGT GATGACGTGG TAGAAGTGAA GAGCTTGATG CAAGATATAT GGTCGATTGT GGTCCTGGAA GCTCCGATTG TTTCGTTAGT TGTGCTTTTA GCCTTCCTCG TTGGTTACGT CGAGAAATGG CCACTCATTG ATAGGTTGGT ACCAAGTTGG TGGCCCAAGC AGAGCATCTT CTGTTGAAAA AGTACGCTAG CTCACCCTTT GTCTTGCCAT ACATATTTCC ATTCCACGGA CAGCCTTTAT TGGGTCGTCA TTTCCGGTAC GACGGTCGGC TTTGGCGACG TGACCCCGCA CACCCCCGCC ATGCGGGTGG CGGCGATTTT CTTTCTGCCG TTCGCCGTGG CCGTGTTGGG TGAGTTGTTG GCCCGGGTCG CTTCGGCCTA CATGGAACGG AAACAGCGGC AAACGGAACA CGAATTCTTG TCCCGATCGT TAACATTGTG CGATCTCGAA ACGATGGACG CCGACCAGGA CGGACGCGTG GATCGCGCCG AATTTATGAT TTACATGCTG GTGGCTCTCC AAAAGGTGGA GAAAGCAGAC GTTGACCAGG TTTGCCAGTT TTTTGAACGA CTCGATCAGA CCAACGACGG GTATCTTACC AAACAAGATT TGCTGGATCG CCAGTGGAGC GAGAATTTCC GTTCGTCTTT GGCGGGGTAA
|
Protein sequence | MHIINAFFSL THVTSLRFRS LFRDSEGQYD EIPSNDEEHP GLHASSSRDH SLTREYEDFS HSWSALRLCL YHAIVYYAIA VFALSFILYK WPIIDSLYFA TVVFTTIGYG DLHPTDRSGR VFTIFLSLYG IVILGLFLGI LGDAVVEGHN RVVETRRRKL NKKVLDALAQ DQGAKKNVAE SNGDNGSSSS DDVVEVKSLM QDIWSIVVLE APIVSLYWVV ISGTTVGFGD VTPHTPAMRV AAIFFLPFAV AVLGELLARV ASAYMERKQR QTEHEFLSRS LTLCDLETMD ADQDGRVDRA EFMIYMLVAL QKVEKADVDQ VCQFFERLDQ TNDGYLTKQD LLDRQWSENF RSSLAG
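
The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before any computation. As a minimal sketch, the "GC content" field could be recomputed from the gene sequence as shown below. The function name `gc_percent` is hypothetical, and how the database rounds its 51% figure is not stated in the record.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (such as the ten-base block spacing used in the record)
    is removed before counting.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two ten-base blocks of the gene sequence above, used as a small demo.
example = "ATGCACATCA TCAACGCCTT"
print(gc_percent(example))  # 45.0
```

Passing the full gene sequence string to the same function gives a value that can be compared with the reported GC content.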