Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35903 |
Symbol | |
ID | 7201522 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | + |
Start bp | 45031 |
End bp | 46136 |
Gene Length | 1106 bp |
Protein Length | 320 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180366 |
Protein GI | 219119202 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.202335 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGCGC ATCGTAGGAG AAGTCGCACT TGGTCGCAAA ACCGCGGAAA TGGTAGTATG CCCGATGCGT TCGGGTCCAT ACAGGAAGAA GGAATCAGCG CCACTCCATA CTCTGCGCAA ATAGGAACAG TACGGCAAGG GCCACTTTGT ATCGGTCTTG CGTTGCTTTT CGTTATCCCG ATGTGTACGT AGTTCATTGA TTGCTTCTTT ACCTTGCAGG TAACACTTCT CGTCTTATCT TGGTTTCCTG CTTTTCCTAT ACTTTAATGC CATATGCATC TGCTACAGTA ATCCTTAGCT TCACTGATAC TACCATCAAT AAATCAGTAC AAGAGGCTCA TAACGACGTC AGAGCTGGAG GGGCATGGGT ACCAGTACGA GACTCCTCGG AAAATGTCGT GCAGGGTTTT GATCCCAAGC AACAGCAGCA ATCACCCCAC TCAATACCAG ACCCCGTAAA AGAGATGGAA GAGGCCAACT TAGAACGATG CTTACTTCAA AGTATTAAAT CGGATACGGA CGGAACGGTC TCGAAAGAGC GGCTTATAGT CGAGGCACCG AACCGAGTGC TCGTAGCCGT AACGGAAAAG GAAACATCGG ATTGGATATT GCTACAACAC GCAGGGATCT TGATTCCTCC CAGGACCAAG CAACCATCAC TAACACCTGT TGAAGGACAA ATCCTGCACT CGTCCACACA TCCTTTTGAC TTGGCGAAGA AGATGAGCCG CGAACAGCTC GGGCTCGTCT CGCCACATAC ATCGCAACTT TACGCCTCTC GAGAGCAAAT ACAGTTCGAT GTGGAAGGAA TCTTAGACGG TACTCTACCT TCCAAGTTAC AAGACGATAC GCATTGGAAA TATCTGGGTC GGTATCGGAA CGATGCAGAT TACGGTGGCG GCTTTACCTT TGTCTATTTC TTGAAAAATG CTGTCACAGT CAAGAGTAAT TCCGTAAATA CTAGCCGGCT ACTCCCGGAT CCTTGGAGCC GAATCCCAAA TCAGATCTCC CTGTCGACAG CACAAGTCCG TAACGCGCTC ACAGCGGGAG AGTTTGGGAC GCTAGCAAGT GCGCTTGCGA TCAGTCTCGC CTTGCAGCAC GTCTAG
|
Protein sequence | MSAHRRRSRT WSQNRGNGSM PDAFGSIQEE GISATPYSAQ IGTVRQGPLC IGLALLFVIP MLQEAHNDVR AGGAWVPVRD SSENVVQGFD PKQQQQSPHS IPDPVKEMEE ANLERCLLQS IKSDTDGTVS KERLIVEAPN RVLVAVTEKE TSDWILLQHA GILIPPRTKQ PSLTPVEGQI LHSSTHPFDL AKKMSREQLG LVSPHTSQLY ASREQIQFDV EGILDGTLPS KLQDDTHWKY LGRYRNDADY GGGFTFVYFL KNAVTVKSNS VNTSRLLPDP WSRIPNQISL STAQVRNALT AGEFGTLASA LAISLALQHV
|
| |