Gene Information |
Locus tag | PHATRDRAFT_48235 |
Symbol | |
ID | 7203541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 598733 |
End bp | 599836 |
Gene Length | 1104 bp |
Protein Length | 326 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182567 |
Protein GI | 219124557 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.652945 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGACTGCGAT GGGCGGGCCA CTTTGCATGG TCAAAAGAAG GTAACAAGGG CTGGATTTGT CAAAATACTT GGTGAAGGAC CCGCGACCTA TGTCCACACG AGCATCAGCA CCTTTCAATT TTAATGTGGT TTCGTCATCA GACTGCATGC ATTTTTCGCA GTGTTAAACC AGATGCCGTA CTTGCTTTTG CGATACCCGC AGCATGTATG GCTGTTCTAG CCGAGGCTTT CTTTCGCACT GAATACGCCG ACGCTGACGC TACCGCTGCC TGCGAGTTTG ATGCCAGTAG CAATCTCATC AAGCCACACC ATGTTCGGGA ATTGGAAACG AAAGGCATCG TGGTGATTGA GAATGCAGTA ACGGCCACAA CTCTGAGAGG CGCACGCAGT AACATTCGTG ATTTTCAAAA AGAAGGTTCG ATTTGGACAA ATACACGCGT GGGCGGTTTT GCTCCTAGTG GAAACGATCC TGACGTCCGA CAGGATCTGA TTGCTTGGGT GCGTAGTAGC AACGTGTCCG ACGATGAAGC TTCCACGAGG TCGCACCACG TCGAAACTCA AGGTAGCTTA CTGATAGTAC AGGACAAGAA CAATCGCCAA GAAAACAGTC CCCTTGGGAA AGATCTCCTG TACAGTATTC AGATACTTCG TGGGATTCCA TTTGCTTTGG AACAATGTGG CTACTCGGCT TCCAAAAATC ACCGTGTGCC AAGACAATGC CAATTAGCTA TGTACCCGGG CAATGGTTCA GCTTCTTACG AACGCCATTT AGATCAGTGC GATGCTAGTG TTTACGATCT TGGGATTCTG GAATGGTTGC GGCTAAGTGA CTACCGTGAG AGAGCCATCA CGACAATTTT GTATCTCAAT GAGCCGAATC GTCCTGAAAG CCATGGCGGA GCTCTCCGAT GCTGGGTAGC TCGTGACATC GATACGAAAA GAGACAACAA AAACCGAAAT GAAAAAGATG ATTTTCGCCC TCCATTTGAT GTTAAGCCAA CGGGAGGGAC GATGGTCATT TTCCAAAGTG GAAAGGTCGA TCACAAAGTA CTGCCATCAA CAGAGGAGAG ATTTGCTTTA ACTAATTGGG TAGCTAGTTC CTGA
|
Protein sequence | MWFRHQTACI FRSVKPDAVL AFAIPAACMA VLAEAFFRTE YADADATAAC EFDASSNLIK PHHVRELETK GIVVIENAVT ATTLRGARSN IRDFQKEGSI WTNTRVGGFA PSGNDPDVRQ DLIAWVRSSN VSDDEASTRS HHVETQGSLL IVQDKNNRQE NSPLGKDLLY SIQILRGIPF ALEQCGYSAS KNHRVPRQCQ LAMYPGNGSA SYERHLDQCD ASVYDLGILE WLRLSDYRER AITTILYLNE PNRPESHGGA LRCWVARDID TKRDNKNRNE KDDFRPPFDV KPTGGTMVIF QSGKVDHKVL PSTEERFALT NWVASS