Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41413 |
Symbol | |
ID | 7199308 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | + |
Start bp | 106060 |
End bp | 107193 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185379 |
Protein GI | 219130452 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 0.659296 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCCA ATCAAATAGC CGTCCAGAAA TGGACCTTGG TAAATTACCC CGAAGACCAT TTCGTAGCGT CCCGGGACGC GAAATACGTG TCGGACGAAA CGATCGAATT GGACAACCTC CAGTTTGACG GTACCAAGGA AAGCACTTCC TCCACGACCT TCCCACCCGA CCACGTCGTC GTTCAAGTCG AAGCCTTATC GGTGGACGCC TTTATCCGCA CTATGTTGGA TCGTGAAGCC TACCATGGCA GTGTCAAGCC GGGTGCCATC TTACCAGCTA TGGGATACGG TGTAGTCCGT GCCGCAGGCC CGGACGCCAA GTACAAGCCC GGTAGTCGTG TCACGGGAAT ACTCGGGGCC ACCACGGTCG CCATTCTCCC CAACGATCAA CTCAACCCCG TCCTCATCTT CCCCCGCATT CCGTCGTCAC TCAGTCTGGG ATATCTGGGA CTCTCCGGTC TAACGGCGTA CGTGGGTCTC TTCGGAGCGC CGCCCAAGGC CCCAGGCCGT GGCGACACAG TCGTCGTGAG CGCGGCGGCC GGGGCGGTCG GACATATTGC GGCGCAAATG GCGCGATTGG CTGGAGCAAC CCGGGTCGTT GGTATTGCCG GTGGTCCCGA CAAAAAAGCC TTTCTGTTGG AAACGCTGGG ACTTGATGGC GCGATTGACT ACAAGGACGA AACGCAGAGT CTGGAAGCGC AGCTGGACAC GCAGTGCCCG GACGGTGTAG ATTTCTTCTT CGACAGCGTC GGTGGGGATA CGCTGGAGAC TGTGTTATCA CGCATCAATC AGAGCGCGCG GGTGGTCATT TGTGGTGCCA TTTCGCAGTA CTCCAGTGGC AACATCAACA AGAAGAATCA AGTGCAGGGC CCGTCTACGT ACATAAAGTT GGCGGAAAAG TCGGCTTCTA TGACCGGGTT CAACGTGATG CACCATCCCT GGGCCATGGC TAAGGCTATT CCGTATCTGG TATGGCACTA CTATCGGGGT AACATTCACG TTCCCGAACA CGTTGAAGGC GATTTGAAAG CCTTTCCTTC GGCGCTGGAA AGTATGTTTG AAGGTGGCAC CCATTGCGGT CGACTGTTGA TTGACGTCGA CGGTAGTCTC GGCAAGGCCC GTGACCGATC GTAG
|
Protein sequence | MNANQIAVQK WTLVNYPEDH FVASRDAKYV SDETIELDNL QFDGTKESTS STTFPPDHVV VQVEALSVDA FIRTMLDREA YHGSVKPGAI LPAMGYGVVR AAGPDAKYKP GSRVTGILGA TTVAILPNDQ LNPVLIFPRI PSSLSLGYLG LSGLTAYVGL FGAPPKAPGR GDTVVVSAAA GAVGHIAAQM ARLAGATRVV GIAGGPDKKA FLLETLGLDG AIDYKDETQS LEAQLDTQCP DGVDFFFDSV GGDTLETVLS RINQSARVVI CGAISQYSSG NINKKNQVQG PSTYIKLAEK SASMTGFNVM HHPWAMAKAI PYLVWHYYRG NIHVPEHVEG DLKAFPSALE SMFEGGTHCG RLLIDVDGSL GKARDRS
|
| |