Gene Information |
Locus tag | PHATRDRAFT_2607 |
Symbol | |
ID | 7203336 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 540008 |
End bp | 541204 |
Gene Length | 1197 bp |
Protein Length | 370 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182553 |
Protein GI | 219124527 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTCAACCAAG GGGACTACGA TAAACGAACG GCTTTGCACT TGGCATCCGG CGAAGGACAC GCATCGATCG TGCTGGCCCT TTGTGAGGCC GGCGCCGATC CCAACGTGGA AGATCGCTGG AAGCGACGTC CTCTCGATGA CGCTTTTGCC GGCGGGACGG ATGGAGCCTA CGAAGAATGT GTCGCCATTC TCCAACGATT TGGTGCAGCC AGGGGCTTAC AACGATCGAC AACTTCCAAC GTAAATCTAG AACTCGACAA ATCCAGTAAG CGACAGAGCG ACAATCTCAA AATTAATTTT GGTGAGCTCG AAATGATCGA CCGCATTGGC GCTGGCGCGT TTGGAGAAAT CTACAAATGC CGTTGGCGTG GCACCTTGGT AGCGGCAAAA ATTATTAAAA CGGCTAAGAT TCGAAAAGAA TGGGTCAATC GCCAAATTTC CGCCGCCATT AAGAAAGGAA AAGACGTGGA TGAAGCCATT CACGAGCTGG ATGAAGCGGA AATGGCACAG AATGAACGAG ACTTGGCCAT TGCCGACTTT CACCAAGAAA TATCGGTACT CAAGTCTCTA AGGCATCCAC AGATTGTGCT CTTGCTCGCT TACTCAACCA CAGCAGACTA TGAAGTCATG ATTTCCGAGC TCATGAAGTG TTCCTTGCTG GACGTATTCA AATCCCACAT GGTCCAAGGA ACGCGCATGC GGAAGCGAAC CCAAATTATT TACGCGACAC AGCTAGCACG CGGCATGAAT TACTTGCACA CGTGTAGCCC TCCCATCATT CATCGTGATC TCAAGCCTGC TAATTTGCTG ATCGACCACA GCGGAGTATT GAAGATCTCC GACTTTGGCC TCTCCAAGAT ACGCCCCGAT CCAGGAAAGA AAGAAACGGA AAAGTATACC ATGACGGGTG AGACCGGTTC CTACCGCTTC ATGGCTCCAG AAGTGTTTCG TCACGAAGAA TACAACGAAA CCGTCGACAT TTATTCCTAC GCCATGATTT TGTTCTATTT GCTCGTTGGG CGACCGCCGT GGCCTACCAT TTCGGGTATG AATGCAGTCA AAAAGGCCGC CGAAGAAGGA GACCGACCTA ACGTTCCTCG AGATATGGAC TTGCGCATGC AAAGTCTACT CAAGGAATGC TGGGATGAGA ATGCTTCAAT GCGACCAGCC TTTCAACGTA TTCTCGCCAA TTTGGAA
|
Protein sequence | VNQGDYDKRT ALHLASGEGH ASIVLALCEA GADPNVEDRW KRRPLDDAFA GGTDGAYEEC VAILQRFGAA RGLQRSTTSN VNLELDKSSK RQSDNLKINF GELEMIDRIG AGAFGEIYKC RWRGTLVAAK IIKTAKIRKE WNERDLAIAD FHQEISVLKS LRHPQIVLLL AYSTTADYEV MISELMKCSL LDVFKSHMVQ GTRMRKRTQI IYATQLARGM NYLHTCSPPI IHRDLKPANL LIDHSGVLKI SDFGLSKIRP DPGKKETEKY TMTGETGSYR FMAPEVFRHE EYNETVDIYS YAMILFYLLV GRPPWPTISG MNAVKKAAEE GDRPNVPRDM DLRMQSLLKE CWDENASMRP AFQRILANLE
|
| |
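
The fields above can be cross-checked with a few lines of code. The following is a minimal sketch, not part of the original record: the function names (`clean_sequence`, `span_length`, `gc_percent`) are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive, which is consistent with the record (541204 - 540008 + 1 = 1197 bp). The sequences are displayed in space-separated blocks of ten, so whitespace has to be stripped before counting.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used to group residues into blocks of ten."""
    return "".join(raw.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence (whitespace ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the "Start bp" and "End bp" fields.
    print(span_length(540008, 541204))  # 1197

    # First two blocks of the gene sequence, used only as a small example.
    first_blocks = "GTCAACCAAG GGGACTACGA"
    print(len(clean_sequence(first_blocks)))  # 20
    print(gc_percent(first_blocks))  # 55.0
```

Passing the full "Gene sequence" string to `gc_percent` gives a value to compare with the reported GC content, and `len(clean_sequence(...))` on the "Protein sequence" string can be compared with the reported protein length.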