Gene Information |
Locus tag | PHATRDRAFT_39968 |
Symbol | |
ID | 7195466 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 482047 |
End bp | 483489 |
Gene Length | 1443 bp |
Protein Length | 457 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183883 |
Protein GI | 219127315 |
COG category | |
COG ID | |
TIGRFAM ID | |
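
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not the database's own method. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene sequence contains exactly one stop codon; the function names are hypothetical. Under those assumptions, the bases left over after accounting for the protein's codons give a rough estimate of the non-coding (intron) length, which fits the "Is gene spliced | Yes" field.

```python
# Values copied from the Gene Information table in this record.
START_BP = 482047
END_BP = 483489
GENE_LENGTH_BP = 1443
PROTEIN_LENGTH_AA = 457


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def non_coding_bp(gene_length: int, protein_length: int) -> int:
    """Bases not explained by the protein's codons plus one stop codon.

    Assumption: exactly one stop codon is included in the gene sequence.
    """
    return gene_length - 3 * (protein_length + 1)


computed_length = span_length(START_BP, END_BP)
intron_estimate = non_coding_bp(GENE_LENGTH_BP, PROTEIN_LENGTH_AA)

print("span length (bp):", computed_length)
print("matches reported gene length:", computed_length == GENE_LENGTH_BP)
print("estimated non-coding bases (bp):", intron_estimate)
```

If the computed span did not match the reported gene length, the 1-based inclusive assumption would be the first thing to revisit.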
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAGC CGGAGCATCC TCTACCGCCT TCGACCACGG ATGATTACGT GGCGAACGGA GAATTTTCTG CGTCATCAAT GAATGCCGAC GACGCTAGTA TGGTCGCATC CGATGCTGCT GCTTTACACT TGCCAGAATC GTCCGAGGTA CTGATGAAAA AGACTAGGAT TACAGCTCTC AACAAAGCAC TTTTTGCAAC GTACTTTTGC AATGCCGTAT CGGTAACACT ACCAGTCATT CTCATGCCTT TGATTGCTGC CGAGCAGACT TCTCTGGCTG GCTCTTCGCT CGCAACTGCC GCATTTGTGG TATCAACTGC ATCCGTTTCT ACCTTGGGCG GGGGTTTCGG CAAGTTCATC AACGGGTTTG TGTGTTAGGC ATTGGGCGGC CGAGTGTCGG CTTCGCTGTA CCTTACAGCC ATGGCAGGCT TCCATTTGTG GTTGTCTTTT AACAAGACAG GCCCCATATT TGGATGGATT CTTGCTGGTC TGGACTTTTG CGCTTCAATT CAATGGACAG CATGCTCCCT CATTTTGGCA AATCACTACG ACACCAGTCC TGCCGAATTT GCAGCTGGGG TTACTGTTCT GAGTTTGGCC AGCACGTTCG GAGTTCTTTT CTCCAAAATA GGAGGAATAG TATTGCTCCA GTATGTATCA TCCTGGAGTA TTGTTGCTCG AGTTGGAGCG GTAGTGGCTG TGGTCGGAGC AATCCTCGTC CGCTCTTTGG TTACCGAAAT GCCACTCCAG GCCGGAGGAA TTACTCCATC GTCGATCAAA CGGTTTAACA TTAGAGGAGT TGTGCGGTCT CTAGGCAATG TTTTGGGAAG CAGAATATTT TGGTTGGTGG GATTGGCACA CGCAACCACC TTCTTGGCTC GCACCAGCGA TCGCGTGTTG GGGTCATTCT TTTTAGAGTC TACTTCTCTT CCTCGCCATC TGTGCGGGGG TCTTACGGCT AGCGTGACCC TCGGTTTTGT TCATGGTCTG GGTAAGGGGA AAATGTTTTA CAGCCTGAAG GATACGCAGT CCAAGACAAG ATTGTTACGG AAGAACTACG CCAAGGCGAC GTTATCTTGC CTGGCTTTGG CTTTGCTGGC GAATCAGAAG GTAGCTACTG TCTTGTTCCC CTCCAAGTAT GTTATTGCGG GGTTGGTCGC ATTGCTAACA GGAGTCATGG CGTCGTCGCT CTCCTTTCAA TTTTACCAGA TCCCGCCTAT GACGTCTAAG ATGTTTGGTG AAGACAAAGC GGTATGTCTT TCGTTTCTGG ACGGCATGGG TTTCTTCTTG TCAGCTCCCA TATGGGCTGT TACAAGCCAA ATTGTTGGAG GTCTTGGAAT TTATGGGTGG TCAACTGCCT GGGTGATGTT GGCCTTTCTG TTCGGCTCGG GAGGGGCGCT GATGCTAAGA ACGCTCCCGC AAGTCCTTGA TGAACAACGG TAA
|
Protein sequence | MTEPEHPLPP STTDDYVANG EFSASSMNAD DASMVASDAA ALHLPESSEV LMKKTRITAL NKALFATYFC NAVSVTLPVI LMPLIAAEQT SLAGSSLATA AFVALGGRVS ASLYLTAMAG FHLWLSFNKT GPIFGWILAG LDFCASIQWT ACSLILANHY DTSPAEFAAG VTVLSLASTF GVLFSKIGGI VLLQYVSSWS IVARVGAVVA VVGAILVRSL VTEMPLQAGG ITPSSIKRFN IRGVVRSLGN VLGSRIFWLV GLAHATTFLA RTSDRVLGSF FLESTSLPRH LCGGLTASVT LGFVHGLGKG KMFYSLKDTQ SKTRLLRKNY AKATLSCLAL ALLANQKVAT VLFPSKYVIA GLVALLTGVM ASSLSFQFYQ IPPMTSKMFG EDKAVCLSFL DGMGFFLSAP IWAVTSQIVG GLGIYGWSTA WVMLAFLFGS GGALMLRTLP QVLDEQR
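
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any calculation. As a hedged illustration of how a figure like the GC content field could be derived, the sketch below strips the whitespace and counts G and C bases. The helper names are hypothetical, the short input is only the first two blocks of the gene sequence, and the database may compute or round its value differently.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence in this record.
example = clean_sequence("ATGACCGAGC CGGAGCATCC")

print("bases:", len(example))
print("GC percent:", round(gc_percent(example), 1))
```

Passing the full gene sequence through `clean_sequence` first would also allow its length to be compared with the Gene Length field.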