Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_36534 |
Symbol | |
ID | 7201842 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 685024 |
End bp | 686336 |
Gene Length | 1313 bp |
Protein Length | 430 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181053 |
Protein GI | 219120637 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.472214 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGACG GTGAGGTCGG TGACAATGTG GCGTTTGGCT TTTGCGTGAG CCTCACTGTC AATGACATCG TTGCCGACTC ATTAATGTAA TGCATCTATC TTTCAGTGTC ATCGTACCTG TGTTGTACCT ATTTTCACAA GGTCGTTTGG TACACTCACT TTCTGTCCGA AAGTGTCCTC GAAATCTTCC TTCGATTGAA TGTTTCCCAT TGCAAACGCT CTCGCCTCCT TTTGAAGACG ATGCCTATCG AAGTATCTGC AACTTCTACC GGGAAAACTA CTCGCGGAAA CGTATTGTCC CGTCGCTTCC CAAATTTAAT TGTACGACCA CCGACGACGT GGTCTGTTTT ATGCGCTGTG AGGGTTCCGA CAGGATTGTC GCCGCCGTGA GGCTCACCCA CGATTCAACG CATCCCGAGT ATGTCTTCAT ACGATCTTTG TGCATCGAGG TAAGCTTACG GAGACAAGGT CTCGGTTCGC AGCTGCTGCA GCGAGCGATG GACAACTTCG GGGCATGGTC CTACTACTGT CTCGCAGATC CCCTGTTGAC ATCATTTTAC GTAGCAGCAG GCTTTACAAA GGCTGACTCC TCGAACGACG TACCTACATC CATCAGTCAG CGCTACGAAT CTATCGCTCG TCGGGTCCAG CGCAAGAATC GAGAGCTGCA TTTTTTTGTT CGAAGACCAC AACCGCCTCC TTGTGCCGTT ATTCTACTGC AACATGTCAA CGAAAATAAT CGAGCCACCG CTACGGGCTG GCTCGTTGAT GACGAAGCCT ATCAAAAAGC GACTGGTACA AATCTGACCA CTTTACAATC CCAACTTACC GTGGCGCGCT GGACATGGTC TGGGCGTGCC GACAACGAAC ACATACAAAA GATGCTGTCC GATCTACCAT CAACACCGAT CTTGTTGTGG GCTAAGAAAG CTGACAAGTC GTATAGAATG GTAGCAAGTG GTGAACATTT TACACCAATC TATATTATAC TGGATGGCAC CTGGCAGGAA GCCCAGTCCA TGTTTCGGAA AATCCCGTCG CTTTGGCAAT TGCCACGTTT GTCGCTTTCC TCCAGCACGC GGTCCTCCTA CGTGTTGCGA CACGACTATA CGGGATGGAA ACAACGGTTT AGCAGTCAAG GTGGAGAGGA TCTACTCTGT ACTGCCGAAG TGATTGCGGC ACTTTTAGAT GAGAGCCTAA ACCGAGAAAG TGGGAACCTA ATTCGGAATC GTTTGCGTTA TTTTCAAGAC AACTTCCCCC AAGTAAGTGC TCGTTCTTTC GACGCTGATG CTGACAGCGG CACCGAAGTT TAA
|
Protein sequence | MADGEVGDNV AFGFCVSLTV NDIVADSLIV IVPVLYLFSQ GRLVHSLSVR KCPRNLPSIE CFPLQTLSPP FEDDAYRSIC NFYRENYSRK RIVPSLPKFN CTTTDDVVCF MRCEGSDRIV AAVRLTHDST HPEYVFIRSL CIEVSLRRQG LGSQLLQRAM DNFGAWSYYC LADPLLTSFY VAAGFTKADS SNDVPTSISQ RYESIARRVQ RKNRELHFFV RRPQPPPCAV ILLQHVNENN RATATGWLVD DEAYQKATGT NLTTLQSQLT VARWTWSGRA DNEHIQKMLS DLPSTPILLW AKKADKSYRM VASGEHFTPI YIILDGTWQE AQSMFRKIPS LWQLPRLSLS SSTRSSYVLR HDYTGWKQRF SSQGGEDLLC TAEVIAALLD ESLNRESGNL IRNRLRYFQD NFPQVSARSF DADADSGTEV
|
| |