Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40023 |
Symbol | |
ID | 7195497 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | - |
Start bp | 620905 |
End bp | 622493 |
Gene Length | 1589 bp |
Protein Length | 522 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184033 |
Protein GI | 219127626 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCAGA TGAAAGCTGG GGCAATTGCA AGTCGCAGAC GGCGCCATCG AGCGCCATCA TTCCTTGTCC CCCTTCTGGG TGGGTTGGTC ACGTATAGCT TGCACATCAA TTCTTCATTT TCTTTCACAT GGCAGCATAA CGCTGATTCA CAGGAATTCT CCATGCAGCT TGCCACAGTC AATTCTACCG CAGCGAAACA ATTACAATCG GCTCAGATAC ATAACATCAC ACGCAGTGGA CTTCAGTATG GCACAGCGAC CGGACAAAAT CATTCGAGAC ACAGCAAAAC AGCAGAATCG CCTTTTCAGT CTTTGCAAAC GTCACTAGAA TCTCAGCGAC TTACACCTAT CGCTACCAGC TCAACTGTTA GGGGCAATCG CTCCGTCACT GATCTTCCAT ACGCAGATGC TCGCGACGAA GATGGTTCCT GGGGGTACAT CGCCGACGCA ACCCAGGTGA GGAGTCGAGT CCTGGCGCTT CTACCCTCAA ACCATACACT GCACAACAAT GTCACCAGTT TCATACCCAT GACGGAATCT GAACAAGAAG AAATATGCCA AAAGCCACCC GGAAGCGGAC CGGAGCAAGA ATTGGGCTGG AAACTGATGC AGCGTGTTGT CGTCAATGCG CCCGAGCCGA GGTACGCCAA CGAGTCTGCA GTCATTGTCG CCACAATCTG CAGTCATCGT CACCAAAGAG TCTCCAAGCA GTCATCGTCA CCAACTCGTC CTCCATCTCA GTAAGCCACC ACACAGAAAC AGCAGCACCC AAAATTCTTT GTGTCGTCTA CACGTATGAT GCTCATCACG ATCGAGTTGC GGCGATTGGT GATACCTGGG GTTGGCGCTG TGACGGCTTT TTGGCCGCCT CCAACCGAAC TGTTCCGGAG CTTGGGGCTG TAGATTTGCC CCACGTTGGA CCCGAAGCTT ACGGCAATAT GTGGCAAAAG ACGCGTTCTA TATTGGCGTA CGTGCACGAA CACTACATTG CGGAGTACGA CTATGTGCAT GTGGCAGGAG ACGACACGTA CGTGATTGTG GAAAATTTGA GAAATTACTT GGAGTTTACG GTAGAGGCAA AACACGGTCG TGGCAAAGTA CCATTGTATT TGGGTCAGCG TGTTTTTTCT GGAGGTGGTT ATACATTTGT TGGCGGCGGG CCGGGGTATA TTTTAAATCG CTTGGCCTTG CAGCGTTTCA TTAAAGAGGC TCTGTCAGCA TGTCTGGCTA ATCAGCAGGA AGCAGCCGAA GACCGTTCGC TTGGATATTG CTTCAAAACC TTGGAAATTA CTACGGAAGA TACGGCGGAT GCATTTCATC GGCAAAGATT TCACGGTGTG GATCCATACT TTTTGGCGAC AAAGAATCCA CAGAAAGGCT TCTGGAAACG GTTATACAAG TTTTGGGCCC GCGAACATGG GTACAAATGG GGCATTGGCT TGGTGTCACC ACAAACTGTA ACTTTTCATC TCATAAAGTC GCCGATTTGG ATGAAGCGCA TGCATGCTAT GCTCTACCAT GCCTGTCCGA CGGGTACGGC AATGGGTGAT CTCCTTCCCA GACCTACGAA GCTGGCGAAT ATTTCCTGA
|
Protein sequence | MTQMKAGAIA SRRRRHRAPS FLVPLLGGLV TYSLHINSSF SFTWQHNADS QEFSMQLATV NSTAAKQLQS AQIHNITRSG LQYGTATGQN HSRHSKTAES PFQSLQTSLE SQRLTPIATS STVRGNRSVT DLPYADARDE DGSWGYIADA TQVRSRVLAL LPSNHTLHNN VTSFIPMTES EQEEICQKPP GSGPEQELGW KLMQRVVVNA PEPSHCRHNL QSSSPKSLQA VIVTNSSSIS VSHHTETAAP KILCVVYTYD AHHDRVAAIG DTWGWRCDGF LAASNRTVPE LGAVDLPHVG PEAYGNMWQK TRSILAYVHE HYIAEYDYVH VAGDDTYVIV ENLRNYLEFT VEAKHGRGKV PLYLGQRVFS GGGYTFVGGG PGYILNRLAL QRFIKEALSA CLANQQEAAE DRSLGYCFKT LEITTEDTAD AFHRQRFHGV DPYFLATKNP QKGFWKRLYK FWAREHGYKW GIGLVSPQTV TFHLIKSPIW MKRMHAMLYH ACPTGTAMGD LLPRPTKLAN IS
|
| |