Gene Information |
Locus tag | PHATR_33150 |
Symbol | |
ID | 7204270 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 187832 |
End bp | 189168 |
Gene Length | 1337 bp |
Protein Length | 362 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186295 |
Protein GI | 219113423 |
COG category | |
COG ID | |
TIGRFAM ID | |
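
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive, that the coding sequence is the protein plus one stop codon, and that the gene span runs from start codon to stop codon with no untranslated regions. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 187832
end_bp = 189168
protein_length_aa = 362

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the coding sequence is the protein plus one stop codon.
cds_length_bp = (protein_length_aa + 1) * 3

# Assumption: the gene span has no UTRs, so the remainder is intronic.
intron_total_bp = gene_length_bp - cds_length_bp

print(gene_length_bp, cds_length_bp, intron_total_bp)
```

Under these assumptions the computed span matches the listed gene length of 1337 bp, and the gap between the span and the coding length is consistent with the gene being marked as spliced.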
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.461728 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGTCAA CTAAGGGAGT TGCCGGTCTC TTTGCACTTA CGGCTTCTGC TGCCTTCCTA TCCATTGCAG GAATCTTTAC TGCTCAGGGA ACAGTCCAGA ACATTCAAGG TACCACTGCC CAGCTCAAGG ATTCACTAAT GAATATGGAC ACTTTATTTC GGTCCGAGGC CAATTTCCAG ATTATTTCCG CTCATGATCC GTTCGCTTTC AGCTCCAACA AATCCGCAGA GCAATACATT GAAACATCGG ATGATAGCAT GGCGGAATAT CAGACAGGGC CCATTCGAAG CGATACTAGA CTGGAAAGGA ACGGTAAAGA GCCGGAAAAA GACTTTCGCA AACCAGTCTG GAGTAACTCT TCTGATTATA GCATCGACTC CCCGACAATT CTTGTCCAGC TTGGGGGTGA GCTAGCAAAT AATCTGGGCC ATATGGCTTG AGGGTTTGGA CTGGCATGGT GGCTGGAGCG GGAATTTGGT TTAAACGCCA CCGTCATGCT GAGACACGGA GTGATACATG CTAAGTGGAC GAGCGCAGAA GGGATGCGAC ACACTGTTTT CCATACTTGC AAGACTTCAA TTTTTCGGCC GGAAATACTG ACAGTATCAC TAAGGAGTTG CAGTCTTTGA AACAGCGTGG GCCAGACAGA ATCATTGCCG AGAAGAGAGT TTTTCAAATA GACAAGAAGG GACCGCTTGA CCAAGGTTTG AAATCTTTTG TGAGTCTTTA CGCTGCCAAT CACACAAACA TGGGGCAAAG GAATGGTTAC ATCACCATTC CTTTCCTAAC TACACATCAA ATGAGCGCAA AAGACCTCAT TGTGGACAAG TATTATAACG ACATTCGCAG AATATACCAT TTTGATAAAA GCTGCTGCAT GGATGTACCC AACCCGGATG AATCCGTTTT TGTAAGTGGT ATTTCAGCTA ACAAGCATGT ACAGGAAAAA GATTTCGCAA GACTTTCTCA CTGAGTCTTT TTGTATTTCA AGCATTTTCG CAATTTTATT AGGGAAAGTG GTAGACTACG ACATCGTGCA GGCTACGAAG AGCTAGCTCC TGAGCAAGTG GCAAATGAGT TGTTTGCGCA TTTGAATCCT GGCGACAAAG TAGCTATTGC ATCACGATTC TCCGATGATT TTCGAACGCA AATGATTGTG GACGCTCTTG AGAAGCGACA GCTTCGGGTT CGAGTCACGG AGCCACGATC GGGGGTTGCG GATTTTTGCT TCTTGCTGTA TGCCCAAAAG GAGTTGGTTG GCACGGCCAA ATCTCTTTTT TTATTTGGGC TGGCCTACTT GGAAATGCCA CAAGGGTTCG ACCTTATACA GCAATGA
|
Protein sequence | MASTKGVAGL FALTASAAFL SIAGIFTAQG TVQNIQGTTA QLKDSLMNMD TLFRSEANFQ IISAHDPFAF SSNKSAEQYI ETSDDSMAEY QTGPIRSDTR LERNGKEPEK DFRKPVWSNS SDYSIDSPTI LVQLGDFNFS AGNTDSITKE LQSLKQRGPD RIIAEKRVFQ IDKKGPLDQG LKSFVSLYAA NHTNMGQRNG YITIPFLTTH QMSAKDLIVD KYYNDIRRIY HFDKSCCMDV PNPDESVFHF RNFIRESGRL RHRAGYEELA PEQVANELFA HLNPGDKVAI ASRFSDDFRT QMIVDALEKR QLRVRVTEPR SGVADFCFLL YAQKELVGTA KSLFLFGLAY LEMPQGFDLI QQ