Gene Information |
Locus tag | PHATRDRAFT_47247 |
Symbol | |
ID | 7202300 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 59062 |
End bp | 60699 |
Gene Length | 1638 bp |
Protein Length | 467 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181476 |
Protein GI | 219122280 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
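The coordinate fields above agree with one another if Start bp and End bp are read as 1-based, inclusive positions (an assumption; the record does not state its convention). The short Python sketch below checks that arithmetic. The function names are illustrative and are not part of the source database. It also shows that a 467 aa protein plus one stop codon accounts for 1404 bp, so the 1638 bp gene sequence presumably includes untranslated bases on either side of the coding region.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values copied from the Gene Information fields of this record.
gene_len = span_length(59062, 60699)
cds_len = coding_length(467)

print(gene_len)            # 1638, matches the reported Gene Length
print(cds_len)             # 1404
print(gene_len - cds_len)  # 234 bases not accounted for by the coding region
```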
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000828474 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCGAAGTCAG ACAACCTCCG GGATGAACTT CAAATACTAA CTGTAACAAG GGATACTCCA CTGCCTTTTT GCGAAACGAA GACATACCAT GCAACCATGA ACCACTAGAA AGCCAGCCTC TTGTGAACTC TGATAGTCAA AGTCATTGAA AATAGCGCAA GGGCGTAATA TGGGCGAGCT TGTGGCTCGG CCCCCCATTC ATACGGCGTC GTATGATCGC CTGCAGCCAG ACCGTCTATC CACAATTCTG CTTTTCCACG AAGACGATCC CGTCAACTAT CGCTATAAAG TTGTAGTCGT GGACCGAATG CTGCTTCCCT CCACCCCCCT TCAATACGGG ACTGCCGCAT TTCTCATTCC AGCTGGACGG GAAGCTGAAT ACATCTTTGC TTCGGAGATT GGATTAAAAT CAATAGCGGA ATCTGCCAGC ACGGCCCGAC TGATCGCAAT CAGTTTTGGA AGACATCATA GGTTCGGCTC GCAGATAATC GTTCAGGAAG AACTGTCGTT CGTCGTACAG GTTCTCTCTC GCCAAGGCAC CTTTCTACCG AAACCACACC AGGAGCTGCT TGCCGAGGTT GAAATCCCAT TCATGGCTGT AGATGGGATT GGAAATCGCC ATATTGTTGC AGAAGGAGAG AGTCAAATTT CAGGCAAGTA TCTTGTTGAG CAAGTCGATG TTGATGGAAT GCAGGTCCGT CGACTATATT TTGCGAACAA TCCTTTTGTA ATTCAGTCAG AAGCTGTGCT GCGCGATCAG GGTCGGGTTG ATAAGAGCTG TTCTGCGTTT GACTATCACA AAACTATGGC TGCCGGTATT CTGGCACTGG TTGATTCGGA TGTTTTGACG CACGGTCTTC TGGTTGGTCT GGGGGGTGGT TGTTTCGTCA ACTTGATTGG GCATCTCCTG AATGATCTTG AATTGTCGGT AGTCGAACTT GACCCTGCTA TACTGAAAAT TGCCGAAGAG CACTTTGATC TAGATCTGGA GAGCAACCGC TTGGACATCC GGGTGGGCGA CGGCCTAGAG ATCATGCCAC TTACCCATGA CGCCGTTTCG GGTTGTCCGA CAACCTTCGC AAAAGAAAGT ATGGCATTTG TTGCCATTGA TGTTGATTCG AAGGACCAAT CAGCTGGTAT GTCATGTCCG CCAGAATCTT TCGTAGAAAT TGAGTATTTG TCGAGGTTGG CTGAGCTTAT CCACCCACAT GGAGTTCTAG TCATGAATAT ATCTGCTCGA GATCCTGAAA AGCTAGATCA TGTTTGTCGA AGAGTGCAAC AAGTTTTTCG AAACGTTGCG CTGGCAGCAC CTCACGACGG AGAGGGCAAT GGGAAAAAGA AAGATATAAA TATGGTTCTA TTTGGGAAGC ACGCCGTCAT GGAAGTGTCA ACATCAAAGC TTTGTCCACT GGTGGAACCT CATACTACCT ATGAATCAGG GCCTGACCTA AAGAGGGCAT TGGCAAATCT TGTTGCATGG AACGAAACAA AATCAATTGA CACTGGTTCA TCAAACAAGA TTGGGCGAAC CAAGACCACT CGACAGAAGA AAAAGCGTGG AAAGAGAAAA TAAAACTGCG CTGAATCCAC CAAGAAGTCG AGTTTGGCCG CAAACAAAAT AGCGCAAAAT TATTTTAT
|
Protein sequence | MGELVARPPI HTASYDRLQP DRLSTILLFH EDDPVNYRYK VVVVDRMLLP STPLQYGTAA FLIPAGREAE YIFASEIGLK SIAESASTAR LIAISFGRHH RFGSQIIVQE ELSFVVQVLS RQGTFLPKPH QELLAEVEIP FMAVDGIGNR HIVAEGESQI SGKYLVEQVD VDGMQVRRLY FANNPFVIQS EAVLRDQGRV DKSCSAFDYH KTMAAGILAL VDSDVLTHGL LVGLGGGCFV NLIGHLLNDL ELSVVELDPA ILKIAEEHFD LDLESNRLDI RVGDGLEIMP LTHDAVSGCP TTFAKESMAF VAIDVDSKDQ SAGMSCPPES FVEIEYLSRL AELIHPHGVL VMNISARDPE KLDHVCRRVQ QVFRNVALAA PHDGEGNGKK KDINMVLFGK HAVMEVSTSK LCPLVEPHTT YESGPDLKRA LANLVAWNET KSIDTGSSNK IGRTKTTRQK KKRGKRK
|
| |
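The sequences above are printed in space-separated groups of 10, so they need to be cleaned before any computation. The Python sketch below strips the whitespace, computes GC content, and translates an open reading frame. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field of this record is blank. All function names are illustrative. The example input is an excerpt of four consecutive groups copied from the Gene sequence row, chosen because it contains the ATG at which the listed protein begins.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a grouped sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
_CODONS = [a + b + c for a in _BASES for b in _BASES for c in _BASES]
CODON_TABLE = dict(zip(_CODONS, _AMINO))


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate from the first base in frame; unknown codons become 'X'."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# Four consecutive 10-base groups copied from the Gene sequence row.
excerpt = clean_sequence("GGGCGTAATA TGGGCGAGCT TGTGGCTCGG CCCCCCATTC")
start = excerpt.find("ATG")
print(translate(excerpt[start:]))  # MGELVARPPI, the first ten residues listed
```

Applying `clean_sequence` and `gc_content` to the full Gene sequence row gives a value to compare against the reported GC content field.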