Gene Information |
Locus tag | PHATRDRAFT_43656 |
Symbol | |
ID | 7197365 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 1115941 |
End bp | 1117344 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177755 |
Protein GI | 219112007 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
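The coordinates and lengths in the table above are mutually consistent. The sketch below reproduces the arithmetic, assuming 1-based inclusive coordinates (the record does not state its convention) and a terminal stop codon that is not counted in the protein length. The helper names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of the record.
length = gene_length_bp(1115941, 1117344)
print(length)                     # 1404
print(protein_length_aa(length))  # 467
```

Both results match the Gene Length (1404 bp) and Protein Length (467 aa) fields.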
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCACGT CGAAGTTTTC CGTCTCCCCC GGGCTTGTGG CGGAAAGCAG TGACATCCGC GGCTACGAGA ACTTTCCGCA AGGATGCGCG TTTTCACCTG ACGGTCTATG TGTGTTAACG AGCACCGCGG GCGATTCCCA ACTGCGACTG TACAATACGC CCCCACCCCC GGAGGAGGGG TGCCCGGACG ATCAGCAAAA CGCAGCAATA CCTGATCTAA CGACGCCATG GCAAGCAGCT CTAATAAGTC AAGGTACCGA CACAGTCCGC TCCTATGAAT GGTATCCTAC CATGGCCTCC AACAACCCCG CGTCGTGCTG TTTTCTAGCC ACGTGTCGCG ATCAGCCGAT TCATTTGTAC GACGCCTACA CGGGAGTTGT CCGGGCCACC TACCGTCCCT ACAATGCACT CGACGAACTG GAATCGCCGA CGGTTCTGTC GTTTCGTCCC GACGGCCAAA GAATTGTGGC GGGAGGCTTT CGCACCGACC GTGTCCTGCA CGTCTTCGAT ACGGCCATTC CGGGTCGGGA AAGCACCACC CTGCGACTCG GAAAAACGCG AAGATCTCGG GACGGAGCCA AAGGACTCGT GTCGGCACTG GCGTGGGGCG GAGACAGCGG AAACGGGAGA CTCCTCTGCG TAGGCACCTA TGCGCCTGGC TCGATTTACG TGTACGACGA TCGGACCGGC ACCACACCCA GTGGAGCCAT TCTGGATGGA GTTTGTGTGG TAGGGCACGG TCGTAGTCAC TCCAAGAAGA AGAGGAGGTT TGCGGTCGTC CAGAACGAGA ACAGTGACGA AGATAATGAG GATTCTAAAA CATGGTTGAG TGCCGCCCGG GTCAAGTGGT TCCAACAGCG AGCGCAAGGG GGTGTCACGC AGCTTGCATT CGCGCCACAC AATCACAACT ACACTCTCTT CAGTACGAGC CGACGAGGGA ATGCTGTTCT GGTCTGGGAT TTGCGGATGC TATCGTCGCA GCCTGACTAC CAATCAACTC CGGTGCGGGG CTTAGGGAGC TTTGCTACAG ATAATCAAAC GAACCAACGA ATTGAGTTTG CGCTGGACGA ATCGGGACAA ACTATTTTTG TAGGAGGCAT AAAGAGGTGT GTACGTATAT ATGATGTAGC TTCCGGAGAC TGTACGGGAA CGGTGGATGG ACTTGATGAC GTGGCGAACG GAGTGTCCTT TACCAATTCC AGTCGAGGAG GACTTCTCGC CATCGCCACT GGATGCCGGC GGTTCCCTTC CGAAGAAGAT TTCGATAACG ATACTTCAAT TACCGACGGT ACGGGGGATG CAAAACCAGG GTTTTTACGC GTGTATCGAA TCCCGAAACC TGCAGATTTG GAGGATGAAG CATTGACGGA AAAGTCAGAG CAATTAGATG ACGGGTCCAT CTGA
|
Protein sequence | MVTSKFSVSP GLVAESSDIR GYENFPQGCA FSPDGLCVLT STAGDSQLRL YNTPPPPEEG CPDDQQNAAI PDLTTPWQAA LISQGTDTVR SYEWYPTMAS NNPASCCFLA TCRDQPIHLY DAYTGVVRAT YRPYNALDEL ESPTVLSFRP DGQRIVAGGF RTDRVLHVFD TAIPGRESTT LRLGKTRRSR DGAKGLVSAL AWGGDSGNGR LLCVGTYAPG SIYVYDDRTG TTPSGAILDG VCVVGHGRSH SKKKRRFAVV QNENSDEDNE DSKTWLSAAR VKWFQQRAQG GVTQLAFAPH NHNYTLFSTS RRGNAVLVWD LRMLSSQPDY QSTPVRGLGS FATDNQTNQR IEFALDESGQ TIFVGGIKRC VRIYDVASGD CTGTVDGLDD VANGVSFTNS SRGGLLAIAT GCRRFPSEED FDNDTSITDG TGDAKPGFLR VYRIPKPADL EDEALTEKSE QLDDGSI
|
| |
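The sequences above are shown in blocks of ten characters, so the spaces must be stripped before any analysis. The sketch below shows one way to do that, translate a coding sequence, and compute GC content. The Translation table field of this record is empty, so the standard genetic code (NCBI table 1) is an assumption here, and the function names are hypothetical.

```python
from itertools import product

# Standard genetic code (assumed), codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record.
print(translate("ATGGTCACGT CGAAGTTTTC CGTCTCCCCC"))  # MVTSKFSVSP
```

The printed residues match the first block of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be checked the same way.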