Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47533 |
Symbol | |
ID | 7202759 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 37171 |
End bp | 38398 |
Gene Length | 1228 bp |
Protein Length | 384 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181989 |
Protein GI | 219123350 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACCA ATTCGCAGAG GAGCAAACAA TCTCCAATCA GTTTTCGATT TCTAATACTC ATCGTTGTGT TTGATATCAC ATGCTTCATC CTTGCCGTTC GTGGCCTTCA CAAAATGGAA GGCTCCTACA ACGACTCTCG TAAAGCACAC GTCGTGCCAG TTTTCCGTCA AAGGATAAAC GATACTGGCG ATTGCACAGA CAAAGAGCTA CTACTTCAAA TTCTTGCCGA CGCTCTGAAA AACGCGTCTG CTACAGAAGA CCAAACATAC GGCAACTGCT CCGCTTTGCC CGCTTGGCAA GAAGTGATCA AGTTGTACGG GTCCAAACCT GTGATCTTGG GTCTGGAGCA CTGCGCAGCT TTTCGCAACA ATGTCACCCT ACAGGACCCG CTGGGTGGTC TGAGAGTCGC CGGGTTTTAC AATTCGGGAA CCAACGCTTT GGAACAAACA CTTTTGAGAA ATTTGAACAA CGCGGATACT GATGGCCGAC AAGAGCTTCC GACCGTAGTA CCTTGGTCTA AGCACAGACC GCTGTGGACA GCCAAAGAAT CATATTTTCT TGAACATCGC CATGTTCTAC CTGTCGTCGT GGTTCGCGAT CCGTATCGAT GGATGCAGTC CATGGTAAGT GAAAGTTGTG TTTTGGTTTT CCCAAAAGAA ACGGTGTCGC TGCTTTTTCT CACTTTCCTT TGGTCAGTGC AAAACCCGGT ACGATTTGTT TTGGCAAAGG AACCTCAGAC TTAACGGAAT TGAACACTGT CCCAATCTTG TCCCTTCTAG CCAAGACGAA CAGAATTTCA ACCAAAATCG TTCCACTTTT GCGGTGTGGT TGCGCAATCC TATTCAAAAT AGCACGCACG ATTCGTTAGC GGCCTTGTGG TCTAGATGGA ACGGTGCGTA TTTGAATACG AGTATTCCAC GACTAATTGT ACGGATGGAA GACTTGATTT TTCACGGGCC GGAAATGGTG CAAAAATTAA GTGAGTGTGT CGGCGTTGAC CGGACCGATC CTTATGTCTT CCTTACCGAA GCTGCCAAGT CCCACGGACG GTCAGCGGAT TTGGCGACCG CCATGATCAA GTACGGTCGG CGGGATGGCC GCTATGCTGG AATGACGACG CTAGACTTGG CGTACGCAAG GCATGCTTTG TCAGGCGATC TCATGCAAGC ACTACGTTAC GAATACGATG ATTTTTCGCT GGACGCAAGT CCAAGAATTC TGTGGTAA
|
Protein sequence | MNTNSQRSKQ SPISFRFLIL IVVFDITCFI LAVRGLHKME GSYNDSRKAH VVPVFRQRIN DTGDCTDKEL LLQILADALK NASATEDQTY GNCSALPAWQ EVIKLYGSKP VILGLEHCAA FRNNVTLQDP LGGLRVAGFY NSGTNALEQT LLRNLNNADT DGRQELPTVV PWSKHRPLWT AKESYFLEHR HVLPVVVVRD PYRWMQSMCK TRYDLFWQRN LRLNGIEHCP NLVPSSQDEQ NFNQNRSTFA VWLRNPIQNS THDSLAALWS RWNGAYLNTS IPRLIVRMED LIFHGPEMVQ KLSECVGVDR TDPYVFLTEA AKSHGRSADL ATAMIKYGRR DGRYAGMTTL DLAYARHALS GDLMQALRYE YDDFSLDASP RILW
|
| |