Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38736 |
Symbol | |
ID | 7203786 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 86980 |
End bp | 88179 |
Gene Length | 1200 bp |
Protein Length | 361 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182768 |
Protein GI | 219124979 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCATTA TGAGCTCGTG GAGCGGGCAC CGCTTCTTTG GATTCGGGGT AACAGTTAAG CCTTTCTGCT CTGCCACGGG CGCTCTGGGT CGCCAACTTC GAGTACCCCG ATCTATGCTG CAGGTGGAAA TCGATGCAGA GTGTCTGTCG TCCGTCCAAT GACCCACTTC GAAGATGGCA TAGGAATCAG GCCTGATAGT TACATTTACT GTTACTGTTA CTGTTACCGT TAGTTAGTTT TTTACTGTCA GCACAACGTC CGTCCAACAA CGGGACCTCT TTGGGTTCAG ACATTGGCAC TTCTCGTGTT ACGTCAGTCT CGAGACCATG GTCCCACTGG TTTCGTCGAT CCGTCGTTAT GGCGGGCGAT GCATGCCCTA CATCTTTCTG GTTCCCAAGG AGGTTGCATG CACCATCGGT TTCTTTGGAA CGCACGGCTT GATCATGAGA CTCTTTAATG CCATTTTAAT GCTGTGTGTA TTAGCGGACA CATCGAGTTG TTTCGTGGCT TTTACTTCAG TCGCTGCACC AGAACGCCGT GTTCCTTCCG CCTTGTCGCT ACACAAATCG GTGGAGACTC CTGAATCTGC AGCGGAAAAA TCGATGGTCT TTCAACACCA AACTATCACG CGCAAAGGAA TGCTCGTTCG TTCCACTGCT GTATTATCGT CGATACTATC CAACCCTTTT ACAGCCAACG CTGACACTCC TACCAGCATT CAGTCGTGTC CCAAGGGCAC AGGAAATTCC AAAGTCAATT GTGCTTCGAC GGCCGCCGTG CGTCAAGTCG ACAACTATAT CGAGCCGTGG ACGTACCCGT CATCCATGCC CGTAAATGAG GTTATTGCTC GCCTCAAAGG AGCCGTCAGT ACAGATATAC ACAATACGAT CGTCGAACAA AACGAAACTT ATCTCAAAGT AGAGGCCGTC CGCAATTTTT CTACCGATAT GGTAGAGTTC CTAGTAAATC CAGAAGATCA CATTGTCACA TTCATTTCCC GACAGACGGA TGGTCCGGAT TTTGGAGACT TTGGCGCTAA TAGGAAGCGA TTAAACGAGA TTCGCAGAAA AGCCCGCGTC TTTGACGTCA TGGGTGGGCA AGGGTACGCC CGTGAATCCG CGCTTGGCCA GCTCAAAGCG TTCTACGGAT TACAAAGTGG CGCCGGCTTT GAAGACGTCA TTCTTGACTT GGACAACTAA
|
Protein sequence | MVIMSSWSGH RFFGFGVTVK PFCSATGALG RQLRVPRSML QFFTVSTTSV QQRDLFGFRH WHFSCYVSLE TMVPLVSSIR RYGGRCMPYI FLVPKEVACT IGFFGTHGLI MRLFNAILML CVLADTSSCF VAFTSVAAPE RRVPSALSLH KSVETPESAA EKSMVFQHQT ITRKGMLVRS TAVLSSILSN PFTANADTPT SIQSCPKGTG NSKVNCASTA AVRQVDNYIE PWTYPSSMPV NEVIARLKGA VSTDIHNTIV EQNETYLKVE AVRNFSTDMV EFLVNPEDHI VTFISRQTDG PDFGDFGANR KRLNEIRRKA RVFDVMGGQG YARESALGQL KAFYGLQSGA GFEDVILDLD N
|
| |