Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40662 |
Symbol | |
ID | 7198578 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | + |
Start bp | 89887 |
End bp | 91221 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184644 |
Protein GI | 219128910 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGTGTA CTACTTCAGG AATAGACGAT ACATTATTGA GAAGTAGAAA TAGCACATTG GATCTGTGGC TAGACTGCAA CGCACTCGAG GGCGTCAATT TCGCCGATGC GCTGCCTTTC CTGCTAGCCT TGAGGCAGAA CACGCACGTG ACCGACGTAA ATCTGGCTAT CGTTCGATCA CCTCATGCAG CGGAGACGAT ATTGGCCGTC AGTCAGATTC CCAATCTGCA AAGCTTAAAG GTCTCATCCA TCTTTGACTG TTCCGGGGAT TTCGGGCTGT CGCTACCGAC ACTGACACAG GCTCTGGGTC AGGCACGACG ACTGGAAGCT CTATCTCTGG ACTGGATTGG CATCTTGGGA GATGCCGACA ACAATACGCT GCGGGAACAA CATACCGCAT TAGAGCGGAC ACTAGAGTCG CACCCAAGAC TCTGTGAGAT TGTTTTGACC AATTTTTACT TTCCGAGCCA GACGAGTCCA AATGATTGGA TTCAGGCGGA TCGATTCGTG CGTGTCATCC TCGCATTGCC CAACTTAACT ACGTTACGAA TGGATGCCGT CACAAGATTC TACGGGCGAC CACTTTTGAC CTCGACTACG ATCAAGCTCT TGTTTTCCCA CGACTGCCTC CAGACGATTC TTCTCAAAAA CATATGTGTG TGGAGCACCC GGTGCGACCC GCAGGTCAGC AAGGCGTTGC GAAGGAACGA GCGCCTACTG GAACTATCAC TTACATCATG TTACCTGGCT AGTCACGGTA GTGTTCTGGC TGGTTTGGAC GAGAATCGAT CAGTCCGGGA TCTAGACGTG AGCGACTCGT CCCTGTTATT GGAAGCGCTT TCGTTGGGTC GCGCCCTCGG AATGAATCGC GGGTTGCAAA AGTTGACACT CTGTCGGAGT CGTCTGGATG TGAACCCTGC GACCCATCAC GACTATGCGC TCGCTCTGTT GCAAGCTCTG GCGAATCATC CCACAGTCAA GCAATTTCGG ATGAGTATCC TGTACGATTG CACGCTCCTT GCCGAACGTC CGTCCTTTCA ATCTGTGGAG GAAGTTTTAC AAACAGCATT GCGCGTGCTA GAATCAAATC ATGTACTACA AGAGTTGTGC TTGGATGGAA TGTGCCAAGA TGAACCATTT TGGGCATTGT CGGAAGCTAT TCGGCTACGC TTGGGTCTAA ACAAAGCTGG ATTTTGGGGA CTATCGCAGA CTACAAGTAA AGCGTCGGAA TGGGCTGACG CCTTGGGTGC CGTACGATAT GATGTGGGAT GTCTGTACCA TGTGCTCCGC GACAATCCTC TCTTGGTAGC GACGACAGCG GTGTCGGTCA AGTAA
|
Protein sequence | MECTTSGIDD TLLRSRNSTL DLWLDCNALE GVNFADALPF LLALRQNTHV TDVNLAIVRS PHAAETILAV SQIPNLQSLK VSSIFDCSGD FGLSLPTLTQ ALGQARRLEA LSLDWIGILG DADNNTLREQ HTALERTLES HPRLCEIVLT NFYFPSQTSP NDWIQADRFV RVILALPNLT TLRMDAVTRF YGRPLLTSTT IKLLFSHDCL QTILLKNICV WSTRCDPQVS KALRRNERLL ELSLTSCYLA SHGSVLAGLD ENRSVRDLDV SDSSLLLEAL SLGRALGMNR GLQKLTLCRS RLDVNPATHH DYALALLQAL ANHPTVKQFR MSILYDCTLL AERPSFQSVE EVLQTALRVL ESNHVLQELC LDGMCQDEPF WALSEAIRLR LGLNKAGFWG LSQTTSKASE WADALGAVRY DVGCLYHVLR DNPLLVATTA VSVK
|
| |