Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43688 |
Symbol | |
ID | 7197235 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 1190295 |
End bp | 1191483 |
Gene Length | 1189 bp |
Protein Length | 378 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177774 |
Protein GI | 219112045 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.569479 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCAGTC GACGATCCAT TCCAGCTGCC CTGCTGCTGC TGCTGGTCGC TGGGTTAATA TCAACCATCA ATGGTTTTTC CAGTCGACAA ATGAGGTCGC ATTCTTGCGC AAACTTGACC CAGTGCTGGA GCAAGGGTTT TGGCGGTGAG GGTCGTGCTG GGTTCGGCGC AAAGACACCA CCCGCAATCA AAAAGGCGAA TCAGAGATCG GCCGTGAAGC GAGCGCAAAA ATCGTATGGT GGAGCCTCAG CTCGTGAGAT CGCACAGGCC ACTCAGCAAA AGATTGAGAA TGAAATGATG AATTTACCGC CACACTATCA AATGGCGACA CAGCTATACC AGCAACTTCA AACAAGGAAT GCACACGTCG CAGATCTGAC GGTCTTGGAG CAAGCCGGCT TGAGCCTCCA GGAATTGGAT GGGGCCAAAC GAGCGCAAGA CAAGTTGGAG CGACTGTACC TTGAGTACGA CTTTTCGGAG AATGACTTAC ATAATGTTTT CCAAAGGATA ACTTGGGACG CTTCGGCTGA CGCCAAAGCT GCGAAAGCTA TGCTGGGTGA AATGCCGAAG GAGATCTCGG ACCGCGTGGA CCGTGCATGT AGTTATGTTG CCGATGGCGT ACTAGCTGCT GGTCCATCCG GCCGCTGTTT AGACGTCGGA TGTGGATACG GTGTTTTGGT TCCCCATCTT ATCGAGAGCG GGATTGCGCT CTCTCAGATC TACGGCGTTG ACCTGAGCAC AGAAATGATT CGCAATGCTC GAGAGCAGCA TCGCGGAGCT ACGTTCGAAG CCGCAGACTT TTTAGAAGAA TATCAAGATT TGAACGATGA GGTCGGATTC GACAGTATAA TATTCTGCTG TTCATTGCAT GATCTACCTG ATCTTCCCAG GTCTTTGCGT AAAGCTGCAT CTCTACTACG CTCTCAAGGG AATCTGATAG TTGTTCACCC ACAAGGTGCA TCACACGTGA CCAAGCAAAT GAAGTCCAAC CCTGTCATGG TGAAAAGAGG TTTGCCAAAC GCGGAGGAGC TGCGTGCTAT GAAACTTGAA GGGCTTGAAT TGCAAATCGA GCCTACCAAA GAGGGCTCAC GAGAAGAGCT AGAAAGAGGC TATCTAGCGG TTTTTCGCAT AATATAATGT CACTTGCGCT ACGGCTAGAG AATGTAGATA GGATGAAAAG TTTTTATGG
|
Protein sequence | MASRRSIPAA LLLLLVAGLI STINGFSSRQ MRSHSCANLT QCWSKGFGGE GRAGFGAKTP PAIKKANQRS AVKRAQKSYG GASAREIAQA TQQKIENEMM NLPPHYQMAT QLYQQLQTRN AHVADLTVLE QAGLSLQELD GAKRAQDKLE RLYLEYDFSE NDLHNVFQRI TWDASADAKA AKAMLGEMPK EISDRVDRAC SYVADGVLAA GPSGRCLDVG CGYGVLVPHL IESGIALSQI YGVDLSTEMI RNAREQHRGA TFEAADFLEE YQDLNDEVGF DSIIFCCSLH DLPDLPRSLR KAASLLRSQG NLIVVHPQGA SHVTKQMKSN PVMVKRGLPN AEELRAMKLE GLELQIEPTK EGSREELERG YLAVFRII
|
| |