Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42151 |
Symbol | |
ID | 7202833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 347752 |
End bp | 348960 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181890 |
Protein GI | 219123143 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACGGGC GCGGTGGTGG TGGTGGACGT TCGTCACGTC GTTCGGCCGC CTCGGCCAAC GTCGACACCA CCAAACTCTA CGAAACGCTA GGTGTGGACA AGAGCGCGAC GGCACAAGAA ATCAAAAAGG CCTACCGCAA ACTTGCCGTC AAACACCATC CCGACAAGGG CGGGGACGAA CATTATTTCA AAGAAATCAA CGCCGCGTAC GAAATCCTGA GCGATTCCGA GATGCGGACC AAATACGACA AGTATGGTCT GGAAGGTCTC GAAGAAGGCG GCGGGAGCGG CGGGGCAGCC TCCGAAGATC TGTTTAGTAT GTTCTTTGGG GGAAGAGGAG GTCGTCGAAG TGCCGGACCC CGACGTGGCG AGGATGTCAA TCATCCGGTC AAGGTATCGT TGGAGGACCT GTACAACGGC AAAACAGTCA AGCTAGCCGT CAATCGTCAA GTTCTGGTTG GAGAAGCCCG CGTATGTACC TCCTGTGACG GCCACGGGAT GGTAATGGAA CTGCGACAGA TTGCTCTAGG CATGGTGCAA CAGATTCAGC GCGCGTGTCC AGACTGCGAA GGCGAAGGCT ACCAGTGCCA GAAGAAAAAG GAACGAAAAG TTTTGGAAGT GTTGATTGAA AAAGGAATGC AAAACAAACA AAAGGTTGTA TTCCAGGGAA TGGCCGACGA GAAACCAAAC ATGGAAGCAG GCAATGTCAA CTTTATTGTA CAAGAAAAAG ATCACGAGCT CTTCAAGAGA AAGGGTGCTG ATTTGCTCAT TTCCAAGACC CTGTCGCTCA AGGAGGCACT GTGTGGATTT GCATGGAAGG TAATGCACTT GGACGGCCGT GAAGTCATCA TCAAGTCAAA GCCAGGAGAA GTCATTCAAG CTGAAGCCGC TGGAGGTCGT CCGTTTGTCA AATGCGTCCC CAACGAGGGC ATGCCGAGTC ACGGGAATCC CTTTGTGAAA GGGAATCTGT ACGTGTTATT CACGGTACAA TTTCCGAAAG ATGGAGAGAT CCAACCTGCG GATGTAAAGC AGCTCAGACG GTTTTTGCCG GGATCGGCCA TGGAATGTGA CTACGACGAA GACACTGCCG AAGTTGTCCA TCTGGAAAAC GCCGACGTGC GTAGCTTCGG TAAAGGAGGG GTGCAAAATC AAGACGCAGC TTACGATTCT GACGGGGAAC AAGCTAGTCC GCAATGCCAA CAGTCTTAA
|
Protein sequence | MHGRGGGGGR SSRRSAASAN VDTTKLYETL GVDKSATAQE IKKAYRKLAV KHHPDKGGDE HYFKEINAAY EILSDSEMRT KYDKYGLEGL EEGGGSGGAA SEDLFSMFFG GRGGRRSAGP RRGEDVNHPV KVSLEDLYNG KTVKLAVNRQ VLVGEARVCT SCDGHGMVME LRQIALGMVQ QIQRACPDCE GEGYQCQKKK ERKVLEVLIE KGMQNKQKVV FQGMADEKPN MEAGNVNFIV QEKDHELFKR KGADLLISKT LSLKEALCGF AWKVMHLDGR EVIIKSKPGE VIQAEAAGGR PFVKCVPNEG MPSHGNPFVK GNLYVLFTVQ FPKDGEIQPA DVKQLRRFLP GSAMECDYDE DTAEVVHLEN ADVRSFGKGG VQNQDAAYDS DGEQASPQCQ QS
|
| |