Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43163 |
Symbol | |
ID | 7196916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 2209625 |
End bp | 2211274 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176932 |
Protein GI | 219110361 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAAAAA TAACGAAGAA ACGCCGGCGG GAACGTAAGG ACGTGCTCGC GAATGGATTG TTCAATATTC CCGGTCTCAC CGGTGAGGCG GACAATTCTT TGGAAACCTG TTTGGCCGAG TTTGGGACGG CAACGGCAGT ACCACTTTTC AAGACTGAGA AAACTTCTCA CGAGCCATTG GAGACACATT CACACCAAAC ACGAAACATC AACGTCACAT CAACCGAAAT CATCAAGCCA AAAGATCAAT CATTCTTTGC TGCCTGTCCC TCGCATACCA AGGATTTGTC GGACAGGGTT TCAAGATGGA ACCGAGAGTT TCGGACATTT GTCGATCAAG GGACTTATAC GCCAGGATTA CTTCCATCCT TTGATGTTGA AGTGGCCCGG CATTTCCAGG TCAAAGATTT ATCGGTTTAT TTGTTGGAGT CATGTCCAGG AATCAAGATT CCTGCTTTCG AACGCTGGTT AATTGATTCC AAAATTGAAG AGCGCCAGCG GATAGCTTTG AACCAAGAAG AACTCTCGCA GCATAACGAC TTGATCCTCA GTCATACAAC GCTAGAATAT GGCGCTTCGC AGCGCTTAGT TGCAGAGTTA AGCGACGCAG GAGTTTCCCA AGATAGGGCA ATCAAGGCGG TTAAAGAGCT CTGTCGACGC ACCCAAGCTG CGATTCCTGA ACTTGCATCC CAAACCCGAC GCTTCGCGCT GCGCACGCCA TTACGCAAAG GCGACCGAAT TGATGTGGTA AAGGACAGTC GCGTTTTCTC GCTAGTCTTC CATCGCAAGA GTTGGAAGAA ACCCTTTCGG GTCAAAATCA ATGTCTCGCA TTACCACAAG TTGAAAACAG CTTTTCTACG TGTCCACAAT TCGGATCACC AACTGAAACC TATTCTGTTG TATGATCATG GAAAACCGAC AAAAGCGATT CATTCGTTTC ATTTGATTAT TATGTCGCTA CTCTTGCGCT ACTCTGCTCT TTCGGGTGGG CAGCTTTTGG TGGACCTCCG GGGAGGGGGC ATGCAAGGAG CTGTGCACGA CGAAGTCTTC GAAGCGTTGC AGACTTGCTT TCCAAACGAA TCGTTTCTCG AATGCTTCGC ATCACCGTTA AATTGCTATG CCGCAAATTT CGGCTCAGCC TTTACCGACA TCGATTTTCA TTTTGGATCG GTTGGCGACT TTTTAGACCA ATCAATCTCA CACGGCGTCT GTGAAGCGAA TCCACCGTTC TCGCCTGGTC TCATGGATAC CATGGTAGAT CGAATAGAAT ACAATCTGAC GTTGGCCGAT CAGACGTCTT CCTGTCTGAC GTTTGTTGTT ATTATTCCGA CAGCCTCTAC CTCGGAAGAT GTCCGTACCG CTAAACGCTT CGCGACCAAG TCTTTTCAAC GCATGCTTGG AAGTGCTGCT TGTCGACTTC ATATTTCCTT GGCAGCGCGG GACCACGGCT ATATTGAAGG TGCGCAACAT TTGCGACCAA CGAGGTACAA GGAAAGCAAT TTTGATACAA GTGTGATCCT ACTACAAAGT TCAGCGGCCA GAAAAGAAAA CATCGATGAA AATAATCTGG AAAAGCGACT ACGTTCCGCC TTTACAAGTC GTCACAAAGC TGAGGTTGAC ACACGCAAGG AACAGGAATT ATCGGAATAA
|
Protein sequence | MPKITKKRRR ERKDVLANGL FNIPGLTGEA DNSLETCLAE FGTATAVPLF KTEKTSHEPL ETHSHQTRNI NVTSTEIIKP KDQSFFAACP SHTKDLSDRV SRWNREFRTF VDQGTYTPGL LPSFDVEVAR HFQVKDLSVY LLESCPGIKI PAFERWLIDS KIEERQRIAL NQEELSQHND LILSHTTLEY GASQRLVAEL SDAGVSQDRA IKAVKELCRR TQAAIPELAS QTRRFALRTP LRKGDRIDVV KDSRVFSLVF HRKSWKKPFR VKINVSHYHK LKTAFLRVHN SDHQLKPILL YDHGKPTKAI HSFHLIIMSL LLRYSALSGG QLLVDLRGGG MQGAVHDEVF EALQTCFPNE SFLECFASPL NCYAANFGSA FTDIDFHFGS VGDFLDQSIS HGVCEANPPF SPGLMDTMVD RIEYNLTLAD QTSSCLTFVV IIPTASTSED VRTAKRFATK SFQRMLGSAA CRLHISLAAR DHGYIEGAQH LRPTRYKESN FDTSVILLQS SAARKENIDE NNLEKRLRSA FTSRHKAEVD TRKEQELSE
|
| |