Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42995 |
Symbol | |
ID | 7196220 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1733373 |
End bp | 1735149 |
Gene Length | 1777 bp |
Protein Length | 549 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177360 |
Protein GI | 219111217 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACCA AACCGACGGC AACGGCAGCG CCGTCGCTGA ATATGTATCA GAAAATCGCG GTGATTCGAC ACTCCAGCGA TAGCTTGACT TACGATGACA GCGACGGAAA CTCAAAATCC GTGTCCGTCC AGCAACTCGG GGACGCGATT GCCACCGCGG TACGTAGCAA AACCCCGTCC GTAGCGCTGA TCCTTCGAAT CGTGAGATAG CGAAAGGAAC ACAGTACGGA ATGGATTTTT GCATTCCATA GCCATCTGAC GCAATGCTTT ACTTTTTCCT CTCCAGACAC CACTGGTTTT GCCAAAGTCC AAGCTCGACA TTCCGGTTCC CCGAATCAAC AACGTAGAAA GTTACGAGCG TGACGTACCG GCAACGTATC AAAAGCCAAT TTCGTACGTG CGATGTCACA GACCTTCACG GGCGGAATTG AAAGCAATGG TAGAATACGT CGCCGACCGC GAGGACCAGG AATGGCTGAC GAACAACACT AAATTTGGCG GTGCGGTCGT GTGGGACGAG GGATTGGACA CGTTGCAACA GCGAAAGCCT CAGCTACCGT TAGCTCTCTT GGAACGCATT CTAGATTTGT TCGAGAAGGA AACTGGCTTT GACGCCATCA TGACATCAAA TCAAGCAGAA GCTATGGTAT TTAAGAACAT TCCGCTTATT TATCAAATCT TCCCGAATAA ACCTCGGAAT GGGGTGGTGA CAACCAAAAC AGTTCTCCTG GAAGTATACA ACTACTGGCT TCACAAGCGT TCCAAGCTCA AACGCCCCTT ACTGCGACGC TTTTGGCCGG TCACTAGTAG CGACGACACC AACCCTCATC TTGTTTTCCG ACCTCGGGAA AAAGAGAAAT ACAAACTACG TAAAAAACGT CAAAATGACA TGGACGCGTA CCGAAAAATG AAACAGCTTC GCAACGATTC GGACAATCTG CGTGCGGTGC TGGAGTTGGT CCGTCGACGA GAAGAGCTTG CGCGTGCCCA CATCAAGACT CAAATGGAAT TATTCGAACA GCGTATGTAC GACATCGTCG ACACGACCGG ACTACCCCGG GAATTGAAGC ATGTAGACAA AGATCAGCTT AAGCGGGTGT TGGACACGCC ATCCTTTTTC GACATCTACT ACGGAGGGCG GAAAAAACAG ACCGCTCGGT CCCCTGTTTT CCCTAGTGAT ATTACAGCGC GTGAAGCTCG CCCTCTCTTG AGTAAGACCC TCCACGACAA CGCTAGTAGT GCCTCCCAAG AAACACCAGC GATTGTCGCT GGACAAAATA GTGGTGAACC CGCTCCCTTG TTCCTTGATC CATTACAGAC TCGAGAGACG TATGCCACTT CTTGGCAAAA TGCTGTGCCT CACGTAACTT CGTACATTGA ATCTCATGCC GAACCGACTT TTCGGTTTAG ACATCGACCG CGGGTTGGTC GCGGTGGTCG ACTTTGTATC GATAGAATGC CCCGCCCGCC GAATCCGACC GGTCCGACCA CTACTGTCGT CACCGCCGGT CGCGGTATGC CCCAGTCACT GACCCATAAG GACCGCCTAC TCGACCTGCT CCCCAAACCA CTCGATCATA TTTCATTGAG TCGAAAAATC GAATCGATGT CTGTCGAAGC TATCAAAGAA GACCAAGAAG CCAACGTGTT AGCGGCGGCA ACCAATGGTG ATTTAGACGA AAACGATGCG GACGAAGTGC TGGTGAAGCT CGACGACTGG CTGGAGACGG ATGATCAACC ATGGGGAAAC GAACGGTTTG CAATTGGTCC GCTTTGA
|
Protein sequence | MATKPTATAA PSLNMYQKIA VIRHSSDSLT YDDSDGNSKS VSVQQLGDAI ATATPLVLPK SKLDIPVPRI NNVESYERDV PATYQKPISY VRCHRPSRAE LKAMVEYVAD REDQEWLTNN TKFGGAVVWD EGLDTLQQRK PQLPLALLER ILDLFEKETG FDAIMTSNQA EAMVFKNIPL IYQIFPNKPR NGVVTTKTVL LEVYNYWLHK RSKLKRPLLR RFWPVTSSDD TNPHLVFRPR EKEKYKLRKK RQNDMDAYRK MKQLRNDSDN LRAVLELVRR REELARAHIK TQMELFEQRM YDIVDTTGLP RELKHVDKDQ LKRVLDTPSF FDIYYGGRKK QTARSPVFPS DITAREARPL LSKTLHDNAS SASQETPAIV AGQNSGEPAP LFLDPLQTRE TYATSWQNAV PHVTSYIESH AEPTFRFRHR PRVGRGGRLC IDRMPRPPNP TGPTTTVVTA GRGMPQSLTH KDRLLDLLPK PLDHISLSRK IESMSVEAIK EDQEANVLAA ATNGDLDEND ADEVLVKLDD WLETDDQPWG NERFAIGPL
|
| |