Gene Information |
Locus tag | PHATRDRAFT_39088 |
Symbol | |
ID | 7194747 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 304403 |
End bp | 306693 |
Gene Length | 2291 bp |
Protein Length | 685 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183069 |
Protein GI | 219125610 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.876501 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCGAA TAGGTCTTCT TTCAGCGCTA TTGCTGTTGA GTGCTGCGTT TGCGGAACAG GGGTTGCGTC GAAAGCGTAA TTTGCTCGAT ATTCAAGTGG AATCGGAGAT CCATCATGAG ATGATTTCCC CGTACAACGA GCTTGTCCTT GATGATTTTC TTTGGGGAGG CCGCAAGCTC ATGAAGGTGG ATGGAAAGAG CAAATCGAAA GGTACAATGG ATTCTTTCAG TGCCATGGCC CTGAAATCAA CCAAGAGTCC AAAGAGCACG AAAGCTGGAT TGAATTCCCC CCAAATGACG ACTGTTGGAA TCGGATCATT TTCGCCGAAG TCCAAGAAGA AGAGTCAGAA AACAGAAGAT TGTTACAGCG ACGTTTTTGG GTACTTGTTT TGCGACTCTT CTATGTCTTT CATGTCGATG TCGGTCTCAA TGTCAATGGT AGCGACTCCA ACAATGCCAA CGAATCCGAC GACGCCAACC ACTCCAATAG CCCCGCTTAC TCCAACAGCC CCGTCGACCC CATCAACACC GACGACGCCG ACCGCTCCGA CAACACCGAC CTCTCCAACA GCACCGACGG CACCGACATC GCCAACAGCA CCAACGGCAC CGACATCGCC AACAGCACCA ACGACACCGA CAATTCCAAC GGTCCCAACA CAAGCGCCGA TCGATGGGTG TAGTAGCCTC CCGCGCGATG AAGCTTTGAG GACTCTTGTT GGTGCTATTA CTGACGAAAC TACGCTTGCT GATTCAACCT CTCCGCAAGG ACAGGCTTAC CAATGGATGC TTAACACTGA CCCTGCCAGT CTCGATCCTT GTACTGACGA CAATGTTTTG CCTCGGTATG CTTTGACAAC GTTCTACTTT TCCAGCAGTG GAGCTTCCTG GACAAACAGC TCTGATTGGC TCAGTGGCAT TTCGGAGTGT GCGTGGTATG GCGTTGTATG TAGTAGCACC GGAACCGTTA CGGAAATTGC TCTGTGTAAG TTGCGGTTTT TTAAATTTAT GTGTGACTTT TCGACTTCCT CACAATATGC ATTGCTCTTG CAGTCGAAAA CAATCTTGCC GGTACTCTTC CAACCGAACT GGAGGCGCTG ACTGACATGC AAAGGCTGGA CGTCTTCGAC AATGAGCTCG GAGGACCGAT TCCAAATATT CTTTCCAAGT GGCCTTTCCT CGCATTCTTC GATGTTGAAA AGAACCAGCT CACAGGATCG CCCTTTGTGG ACGCGATCGG TCTTTTTAAT TTGCAGTCCT ATCGTGTCTC GCTGAACGCC TTTGTAGGCG GTACGATCCC AGCTGATTTG GTTGATAAGT TCCCAAACCT GGTCGAGCTT TGGCTATCCA ACAGTACTAT GATTGGCAGT ATCCCAACAG AGATTGCGCT TATGGACAAC CTTCGTACGT ATCGTGTTGC CGCATGCGAC AGGTTTTATT TTGGTGTCTC TCACCCAACA ACGTTCTCTT TCCATTTCTA TAGAATCTAT ATTTTTATAC GGCAACTCTT TGACTGGAAC TCTTCCTTCC GAACTAGGTC AACTCAATCT CGAGCGTTTG CAAGTTCAAG ACAACATGTT TGCTGGAAGA ATTCCCACAG AATTGTTCCG CAGCACCAAC CTTGCTGATT TGCGACTTGA TACGAACAAA CTTGCTGGTC CTATACCTAG CGCGGTAGGC GCCTTGACCA ACTTGATCGA CTTGCGATTG AACAACAACG TGCTGACTGG GACACTGACC ACCGAAATCT CTCGTCTTTC CAATCTCGGT GCGTTTTTAC TAGAAACTTC TTCACTCTCA 
CTTTTGCTTC GGCATCTAAC CTGTTTTCTT TTCGTTATGT ATACTTCGTT CTAGTCTTTT TTGTCCTTCA AGACAACCAG CTCACCGGGA CGTTCCCGGA CGCGTTCGAG CCGTTCCAAG CCTTGCAGTT CGTCGATGTG TCCGGAAACA ACCTGACCGG CCCCTTGCCG CGTACCGTGT TTGACGTTCC CGTGATTGAG ATTCTGTACT TTGACGACAA TGCGTTCACG GGAACAATCC CGGGCAACTA CGCGAATGCC TCGAGCCTCC GCGATCTGTG GCTCAACGAC AACCAACTGA CGGGCACCAT CCCGCCAACG CGTCCCGCTG AGCTCGCGAG TCTCGAGGAA TTCCGTTTGG ACGGGAACAA CCTGGTCGGG ACTGTGCCGT TATCATTGTG CGCGCTGCCT TCGAGCACGG CCTTGACGGC CGATTGTGGG GGTCCGTCCC CGGCCGTCGA ATGCGTATGC TGTACCTTGT GCGTGGCGTA A
|
Protein sequence | MRRIGLLSAL LLLSAAFAEQ GLRRKRNLLD IQVESEIHHE MISPYNELVL DDFLWGGRKL MKVDGKSKSK GTMDSFSAMA LKSTKSPKST KAGLNSPQMT TVGIGSFSPK SKKKSQKTED CYSDVFGYLF CDSSMSFMSM SVSMSMVATP TMPTNPTTPT TPIAPLTPTA PSTPSTPTTP TAPTTPTSPT APTAPTSPTA PTAPTSPTAP TTPTIPTVPT QAPIDGCSSL PRDEALRTLV GAITDETTLA DSTSPQGQAY QWMLNTDPAS LDPCTDDNVL PRYALTTFYF SSSGASWTNS SDWLSGISEC AWYGVVCSST GTVTEIALFE NNLAGTLPTE LEALTDMQRL DVFDNELGGP IPNILSKWPF LAFFDVEKNQ LTGSPFVDAI GLFNLQSYRV SLNAFVGGTI PADLVDKFPN LVELWLSNST MIGSIPTEIA LMDNLQSIFL YGNSLTGTLP SELGQLNLER LQVQDNMFAG RIPTELFRST NLADLRLDTN KLAGPIPSAV GALTNLIDLR LNNNVLTGTL TTEISRLSNL VFFVLQDNQL TGTFPDAFEP FQALQFVDVS GNNLTGPLPR TVFDVPVIEI LYFDDNAFTG TIPGNYANAS SLRDLWLNDN QLTGTIPPTR PAELASLEEF RLDGNNLVGT VPLSLCALPS STALTADCGG PSPAVECVCC TLCVA
|
| |
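The coordinate and length fields in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only and is not part of the source database. The helper names `inclusive_length` and `gc_fraction` are hypothetical. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the gene span holds only coding sequence, the stop codon, and introns (no UTRs).

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among A/C/G/T characters; spaces and other symbols are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return sum(b in "GC" for b in bases) / len(bases)


# Values copied from the record above.
gene_len = inclusive_length(304403, 306693)  # should match "Gene Length"
cds_len = (685 + 1) * 3                      # 685 codons plus one stop codon
non_coding = gene_len - cds_len              # bases not accounted for by the CDS

print(gene_len, cds_len, non_coding)
```

Under these assumptions the span works out to 2291 bp, matching the Gene Length field, and the 233 bp left over after the 2058 bp coding sequence is consistent with the record marking the gene as spliced. `gc_fraction` can be applied to the full Gene sequence field (including its spacing) to compare against the reported GC content.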