Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38549 |
Symbol | |
ID | 7203496 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 366573 |
End bp | 370388 |
Gene Length | 3816 bp |
Protein Length | 1128 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182671 |
Protein GI | 219124774 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTCAC ACGGTCAGGA AACTGTGGTG AGCTTCGACG AGCTGTCCCA GGCAAGTTCC GAAAGCGAAA GCAAAAAATC TTCCAGTAAT GGCGGACGGT CGGATGATGA AGCGAGACAA ATCTTATCCA AGCAGGAATC TGAGGACGTT TTCCGTCTCC GTGTCACAGT AATTCTGGTT TTGTTTGCAG CCGCCATGGC AGTCTCTGCT TCTATTTTTC TCATGATCTC TACCTCAGAA CACCAAGAGT TTGAAAACCA ATTTGAAGGT ATCGCTGAAC GAATTGTTGA TGCCTTCGAA GGAATCCCTC AACAGAGAAT TGGCGCTGTG AGTTCTTTGG CTATAGCGGC AAGCGCGCAT GGCGTCGATC ATCATCAACA TTGGCCGTTT GTAGCCTTGT CTTCTTTTCA GGAGCGTTCA ACCTCGATTC GAAACCAGGC CCAGGCTCTC TTTGTCGCTA TCGCACCATT GATCAGTCAG ACTGATCGAA GCGAGTGGGA GAGCTTTGTG TCAAGCAATG CATCCAATTG GATTGACGAA GGCTTTCTCT ACCAAGAGCA AATGGGGTTA AGCCAATTTG CTCCATCAGG CAACGCGTCT ACCATTATCG CTGGAGCTCC TTTAATTTCA TCATTGAATG AAAATGTATC CTACGAAAAC AGTTTGGGAG AAGGTCCTTT TCTTCCATAC TGGCAAACAT CTCCGATTCT AGAAGAAAGC CTGCTAAATG TTGACATTCT GAATGATCAA GAAAAGGCGG AAAGTGCTCG TACCTGTCTT GAGACCAGAT CCGCGGTCTT TGGAAGGCTT GAAAGCGCTC CGGCTGCAAG CAATGACACT GAATCATCCT TCTATACGCT CTTCTATTCC AAGCTCTTAA GCATCACGGA GGGGGAGTAC GTAAATTACA AGGGTGACCC TCTGAGCACC TTGTTTGTAC CCGTTTTCAG GTCTTTGAAA AAAAAAAAGA TGAACCAGTT GCTGTAATCG TAGCAACAAT GCACTGGGCC TCTCTTTTCA AAAACTTGCT TCCTACAAAT ATTGAAGGCA TTCACGTCGT GTTGGAAAAT CCGTGTTACG GGTCCTTTAC ATATGAAATC GAAGGGAGAA ATGTCAATTC TATCGGCAAT GGTGTAAGTA GACAACAATG AGAGGGACGA AAAGGAGATC TAGAGCCTCA TCGGTGTTCT ATACGTTTTC TTGTACATTT CACAGGACCA CCATGAAGTA GCCTTTGAAA ATACGTCGAA AAACGCGAGC CTTTTATCTG TGGAGAATAC TGTTGACGGA ACGCTGAATG GTCTGCCGTT GTATGAAGGT GAATGCCCGT TTAGCATAAG CGTTTACCCA ACGGCAAAGT TCAGAAGCAA CTTTATCACC CTGACTCCAC TAGTCGTCAC TCTTGCCGTT GCCCTGATTT TCGTCTTTAC TGTCTTCATG TTCATCATGT ACGACCGCTT TGTTGAACGA CGGCAGAAAA TTGTATTACG AAAAGCAGTT CAAACAAGTG CAATTGTCTC TTCAATGTTT CCGAAATCAG TCCAAGATCG TCTTTTAGAA GCCAATGAAA AAGCTAGGAC TCAGGTATCG ACTGGTGCGA ACAGCCAGAT GAACTCTTTT TTCAATGGAG CCGAGCGCAA CAACTACGAT CAGCACGATC CTATTGCCGA TCTCTTCCCA AACTGCACTG TCCTTTTTGC GGATATCGCT GGCTTTACCG CATGGTCGTC TTCACGTGAC CCTGCTCAAG TCTTTGTACT CCTTCAAGCA GTGTACCAGG CATTTGATGC TATCGCAAAC CGTCGCAAGG TTTTCAAGGT TGAAACGATC GGTGATTCGT GTAAGTAATA CCTCCCTCTT CCCTTTGTCA TGGTCGATTC ACTAACAGAA AGCTTCCGTG AACTTAGATG TGGCTGTCGC TGGGCTTCCT GAAGTGCAGG AGAAGCATGC GGTGGTCATG GCAAGGTTTG CTTGGGAATG CCTTATCAAA ATGCATCAGG TTACGAAGGA TCTGGAAGTC TCATTGGGGC CTGATACCGG AGAACTTTCC ATGTATGTCT GCATTACTTC ACGTTGCTTT CAAAGAGTTT GTCAACTTTC ACATGTTCAG TCTTTCTGTA TATTTTTAGG CGCGTGGGTC TTCACAGCGG TGCGGTCACA GCAGGTGTTC TTCGAGGGGA CCGAGCTCGT TTCCAGCTCT TTGGTGATAC TGTCAACACT GCTGCCCGAA TGGAGAGGTA CGGACATTGT GTATACAAAT AATCAAATTG TCAAAGCAGA CTCTAAATCA AATTGATCAC ACTTCAACAG CACCGGCGTG CGAGGTAAAA TTCAGATTTC ACAGTCCACT GCAGATCTCA TTATAGCTAG CGGGAAAGCG CATTGGATCA AACAGCGCCC GGATTCTGTC GAGGCCAAGG GCAAAGGCAC TTTAACAACT TTTTGGCTTC AACCTCGTGT CAAGGAGGGA TCAAGCAATT GCTCAAGCGA AACAGAAGAT GCTTCGAATC CACAATTCCA GTATGAGATA GGTGACAAAC CCCTTAGAAA GAGCTACGCA TTGGTTGCCA AGCAAGACCG CCTTATCGAT TGGATCGTCG AGCTCCTCGC TGAGTATGCA AGAAAGATTG TACGTGCGGA TTTGCCGTAG GCTAAGACAA ACAGTAAATC ATGTCCACTT CTCACCTAAC ATTCTCTGCT TCCTTGCAGA TTTCAAGGCG TGGTGTTCTT CAAACCAAGC CAGACCGGTG TTTCGACCTC TTCTACGAAA CGCCCGAAGG TAAAACATGC CTGGACGAAG CCACAGGAGC TGTATCCCTT CCACAATTCG ATCATGAAGC GGGTTCGAAA AACGCTGACG AATCTGTTGT TCACCTTGAC GCTAACTTTG TTCAACAGCT TCGCGAGTAC GTCTCTATTA TCGCATCCAC GTACAGAGAG AACGCCTTTC ACAATTTTGA GCATGCCTGT CATGTCGCAA TGTCGGTAAA CAAACTCCTC AAGAAGATCA TTTCACCCTA TTGGAGACCC GACGAAATAA ATGGAGGTTC AGGCGGCCTG GCGTTACGCC TCCACGAGTT TACACATGGG ATCTACTCAG ACCCGCTGAC ACTGTTTGGC ATAGTGTTTT CGGCTTTAAT CCACGACGTA GACCATCGTG GTGTCTCCAA CGTCCAGCTC ATAAAGGAGG AACGTGAAAT GGCAGAGGTC TACCGGGGCA AGAGTGTCGC TGAACAAAAC TCGTTGGACA TTTCTTGGGG GCTCCTGATG TCGTCACAAT TCAAGGAGCT CCGTACCTGC CTCTTCCACA ACCGAGATGA GATGATGCGA TTCCGTCAGG TGATCGTCAA CACGGTGCTA GCCACAGACA TTTTTGACAG GGAACTCAAC GAGCTGCGTA CGAAGCGGTG GAGAATGGCA TTTTACGAAA GCCATCCAGA CACAACTTTT GGAAACGATC TCAAAGCCAC CATATTAATA GAGCATATTA TCCAGGCGTC CGATGTGTCG CACACAATGC AGCACTGGCA TGTATATCGC AAATGGAACG AGCACCTGTT CCACGAAACG TACTTGGCGT ATACGGAAGG GCGTATGGCA TCTGATCCGT CAACGTTTTG GTACGAGGGG GAACTTCGTT TTTTCGACAG TTACATCATT CCGCTGGCCA ACAAGTTGCG CGATATTGGC GTCTTCGGTA TGTCGAGTGA CGAATACCTC ATCTACGCAC TCAGCAATCG TCAAGAATGG GAGCAAAAAG GCCAGGAAAC AGTGGCGGAA ATGATGAAAA AATACTCTTC GTATCGCAAG AGGTGA
|
Protein sequence | MSSHGQETVV SFDELSQASS ESESKKSSSN GGRSDDEARQ ILSKQESEDV FRLRVTVILV LFAAAMAVSA SIFLMISTSE HQEFENQFEG IAERIVDAFE GIPQQRIGAV SSLAIAASAH GVDHHQHWPF VALSSFQERS TSIRNQAQAL FVAIAPLISQ TDRSEWESFV SSNASNWIDE GFLYQEQMGL SQFAPSGNAS TIIAGAPLIS SLNENVSYEN SLGEGPFLPY WQTSPILEES LLNVDILNDQ EKAESARTCL ETRSAVFGRL ESAPAASNDT ESSFYTLFYS KLLSITEGEY VFEKKKDEPV AVIVATMHWA SLFKNLLPTN IEGIHVVLEN PCYGSFTYEI EGRNVNSIGN GDHHEVAFEN TSKNASLLSV ENTVDGTLNG LPLYEGECPF SISVYPTAKF RSNFITLTPL VVTLAVALIF VFTVFMFIMY DRFVERRQKI VLRKAVQTSA IVSSMFPKSV QDRLLEANEK ARTQVSTGAN SQMNSFFNGA ERNNYDQHDP IADLFPNCTV LFADIAGFTA WSSSRDPAQV FVLLQAVYQA FDAIANRRKV FKVETIGDSY VAVAGLPEVQ EKHAVVMARF AWECLIKMHQ VTKDLEVSLG PDTGELSMRV GLHSGAVTAG VLRGDRARFQ LFGDTVNTAA RMESTGVRGK IQISQSTADL IIASGKAHWI KQRPDSVEAK GKGTLTTFWL QPRVKEGSSN CSSETEDASN PQFQYEIGDK PLRKSYALVA KQDRLIDWIV ELLAEYARKI ISRRGVLQTK PDRCFDLFYE TPEGKTCLDE ATGAVSLPQF DHEAGSKNAD ESVVHLDANF VQQLREYVSI IASTYRENAF HNFEHACHVA MSVNKLLKKI ISPYWRPDEI NGGSGGLALR LHEFTHGIYS DPLTLFGIVF SALIHDVDHR GVSNVQLIKE EREMAEVYRG KSVAEQNSLD ISWGLLMSSQ FKELRTCLFH NRDEMMRFRQ VIVNTVLATD IFDRELNELR TKRWRMAFYE SHPDTTFGND LKATILIEHI IQASDVSHTM QHWHVYRKWN EHLFHETYLA YTEGRMASDP STFWYEGELR FFDSYIIPLA NKLRDIGVFG MSSDEYLIYA LSNRQEWEQK GQETVAEMMK KYSSYRKR
|
| |