Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35442 |
Symbol | |
ID | 7200428 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 928468 |
End bp | 930741 |
Gene Length | 2274 bp |
Protein Length | 757 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179941 |
Protein GI | 219118328 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.725158 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGTGA GGGATCCTAG ACTTAGAATT GGAGGGAAGG TGACGGCAAA GGCTTGTCAT GTTGTGCATC TGAGCAAGTG CGCACGGAGA TATGGCGTCA ACAAGCACTC CAAGCGGCTT GTTGGAACGG TTCTAGACGT CACGACCACC CCTGTATCCA TTACAACCGG GCGTACCTTT ACTTTGATAA CAGCAGTTTA TGATTTTGGA GAGAGTTTGT TCAAGGAAAA AACACTGAAC ATTCGGAGTG TAAAGGCATT TGTACCGCCA GAAGATGAAG GAATGTCCTT AATTGAGGAA TTAGCAGCAG AGGCTTTGCA GGCAGCAGAA GCAGACATGG AAGCCGGAAA CTTGATGGAA GAAAGTGTCG AAGCCCCGGT AGCCGAAATG GTTGAAACCC CGGCTGACAT AGAGCCCGAT ACCTTGGTCG ACACAGAGCC CAATACCCCG GTTGACACAG AGCCCGATAG CCCGGTAGCC GAAATTGTCG AGACCCTGGT TGACACTGAT ACCTCGCTTG ACACAGAGTC CGAAAACCCG GTAGCCACAG TGCACCAAAC AGAGTGGTAT GTGAATGAAA GAAAAACCCG GCTGGATGTG AATGGCCATG TCTATGTTAG GCACTTCCAT ATCCGTACTT CAGTTGGTGA CCTTATTGGT CAAGACTCTG ACAATGGGGT GAGATTTTCG CGCCTCGAAT ATTTTCTGCT CATGTTTCCG CCGACCCAGC TGACTACTAT GTGTCGGCTT ACAAATACTC AGCTTGCACA GCAAAACAAG AATCCAATCA CATCCGGAGA ACTTCTTCGG TTCTTTGGAA TGCTCATACT CACTACAAAG TTTGAGTTCA GTAGCCGGGC CCAACTATGG TCCACCACTG CACCCTCCAA GTACATTCCT GCCCCTTCAT TTGGACGCAC AGGAATGTCC CGGCAACGGT TTGACAATAT CTGGAAATAT ATCCGTTGGA GTGAACAATG TCCAGTCCGA CCCGATGGTA TGAGCACTCA TGTTCACCGA TGGCAACTTG TTGACAACTT TGTCACAAGG TTCAATGAGC ATCGTAGCGA AAACTTTGTA CCTTCCCATC TGATTTGCGT GGATGAATCT ATCTCAAGAT GGTATGGGCA GGGTGGGGAT TGGATAAACC ATGGTCTACC AAATTATATT GCAATTGATC GTAAGCCTGA GAATGGGTGC GAGATTCAAA ACGCAGCGTG TGGACAATCC GGTATTATGC TTCGATTGAA ACTTGTAAAG GGAAAGACAA TAACTGACGA CGAAGAGGGT GACAAGGAGG ATGAGTATCT ACCGCATGGT GCAAAGATTA TCAAAGAACT TGTTCGTCCT TGGTGGGGGA GTGATCGGAT TGTGTGTGCT GATTCTTATT TTGCCTCCGT TGTGACAGCT GTCGAGCTTA AGAGGATTGG CTTGAGATTC ATTGGGGTTG TGAAGTCGGC AACGAGAAGA TATCCAATGG CCTACCTTTC ACAGTTGGAA ATGACAAGTA GAGGAGAATG GAAAGGATTG GTGACAGACG GAATCTTGGA TGAAAGTTGT GACCTGATGG CTTTTGTATG GGTGGACCGA GACCGTCGAT ATTTTATATC AACAGCATCC AATCTGAATA GAGGCTGGAA TCCAGTTCGC TACCGGTGGA GACAGGTGGA TACATCACCT GATGCAGACC CTGAGAGGGT GGAGATCAAT ATTGCGCAAC CAGTTGCAGC AGAAGTGTAT TATTCTTGCT GTGCAATGAT TGACAGACAT AACCGGAGTT GGCAGGATAC ACTGATGCTT GAAAGAAAAC TTGGCACATG GGATTGGTTG ACACGAGTCA ACTTATCAAT TTTTGGTATC ATTGTTGTGG ACACATGGTT AGCCTACAGC CAATGTACAG GAATAGGAAA GTCTGCTGGA CGAGAAGAAA AGCAGAAGGA TTTCTACAGT GCCTTAGCCG AGGAGCTGGT GGACAACCAG TACGATAGTG TTGGAAGTCG CAAAGTTGGG AGGGATGAGT TGGACAAGGA TAGCCCAACC ATCTCCAGAA CTGGAGAGCC GCAATGTGGT CTCTCCGCAC ATCTAACACC CACCAAAAGA AAAAGAAAGA ACAAAGATGG TACTATTAAA AACCAAAGAC AGCAGGGAAG GTGTTTGGTG TGTTCCAAGA AGACCACATA CAATACTGCC GGGCAAACCA GCAGCCTGAT GCATCGGGTC CATCGGCATT TGCCAAAAAA ACCGTTGATC GTCAATGTCG CACGGAAAAT TTGA
|
Protein sequence | MPVRDPRLRI GGKVTAKACH VVHLSKCARR YGVNKHSKRL VGTVLDVTTT PVSITTGRTF TLITAVYDFG ESLFKEKTLN IRSVKAFVPP EDEGMSLIEE LAAEALQAAE ADMEAGNLME ESVEAPVAEM VETPADIEPD TLVDTEPNTP VDTEPDSPVA EIVETLVDTD TSLDTESENP VATVHQTEWY VNERKTRLDV NGHVYVRHFH IRTSVGDLIG QDSDNGVRFS RLEYFLLMFP PTQLTTMCRL TNTQLAQQNK NPITSGELLR FFGMLILTTK FEFSSRAQLW STTAPSKYIP APSFGRTGMS RQRFDNIWKY IRWSEQCPVR PDGMSTHVHR WQLVDNFVTR FNEHRSENFV PSHLICVDES ISRWYGQGGD WINHGLPNYI AIDRKPENGC EIQNAACGQS GIMLRLKLVK GKTITDDEEG DKEDEYLPHG AKIIKELVRP WWGSDRIVCA DSYFASVVTA VELKRIGLRF IGVVKSATRR YPMAYLSQLE MTSRGEWKGL VTDGILDESC DLMAFVWVDR DRRYFISTAS NLNRGWNPVR YRWRQVDTSP DADPERVEIN IAQPVAAEVY YSCCAMIDRH NRSWQDTLML ERKLGTWDWL TRVNLSIFGI IVVDTWLAYS QCTGIGKSAG REEKQKDFYS ALAEELVDNQ YDSVGSRKVG RDELDKDSPT ISRTGEPQCG LSAHLTPTKR KRKNKDGTIK NQRQQGRCLV CSKKTTYNTA GQTSSLMHRV HRHLPKKPLI VNVARKI
|
| |