Gene Information |
Locus tag | PHATRDRAFT_39341 |
Symbol | |
ID | 7195058 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | + |
Start bp | 245285 |
End bp | 248973 |
Gene Length | 3689 bp |
Protein Length | 1167 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183327 |
Protein GI | 219126151 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
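The Gene Length row agrees with the Start bp and End bp rows if the coordinates are read as 1-based and inclusive. That convention is an assumption about this table, not something the record states. A minimal sketch of the arithmetic follows; the function name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Start bp / End bp rows above.
gene_length = span_length(245285, 248973)
print(gene_length)  # 3689, matching the Gene Length row

# Bases needed to encode the 1167 aa protein plus one stop codon.
coding_bases = (1167 + 1) * 3
print(gene_length - coding_bases)  # 185 bases not accounted for by coding sequence
```

The 185 bp difference is consistent with the gene being marked as spliced, though the record does not list intron coordinates, so attributing it to introns is an inference.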
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.338277 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGGAA ACCCCCCCAA AGCTATGATG GCAGAGTCCA TTTACTTGCC TCCAAAGCCA ACCTTGGGTG GTCTTACTTG CCAATCGGCC CGTGAAATGG TTGCCTACAC TGGCGGGAAG CCCAACGCAA CGTGGACCAA TCTTGCTGAA CCTACCAAAT GGGAAACGCC TAACGTAATT TGCCCCAACG GAGTCAGCGA CAAATGTAAG AACTGGAACT TTCTTACGCA GGCCCCAACA AAGTTGTTCT CAAAGTCGGA TACGTCCTAC CTACAAACCA CCTGGATTGC TGACCAGGTC AAGCATTTCC GCGAGGGCGG CCTCAACTCA GTCATGTACC GCCCTAGTAT TGCGGACCCG ACCGATATGG TAAATGTCAT GGAACACCAT GAGCAGGTGA CCTTGACCCA TGTTCTTGAA GTCGAAGAAG ATGTTTGCAG CAAGTGGGAC GCCTACAGTT GTTACAACAA CAAGGCAAGC TCCAACCGCA TCCTCAACTC AATCAGTGCT GAGCTTAGTG AAGAATTGAG GCTTCGGATG CTCCCGACGG ACACAGCGGC CGTCTTGTGG ATGAGAGTTA TGAAACTGGT GGTGGACGGC TCTATTGAGC ATTACAACCG CCAGAAGGAT GCCCTTTGTG CCCTCAGCCC GTTGTCCGAG CCTGGACAGA ATGTACACAC GTACTCTGGT AAGGTCCGCC TCATTTGCAA CGAGCTCTGG CATGCCCGCC AGTGGGAATG GCCTTTAATG CTTGTGATTG TCCGCCAGTT GTGTCTTGTT ACGGTCAAAG CTTTCCAGTC AATGTTCTTA CCAATCAAGA TGGCAATGGA CCGTACCTTG ACAGAGATCT CTTATCTTGA CCGGACTTTG GCAACTGCGG TGATGGTAAA GAAAGGGTAC CACTATACCC AATTTCTAAC CCTTGTTGAG GACACTTACA AGTCTCTTCT TGACAACAAG GACTGGCCGC CAGCAAGCAA TCAAAAGGAG ACCCAAGGTG CCCCAACTGC TTTCTTGGCT GGGATGTTGG AGGTCCAGCT CAATGCGCTG GTCCAATTGC ATGTCAACAA GACTTTGAAT GCTGAAAAGA AAGTATTCAA ATGTTTTAAG TGCGGCCAAA CTGGCCACTT CTCTCGCGAT TGTAAGCAGC CGGCCTCAGA GGGAGACAAG ACTCCGTCCG ACAGCAAGAA GCCTAAGGCA AAGGAAACCG GATGGCGTGT TGAAGCCCCT GCTGCTGGAG CCACTCAGAC TAAGGTGGTC AACAGTAAAA CATACCATTG GTGCGCTACT TGCAACAATA AAGACGGCCG CTGGAACCTC ACTCACGTTA TGGCCTCGCA TACCACTGGA GCCGGTCGCC GGGGAAAATT TGATTCACCG GCCCCTTTGG GCCTCATGGC TGAAGTCTCC TCTCCTGATG GTGTCCTTCC CAGTGACGGT CGCTTCTGGT GACTGGAGGG TCGGGGGGAC CCCGATCTTT CTTTCCGTTT GCCGCCAAGT ATTGCTCTGT CATCTCTGAC GGCCTATGCA CTCCTCACTT TGTTTGATGA GCCCATTCCG CCGGGAGAAC AGCAGTGCCT CTCAGTTACC GCATTGTGCC GCTCTCCAGC TGTGGCCGGT GTTGCTGTCT ATGCCGGTAC CAGTGTGGAT GCCAAACCGG ACAGTTTCCC AGTCATTTGG GACACTGGTG CTTCGCTCTC AATCTCCCAT GAAGCCGGAG ACTTTGTTTC TGAGGTCCGT CCTCCACCAA CGCCGCTGGT TTTGAAAGGC TTGGCCAAAG GCCTTAACAT TGTGGGTGTG 
GGCACTGTTG AATGGTTGGT ATTGTCTCAG GACGGGATGC CTTGTCTGCT GTGTTTGGAT GCCTATCTTG TCCCATTGGC TGGGCAGCGT CTCTTGTGTC CGCAGTCGTA CATCCAGCAG CAGCAGCATC TCCTACCCAG TGACCCTGGC AAATTTGTTG TTGATTGTGA TGGTATGTCT TTGGTTGGTG CCGGAGACAA CACTGTCCGT GTTCCCTTCC AGACCTCCAA TAACTTGCCG TTGTGCATGG CCTGGTTGCC CTCTGGCTCT CCTTCCCTTG TGGCAGAACT CAATTTGTGT GTGACCAATG CCCAAAATCA GAATTTGAGT TTGGCCCAAA AGGAGTTGCT TTGTTGGCAT TATCGCCTTG GGCACTTGCA CTTTGAGTCC ATTTGCCGTC TTCACCGGAC GGGCGCTTTG TCCCAGAGTG CCAAAGTCCG TGCTCTCCAC CGTTTGGCTG CCAATTGTGA CCTTCCAAAG TGCGCATCCT GCCAATTCGG CAAGGCTAAA CGTTGTCCCT CTCCTGGTAA GGCCCAGACC ATTGTCCCGG CACATGATGG CTCCATCAAA AAGGAGCATC TATCTCCTGG ACAACAGGTC TCTGTTGACC ATTTCATTTG CTCTGCTAAA GGCCGGCTCT CCTCTTCCAA GGGGAAGACA ACCGATGATC GGATGTTTTC GGGCGGCTGT TTGTTTGTTG ACCATGCTTC TAGCCTGGTT CATGTTGAGC ATCAAGTTTC TCTCACTTCC CATGAGACCT TGCAGGCTAA GCACCGCTTT GAAACCATGA CCCGGGACCG TGGCGTGACG CCACAGTCCT ACTTGTCTGA CAATTCAACG GCCTTTACCA ATGCTGAATT CACTGTTGAG CTCCGCATTT TCCGCCAAGT CCAGCGTTTT GCCGGTGTTG GCGCTCATCA CCACAATGGT GTGGCTGAGC GCAACATTCA GACCATCATG GCAATGGCCC GCACCATGAT GTTGCATGCT GCTATTTGTT GGCCTGAAGT TGCTGATCCG TCTCTCTGGC CCATGGCTGT TGACTATGCC ATATACCTGC ACAACCACCT CCCCACCGTC TCCGCTGGTC TGGCTCCTAT TGATGTCTTT ACTGGGAGTA AGTGGCCCAT GCACAAGTGT AATGATCTCC ATGTCTGGGG GTCTCCAACA TAAGTGCTCG ATCCGACGCT TCAGGACAGG AAGAAGCTGC CTAGATGGAA GCCTTGCTCG AGACGTGCTG TTTTTGTGGG TTTCTCTCCG AAACATTCCA CAACCATTCC CCTTGTTGTC AATCTTGTCT TGGGGGCCAT TAGCCCGCAG TTTCATTGTG TTTTTGATGA CTGGTTTTTG ACGGTTTTTT CGGATCCGGA CCGGATTCCT GATTTTGAAC AGTCTCCATG GACCAATTTG TTCAGCAAGA GCCGTTTCCA GTACCCGTTT GACGCCGATG ATGGTTCTCC TCCTCCATTG GAAGAGCAAT GGCATGACGA GTTTGCCGAG CGTACGGCCT CTGCAGCTTG TGAGCTCATT GTTCGCGATG CCCAGGACGA AGCCTTGTCC GGAGCCAATG CCGCTCCACC GCTTGAACCT GTACCATCCG GCCCCTCTCT GGCCGGAGTG CCCACTTCTC AGCCACCAGA GCCCCTCAGC GCTCCTCTGC CGACTGTTCC GCGTGTTCGT TTCAGTCCCG ACGTTGTTGC TCCACTGGCT CAGAGGGAGC CGGACCCCGC TCCTTTGGCT GTCCCCTTGG TTCCTGCTTC GGAGCCTCCG TCGCCTCAGC AGAGGGAGCA GCCCCCTGCT CCTCCTATTG 
TCCGTCGCTC GAGTCGTTCT ACTCTGGGAA AGCCGAGCAA ATGCTTTGCC AATCTGGAAT GGCACCTGGC CCAGACTGA
|
Protein sequence | MSGNPPKAMM AESIYLPPKP TLGGLTCQSA REMVAYTGGK PNATWTNLAE PTKWETPNVI CPNGVSDKCK NWNFLTQAPT KLFSKSDTSY LQTTWIADQV KHFREGGLNS VMYRPSIADP TDMVNVMEHH EQVTLTHVLE VEEDVCSKWD AYSCYNNKAS SNRILNSISA ELSEELRLRM LPTDTAAVLW MRVMKLVVDG SIEHYNRQKD ALCALSPLSE PGQNVHTYSG KVRLICNELW HARQWEWPLM LVIVRQLCLV TVKAFQSMFL PIKMAMDRTL TEISYLDRTL ATAVMVKKGY HYTQFLTLVE DTYKSLLDNK DWPPASNQKE TQGAPTAFLA GMLEVQLNAL VQLHVNKTLN AEKKVFKCFK CGQTGHFSRD CKQPASEGDK TPSDSKKPKA KETGWRVEAP AAGATQTKVV NSKTYHWCAT CNNKDGRWNL THVMASHTTG AGRRGKFDSP APLGLMAEGR GDPDLSFRLP PSIALSSLTA YALLTLFDEP IPPGEQQCLS VTALCRSPAV AGVAVYAGTS VDAKPDSFPV IWDTGASLSI SHEAGDFVSE VRPPPTPLVL KGLAKGLNIV GVGTVEWLVL SQDGMPCLLC LDAYLVPLAG QRLLCPQSYI QQQQHLLPSD PGKFVVDCDG MSLVGAGDNT VRVPFQTSNN LPLCMAWLPS GSPSLVAELN LCVTNAQNQN LSLAQKELLC WHYRLGHLHF ESICRLHRTG ALSQSAKVRA LHRLAANCDL PKCASCQFGK AKRCPSPGKA QTIVPAHDGS IKKEHLSPGQ QVSVDHFICS AKGRLSSSKG KTTDDRMFSG GCLFVDHASS LVHVEHQVSL TSHETLQAKH RFETMTRDRG VTPQSYLSDN STAFTNAEFT VELRIFRQVQ RFAGVGAHHH NGVAERNIQT IMAMARTMML HAAICWPEVA DPSLWPMAVD YAIYLHNHLP TVSAGLAPID VFTGMLDPTL QDRKKLPRWK PCSRRAVFVG FSPKHSTTIP LVVNLVLGAI SPQFHCVFDD WFLTVFSDPD RIPDFEQSPW TNLFSKSRFQ YPFDADDGSP PPLEEQWHDE FAERTASAAC ELIVRDAQDE ALSGANAAPP LEPVPSGPSL AGVPTSQPPE PLSAPLPTVP RVRFSPDVVA PLAQREPDPA PLAVPLSFYS GKAEQMLCQS GMAPGPD
|
| |
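The sequence fields above are printed in space-separated blocks of 10 residues and may wrap across lines, so they need cleaning before any downstream use. The following is a minimal sketch, assuming the field text has already been copied into a string; the helper names `clean_sequence` and `gc_percent` are hypothetical. It does not claim to reproduce the GC content row, which may have been computed with a different method or over a different span.

```python
def clean_sequence(raw: str) -> str:
    """Drop the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases among all bases in the sequence."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(dna.count(base) for base in "GC")
    return 100.0 * gc / len(dna)


# First three blocks of the Gene sequence field, used only as an illustration.
raw = "ATGTCTGGAA ACCCCCCCAA AGCTATGATG"
dna = clean_sequence(raw)
print(len(dna))  # 30
print(gc_percent(dna))
```

Pasting the full Gene sequence field into `raw` works the same way, since `str.split()` with no arguments splits on any run of whitespace, including line breaks.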