Gene Information |
Locus tag | PHATRDRAFT_47047 |
Symbol | |
ID | 7202141 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 267563 |
End bp | 271764 |
Gene Length | 4202 bp |
Protein Length | 1325 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181172 |
Protein GI | 219121644 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGTCC TTGTGGTCAC AACGAAGATA TAGAGACGTT GTGGTCTTCC TGAGACAACT CCCCTTAAGT ATAAGATTCA TTTCATTCGG AGTAGAAAAG AGAGAATAGT AAGTAGTACG AGTAGCTCGT CGACATTTGC TGATGTTGCT TGTGAAGAAA ACAATGACCC TGCAATAAAT CATCAAGTTT GATTTACTGT CAATTGACGT GTGACAGCGA GTGCTCGCTG GAAACATAGT GGAACCCCCC GCCCCTCCCA CCCCGAAAAC ACGACACGCC GCCGGCACCG GAACGACCAC CGATAGGCCG GTCGACAGGA AAATTGCGGA CGAAAAGTTG CTTTCGGCAA AACCCCAATT GACTGGGAAA CAGAAGCGTG AGGCTATCGA AACAAACAAC GGTCATCCGC GGAACACAGT GTGGCTACTA GCGACCAATA CATCACAAGC CGCGCTCGTT TCCGTTGCCT CGTGCCGAGC CAGCTCGGTG GGCGTCGGAA TCTCTGGGAA GACAAGCATT CGTCCGAGCT GCTGGACACA CCACTACCTG ACAGGCAGGC AACACCGTGA GCGCTACCGT TCCATTCCGT ACGTGAGCTA TGAGTCATCC ACAGCAAGAA CCTTGTGATG GAAATGCTAG CGAACATCGA CTGCCGGTAT CGCCTCGGCT ACAATCGCCG TACCGAGATT TCGTGTCGCC AGGTCGGGAA TTCATCCATT TCGACGAGGA AGAAGACAAC AGAGCGACGA TTTCCCGAGG CTTGAACAAG GCAGCATCGC GTCGGATCAA ACGAGGAGAA TGCCCCGCTT GTGGAGCCAA ACTTTTCAAA AATAGTATGC TGGGCAAGAA GAAGACTCCC TTGACCATTC CGGGGCAATC ATTGAACGGT CGCTGTCTAT TTTGCTTTCC TATTGTCCGA GAAATCGTCG GATCACACTT GGATCGGTAT CCGGACGAAC CAGGCAAAGC GGTCGTGCCT CGATTTCTGT CGGTCCCCAC TGACGATACT CCCGACGATG GTACCGTCAT GTCTACCATT ACACTCGATC ATCATCTAGG ACAATTTGCG CAAGAAGGCG AAAAGGTCGC CCCTCCACCC CGACTACCTC CACCGACCTA CCCTCCGCAA CAGGCTTCCC AGAGTGATCA CATGCGCTGG CCGCCTGGGA CCGACCGCAG TACAGCTCCC GTACTTACAC CACCTGCTTC ATCGCGAAGA CGTAGATCGA GCTTTGACAA TGATGAAGCT GACGAAAACG ATCGAGGGCC GGCACTACCC TGCCGATCAG TGAGTCCGTC ACGGTTTAGT GAGCTCGACC CGTTGAATTC GGAGTTCCGG GCTAATCCAA TGGTCGCACC GTTACGGAGA GCCTCGAATA ATTTCGCTAC GTTCATGGAT CACGACCCAC AGCACGAGTC CAACAGCATC CAACCCCGTC GAGTACAACC CAGTTCCGCC CGTGCACGGG TGGAATGGGA CGGCTACCCC CTCACTGTTC AGTCACACCA GCATAGCGGA GGGTGCTTAT TCCCGAGTCC ACATCCTCGC CCATTCTGGA AAAGTCAACC CCGAACTCTA CAGACCAGTA AAGAAGCCGA CGACGAGAAG GAGGAAAAAT CGGAGATTCA ATCACCTGCC CATGCGCATG ATCCCCCGTG GCAACCATCC GAGCTGAAGC CAAAAGGGAG TCCCCGGGAA ATTGTTTTTG ATTCGTCCCG GAGTCCCTAC GACCCTCCCG AACCGGCTAG CGTGAACTAC AACACCCTAT GGCAACAAAT GTCGAAGTCT ACGCCCATCG 
CTCATGAAGA AATTGTAGTC GACCCAAACA CATTGCCTTT GGATGATGGT AACGCATCTG CTTTGAAGCA AGAAAGGAAA CCAATCGTTG ACAGCGCCGA CCATAGAGTT CGGAACGGCA GTGACCCTGG GTGGGAATAT TCTATGGCCT CGTTTGCCTA CCATCAGGAT TCTATCTCCT TCCAAAATAG CAGTTTTAGT TTTCTGTCGC ACACGAAGTC AACATCGCCA CAGTCCGCCA CCCAAACTTC ACCTAATTTG GAATTGGCAG TATCGTTGGA GGACATTCCT CCCTTGGTAA AGACTTTGCG TGATGATGTA CCCAACACGC ACGAATCGGC GTTGCGGCAC TTAACTGCGA CCGTTTGGCA GTGCGGGAAC TTGGCCCGTC AAGCAGTGAT TGAAGCAGGT GGCATTCCAG TTTGGACCAT GATTGTTTGG CAAGATATGA ACGACGAGGC CATCCAAGTG GCTGCGCTCG ACCTAATTTT CGCCGTGGCA ATTGGAGATT GCATTGATGC CACTTACGAT TATTTGGCCA ATGATACGTT TGATTATGCT GTAGATGCCT TGCTTATTAT GATGCAATCA CTAATTCACA ACGAAGATGT CCAGACTTTT GGCTGTCGAG TGCTGGCCTG TTTGGCAGGT GCTTCCGGTC GTAACGCTAA AGTCAACGAC GGGGCTTTGT CAGGTGCGGT GCTTACTGTC TTGCGAGCAA TTGATTCCCA CAAGCATTCC TTGTCTCTTC GGGAGTGGGG AATCCGTGCA TTGTATCAGC AGTGTGCGTT GTCGAAAGAC ATGAATAGCA ACAGAAAAGC GTTAGTGGAA GCGAAGCTGG ACCACAATAC AAGTGGACTA GACGTGATCT CGTATTGCTT GGACGAGGTG GGGTCAAATG CTGTCATGGC TGAATGGATA TGCAATCTTT ATTGGTGTCT TTCTTCAAGT CAAGAGATCG CCCAGATTCA AGTACCGGCG ACAGAACCGC TATTGGAAAT GACGAATATC GTACGAAAAT ATCAGAAGAG CCGAGGGTCG GTACTGCTTC TACAGGCGGC GCTAGCAGCT ATTTCAAATC TATGCATGCT TGCGGAAAAT CGGAAAGGTC TAGATACCAC TGAGGTGGTT CTCCTTGCTT TGGAGGTGCT CGATTTTCAT CAAGGATGCT CCAGTGTCGC TGTGGAGGTG TGCGGTCTCA TAGCGAGTCT CCTTCCTACA GTACGAAGCA CAGAATGCAT CCCCGCAGGA TCTATTCAGA CCCTGTGTTC AATCTTGGAG TTCCCTCGAG ATTTAAAGAT GAACAGAGAA GCCCTACGAG CATTGAATGC TGTGTTGGCT TCTTCTACTC TTGCACGAAA GCGTCTTTGG GAGACGACGT CGCTATCCTG GCTCACAGAG GTCTCCCGTC TTCATTGCAA TTCGGTCGAA TGGCAGGCAT TGAGTTGTGT CATGCTTTCA AATCTGTTAA TTGCGAATGA ATTGGACACA GGCGAAACAG AGAAGTGGAT TCTCTCTGAG CTGTATTTGA TAATATCGCG GTGTACCGAT GCACTAAAAG TCCAGGAGGT GGGAATTAGT ATCCTTTCAA AAATTTCGAA AGATGAGACC CTTTCGACAC TGTTGGACGA GGAAAGTTTG AAGCTGGTTG TTGATATGAT GTCGAAGTTT CCGCTCTCAA AAATTATACA GAGAAAAGGC TCCTTCCTCA TCTTGAACGT AGCCCGTGGT CCCATAATAC TCAAGCACTT GTCAGTCGCA GAGCGATGCG CGAGCTCACT AGTTATTACA CTGCAAAACC ATCTTGAAAC 
GTCCGATATT ATCGGATTCG CTTGCGATGC TATTTGGGTA CTTATCCACG GTTCCGACAT GCTTAAGGAA ACCGTTGTAA CACAAGGTGG CATTGATGCT CTGTCTTGCG CTCTGGTGTT GCATCAAAAC GAAGTTAGTA TTTTGGAAAA GGCTTGCGGT GTACTGTCAT GCTTGAGCTC AAGAGAGTCT CATATTCAAA CAGTGGTCAA CGCACAGAGT GTTTTTAACG TTGTTGATGC TATGCGGAAC AACCCTAATT CCGCTTCACT TACTCAGTAT GGATGTTTGT TGCTAAAAAA TGTCATCGTT ACAAGCAGGG AGCAGTCAAT ATTGGCCTCT GGTGCAATTA GTGTTGTAAC TGCGGCAATG TTGAAGCATC CCCACGAGAG TGGCATGCAA AGAGAAGCAT GCAGTTTTCT CTGGGCCATT ACGTCGGCAT CAGGCGACTG TAAATCGAAA GTGCTCGCAT TGGATGCGGT ATCCCTTTTG ATGACGGCGC TATCAAGTGA TAAAAAGGAT GTTCAAGACG CCGCTCGTGG CGCCTTCAAT ACTATTGCTC TCACATCCAA TGAGAGTCTT TCTGCTGTAT AA
Protein sequence | MDVLVRVLAG NIVEPPAPPT PKTRHAAGTG TTTDRPVDRK IADEKLLSAK PQLTGKQKRE AIETNNGHPR NTVWLLATNT SQAALVSVAS CRASSVGVGI SGKTSIRPSC WTHHYLTGRQ HRERYRSIPY QEPCDGNASE HRLPVSPRLQ SPYRDFVSPG REFIHFDEEE DNRATISRGL NKAASRRIKR GECPACGAKL FKNSMLGKKK TPLTIPGQSL NGRCLFCFPI VREIVGSHLD RYPDEPGKAV VPRFLSVPTD DTPDDGTVMS TITLDHHLGQ FAQEGEKVAP PPRLPPPTYP PQQASQSDHM RWPPGTDRST APVLTPPASS RRRRSSFDND EADENDRGPA LPCRSVSPSR FSELDPLNSE FRANPMVAPL RRASNNFATF MDHDPQHESN SIQPRRVQPS SARARVEWDG YPLTVQSHQH SGGCLFPSPH PRPFWKSQPR TLQTSKEADD EKEEKSEIQS PAHAHDPPWQ PSELKPKGSP REIVFDSSRS PYDPPEPASV NYNTLWQQMS KSTPIAHEEI VVDPNTLPLD DGNASALKQE RKPIVDSADH RVRNGSDPGW EYSMASFAYH QDSISFQNSS FSFLSHTKST SPQSATQTSP NLELAVSLED IPPLVKTLRD DVPNTHESAL RHLTATVWQC GNLARQAVIE AGGIPVWTMI VWQDMNDEAI QVAALDLIFA VAIGDCIDAT YDYLANDTFD YAVDALLIMM QSLIHNEDVQ TFGCRVLACL AGASGRNAKV NDGALSGAVL TVLRAIDSHK HSLSLREWGI RALYQQCALS KDMNSNRKAL VEAKLDHNTS GLDVISYCLD EVGSNAVMAE WICNLYWCLS SSQEIAQIQV PATEPLLEMT NIVRKYQKSR GSVLLLQAAL AAISNLCMLA ENRKGLDTTE VVLLALEVLD FHQGCSSVAV EVCGLIASLL PTVRSTECIP AGSIQTLCSI LEFPRDLKMN REALRALNAV LASSTLARKR LWETTSLSWL TEVSRLHCNS VEWQALSCVM LSNLLIANEL DTGETEKWIL SELYLIISRC TDALKVQEVG ISILSKISKD ETLSTLLDEE SLKLVVDMMS KFPLSKIIQR KGSFLILNVA RGPIILKHLS VAERCASSLV ITLQNHLETS DIIGFACDAI WVLIHGSDML KETVVTQGGI DALSCALVLH QNEVSILEKA CGVLSCLSSR ESHIQTVVNA QSVFNVVDAM RNNPNSASLT QYGCLLLKNV IVTSREQSIL ASGAISVVTA AMLKHPHESG MQREACSFLW AITSASGDCK SKVLALDAVS LLMTALSSDK KDVQDAARGA FNTIALTSNE SLSAV
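
The length and GC fields in this record can be cross-checked against the coordinates and sequence. The sketch below is illustrative only and not part of the database record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence is complete and ends in a stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Coding bases for a protein, assuming one stop codon is included."""
    return (protein_aa + 1) * 3


def gc_fraction(sequence: str) -> float:
    """GC fraction of a sequence; whitespace between 10-base blocks is ignored."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# Values taken from the record above.
length = gene_length(267563, 271764)      # matches the Gene Length field
cds = coding_length(1325)                  # coding bases implied by Protein Length
non_coding = length - cds                  # bases not accounted for by the protein

# First two 10-base blocks of the gene sequence, used as a small example.
sample_gc = gc_fraction("ATGGATGTCC TTGTGGTCAC")
```

Under these assumptions the coordinates give 4202 bp, which agrees with the Gene Length field, while a 1325 aa protein accounts for 3978 coding bases. The 224 bp difference is consistent with the record marking the gene as spliced.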