Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45159 |
Symbol | |
ID | 7200339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 352800 |
End bp | 354851 |
Gene Length | 2052 bp |
Protein Length | 683 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179193 |
Protein GI | 219116797 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0183197 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTCCG AAACTGAACG CCGAAGCTCT GATTTAGAAA GGGAAGGGCA AAAGATCGAC GTTTCGATGC TATCGTACTC GGAGATGACT TTGGTCAGCT CCGGAGAAGT AATGACACGC GAGACCAAGG CTGCGATTCG CAAAAAGCTT GGGTATTGCA CGGAATGTCC AGGGCGACCG GTACTTGTCT TCGACGTGAA GAGGTCAAAA TTAAACCCCT TGTGGTTTTC AAAGAAGGTC AGGAGTATAG ATGGGGAATG CAATGATGGG CATTGCCTGA AATGCCATCC TGATTTTGAT CCAAAGAAAC CCTCGGAACG AAGAGCCCTT CGGCGCTATT CGGAAAAGAG TTGCGGGACA AACTCAACTG TTTCCATGGG CTCGTCACGC TCGCTAAGGT CTCAAGAGAG TGATAGTAGT TCACCTAATA CTCCTGTGAG AACCACTCCT CGGCCCAAGA CCCGTGCTTT AGCCAGGTCT TTGTCAAATG AAACGGACCT TCAAGCCATC AAGCGAATGC CTCGGCTAGA TAGTAAACGT ATCATGGAGA ACTCCGACTC CGCCAGTGAT GAACGACATT ATGATCAAAG AGCCAACCGA CAACATTCTG ATGGGAGACC CATGCCTCGT CGCTCCAAGT CTGCTAATCT TTCTCCCGAA ACCTCGCCCA ACTCCGGTGA CTCAGGCGAA GGTCAGAACA TGACTCTTCG AGGAAACAGC GAACACCTTC GCTCGTCCAC GAGCCACCAC GAAGGTCGAG TGACTGTTGA ACTATCTCGC AAAGTCGTCG ACACCCTTTA CAATGATGAT ATCAAAGAAG CCTCCGTTCA TTCGTTTTCT ACAGTTGAAA GATCGGAAAG CTCCCCACCG GCTAGTCCGT CGGATAACTG GAGTCCCACA CCTAGTCCGA GACCTCGCCG GAAAATTAAG CAACAAGATT TGCAGTCTTC TGCCGTTACC GAACTTCAGG GACTACTGGT TGATCTTGTC TCCAGCGGCA GCCATCATGT TCTGTCTGAA ATTCTTTGTG ATTCAATGAA GTCGAATTCT AACGACTTGG AGATTCAGCT GTTATGCCTT CGTACAATAT CTTCAGCGTT CAAGCAGAGC AACGTTGGAA TCTCCAAGCT TGTCTTGGCC CGTGCACATA TAGAAATCAT CAATGCGATG CAACACTTTC ATTCATCGAT GGAGATACAA CAGAGGGCAT GTGCTGCAAT CTGGGCATTA TCGCGAGACA ATAGCGCAAG GACAGCTTTG ATACGTAAGG GTGTCTGCGA CTTGATTCAA AGAGCAATGT CGAGGAACTT TGGTGATAAT GTACTTGTAG AAAAGGCCTT AATGGCTCTC CGAGCACTCT CAATCGACGA AGAAGCACGC GACATTCTGC ACCGCGTGAG CGTGGTGCAC GACATTGTTC AGGTCATGGA GTGTCACTGT ATGAACGCGG ATATCCAAAG GGATGGATGT GCAGTCTTGT CCAACATGGC AGTCGAGTTT GAGAAGAAAC AGGCCTCCGT TGTGTCGGTA CACGTTCTGA GAGCGGTGGT TGCCGCTTTG AAGACGCACT TAAGTGATCC TGTTGTTGTC GAGAGTGCAT GTTTCGCTCT TAACAACTTC TCGTACGAAG AAACAAATCT GAGGGCTCTT CGTCGTTTTA CTGAGATATT CCCTCTTCTA GAACGGGCCC AGCAAATCAG CAAGGGTGAC CATGCATTTG CTGTTGTCGA AAAGCTGCAA ATTTCTCGCG CTGAAGATGA GTCCTTGGAG GAACAACTGC ATGCGTACTT GCTGCATCTT ATTGAACAGA AAGCCGATGT ACCGGAAGTT GTGGAAGAAG TTGTAGAATT CCTCCGTGCT AACGAATGGT CTCTGCGCAT GATGGCAGAA ACCCTCAGTG CGCTGCGTCT TCTCGCCACT AAGGCAACTT TCCACAAAGA AATTCTTCTT GACCAGTTAT CAATACCTGA TCTCGAAAGA ATATTGAGAG TGTTTGAAGA CGATATTCGT ATTCAAACAG AAGCACGCCG ACTTATCGAA TGCTTTAAGT AA
|
Protein sequence | MASETERRSS DLEREGQKID VSMLSYSEMT LVSSGEVMTR ETKAAIRKKL GYCTECPGRP VLVFDVKRSK LNPLWFSKKV RSIDGECNDG HCLKCHPDFD PKKPSERRAL RRYSEKSCGT NSTVSMGSSR SLRSQESDSS SPNTPVRTTP RPKTRALARS LSNETDLQAI KRMPRLDSKR IMENSDSASD ERHYDQRANR QHSDGRPMPR RSKSANLSPE TSPNSGDSGE GQNMTLRGNS EHLRSSTSHH EGRVTVELSR KVVDTLYNDD IKEASVHSFS TVERSESSPP ASPSDNWSPT PSPRPRRKIK QQDLQSSAVT ELQGLLVDLV SSGSHHVLSE ILCDSMKSNS NDLEIQLLCL RTISSAFKQS NVGISKLVLA RAHIEIINAM QHFHSSMEIQ QRACAAIWAL SRDNSARTAL IRKGVCDLIQ RAMSRNFGDN VLVEKALMAL RALSIDEEAR DILHRVSVVH DIVQVMECHC MNADIQRDGC AVLSNMAVEF EKKQASVVSV HVLRAVVAAL KTHLSDPVVV ESACFALNNF SYEETNLRAL RRFTEIFPLL ERAQQISKGD HAFAVVEKLQ ISRAEDESLE EQLHAYLLHL IEQKADVPEV VEEVVEFLRA NEWSLRMMAE TLSALRLLAT KATFHKEILL DQLSIPDLER ILRVFEDDIR IQTEARRLIE CFK
|
| |