Gene Information |
Locus tag | PHATRDRAFT_46560 |
Symbol | |
ID | 7201700 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 708160 |
End bp | 711988 |
Gene Length | 3829 bp |
Protein Length | 1245 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181059 |
Protein GI | 219120650 |
COG category | |
COG ID | |
TIGRFAM ID | |
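
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported span runs from the start codon through the stop codon (the listed gene sequence begins with ATG and ends with TAG). The helper name `feature_length` is hypothetical and is not part of the source database.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1

# Values copied from the record above.
gene_length = feature_length(708160, 711988)

# Coding bases implied by the protein length: 3 bases per residue plus one stop codon.
protein_length_aa = 1245
coding_bases = protein_length_aa * 3 + 3

# Under the assumptions above, the remainder is non-coding (intronic) sequence,
# which is consistent with "Is gene spliced | Yes".
non_coding_bases = gene_length - coding_bases

print(gene_length, coding_bases, non_coding_bases)
```

The computed span matches the Gene Length field, and the difference between the span and the coding bases is what one would expect for a spliced gene.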
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.585047 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACTCTC ACCTTGTCCT CGGCTTAGCC ATTCTTTTCC TTGGCGCTAG CCAGTGCAAC AGCAACCTGA TTCGTGGTAA TTCTTCGATG TCTGCGAGCG AAGCCGGCGA TTCTGAAGCG CTAACAACAA CCTTATCTCG GCGTCTTGTC GAAAATTTTG TTCCTTTGAG GTGCAATGCT GCCATCGGTA AAGCCCCTTG TTCATCTTTC GTCACGATGT TTGGCAATGA CTCTGTTCGA TCAAACAGAG TCATTATTCC ATGTGGCAAG TGCGTTACAA TGGACTATCC GGGACCGCAA CTAACACTAA ATGATGGCAT CGACATACGA GGCAAACTAG TGTTTCCAGA CGGATACCGA CTCACGGTTC GGACGAACTT GGTGTCGGTC CAAGGCGAGC TTGAGATGAG AAGTTCTAAA CCAGTTGATG GAAACCCCGA TGTACGGTTT GTAATGCAAG GTACTGAGAG TGCGTCGTTC GCACCGATCA GCAGCAACGC CGCAAAATGT GGAGGAAATT GCGATGCAGG AGCTCGATCC ATTGCAGTGG CTGGTGGGAA GGTCAATCGT AAGTTCGGAT TGTTGCGAAA ATGGCATCCG CATTTCCTGC ACACATATAC GTGACTGTCG GCACTTACAG AAATCCCTAA TTGTTGCAGT CAATGGCCTA CCTACTAGCA CACCAACCTG GCTACATATC CAAGACGTCC TTGGTAGCTC GGCTATTGTT GTGTCCAATT CGGTCCGTAA TAAATGGGCT ACTGGTGCAA CTATTGTTAT CACCTCCGAA CACCAAGGTT ACTTTGGCGA GCAAGTTCGT AAAATCACCA GCATTTCGAA CGTCGGCTCC AATAGCGTCC GTCTCAATCT CGACCAACCC ATCAACCGTC CGGTCACGCT TCGCGACAGC CCAGACTTTG CCACCGAGGT AGCCTTGCTG TCACGCAACA TTGTCTTTGA AGGGGCCCCC GGAACGAAGG GTGGCCACTT TTGGATCATG CACACTCCCC GAGTGAGGCA GCGTATCGAA GGAGTGGAAC TCGTTAACTT CGGACAAGAA GGCCTCTTGG GTAGATACCC AATCCACTTT CACATGTGCG GGGATGTCTC AGGCTCGGTA GTAGTCAAGA ATACCATTCG CAATTCCAAC CAGCGCTGCA TTGTCGTCCA CGGCACTAAC AACCTCCTTG TTCAAGAAAA TGTAGCCTAT TTCACCAAAG GGCACTGTTA CATGTTGGAA GATGGCATCG AGACGGGCAA TCAGTTTGTG CGAAACATTG GCATACGCAC CGTCAAACCG GAGGTCATCA TTCCGAACAT GGGTAGCAAT GGCAAGGAAT CTGATAGGTC TGCCAGTACC TTCTGGATCA CGAACGCTGA CAACTCGTGG ATCGGAAATG TAGTAGCCGG ATCCGAAGCT CTGGGCTTTT GGTTTGAGCT GTTGGTCCGT GGAAACCTGG CCAACGAGCA CCAAGACTTT GATCCTATGA TGGTTCCGAC CCGCAAGTTC GAGGACAACG TCGTTCATAG CGTATTTGGG GTAGGGATGA CCTACTATTT GAGCGGTTAC ATCCCGGAAA CACTGCAGTA CTTCAAAAAC AACAAGTTCT TCCGCAACCA TCACCTTGCG CTCCGTATCC ACCGGTCACG AAACATCGTT CTGACTGGCA ATAAGTTCTC GGACAACAGA TATGCCATTC AAATCGATCG TGACGAAGAA ATTCATGTCA CCGACACAAC CATTGTTGGC TATTCCGATC TATTCAAGGA CGTGGTCAGA AGGAATCGCT TTGCCCAAGC 
ACCTTGCGCC CAAGGGATAT CTTTCCAATC GACTAATCCA TGGCAGAAAA AGATGGATTC GGAGCTCAAC GGTGTCATTT TGGACAAAGT CAGATTTTCT GGATTTTCCA ACGCGGTGTG TTCGTCTTCA ACCGCAATCG AGCTGGATTC CCGACTCGAC GGGTACAAAT CGTTCGAGAT GTTCTCACAG TACTCCGGCG CCACCGTAAG TGACGCAAAC TCGATCGACT TTTGTCGTGG AAAGTCTGCC GGTGCACGTG ATGTGTACGT GTCCGATACC ACAGGTTCTC TTCTCGACGG CGTTGCCTCT GCTCCTTCTA CTCTGATGGT CAACTCGCCG GAGCTGGCAA GCTTTGTCAA TCCCAGTGCA TGTACCGAAA ACGCAGCTCG ATGCTACACC TACTGTAGCA ACACTTGCTT CCGCACCGTT CACTACTACG TCCCAGTGGG CCAAAGTCGA GACTACAAGC TCAAAGTATG CGACCGCAAG GACGCGTCCG ACTGTACCGT ACTCTCCGGT TACGTACACT TTAACCACCC CTGGCCTCGC CGATTTGCCG TGCATGTCCC GTCAGGACGG GAGTACGACA CTTACTTCCT CGACAAGGGA GTGCCTGTGT ACCCAACGAA TGTCGAGATT GTGTTCCAGG AAAAGCTTTG CCCAACGGCA CCGGATGATG ATGATATCGC TCTTCTCTAC AAGGCTCGAG GTACCACCTT TCCCCCCACG CCTAGTCCGA CGACAGCTCT CGCCGCATGT GGAAACTTGA TCGCAAACTC GGACTTTGAG CGTGGTTTCA ACGGATATTG GGATGCACAA GGAGTCGGTA CATTGTCTAC CACAGCTGGC TATAAATCTG CCACAGCTAT GTACTACGCC TCTGGTAATC GCAACCGGTA CTGGGTGGGA CCATCACACC AATGGCGAGA AGGTCTGGAC TTGAAATGTC TAAAGCAGGG TACCACATGG GAGTTTTCTG CTCGCTTGAA GCTTGTCGAC TCAACGACTG GAAGGGGAGC CTCATGTAAC ACAGGCTCGT CCTCGGAAGG CGAAATGTGC CCTCAGGTGC AGCTCATTGT GCGCGACCAA TCTTGGACTC AGCATTCTTT CAGGGTCAGC GGCTTCGACG GTGGGGACAC TTGGGTAGCC AATGGGTTTA ACGAGTTCAA AGGCTATTGG ACAATCCCAG CGAATGGTTC AGGATGGCAA GGGGGTGTCG CAAACATGCG AGTGATACTT TCCGAATTCC CATTTGGTAT GGATCTGGTT GTTGACGATT TCAGCATGGT ACTGACGCTT GACACGAACA AGAGTCTCGC ACAGTCTCCA ACCTTGAGTC CGGTGCTGGC CGTGCCTCCT CCAACCCAAG CTCCTATCCG CATCCCAACT CAGGCTCCTG TCCGCATTCC AGCACAAGCT CCAATCCGCA TTCCAAACGA AAGTCCGCTG ATTGGTCCCG GAACGGACCT GAATGCATGC GCTAAGATGA TCGCGAATTC CAACTTTGAT CTGGGGTATG AGGGATACTG GTCAGTTCCT AGCGGCGGAA GTCTGTCAAA CAAGGAAGGC TTCAGTTCCT CGACTTCCAT GTACTACGAT TCTGGGAACC GGCGAAGATA CTGGATAGGT CCTGAGTACA AATGGCCAAA TGGTGTTGAA TATGGGTGTT TGTTGCAAGG AACAACATGG AAGTTTACCG CTAAATTTCA ACTTATCGAC GCTGCAACTG GTAAAGGAAG CTCTTGCAGC GTTAATTCAA GTAAAGAAGG AGAAATGTGT CCCCGCGTTC GACTGACTCT CCGAGACGAT 
GGGTGGACTC TTCGAAACGT GAGGATCGAT GGATTCAGCG CGGCGGATAC CTGGGACGCC AACGGCATGA ATTCGTTCAC GGGCTACTGG ACTGTCCCAG AAGACGGCCC GGACTGGCAG GGTCGTGTCC GTAATATTCG CTTGGCGATT GTTGATTTTC CCTTTGACAA GAATCTAGTC GTTGATGATT TTCAGATGGC GTTTTTCAAT ACGAACTAG
|
Protein sequence | MYSHLVLGLA ILFLGASQCN SNLIRGNSSM SASEAGDSEA LTTTLSRRLV ENFVPLRCNA AIGKAPCSSF VTMFGNDSVR SNRVIIPCGK CVTMDYPGPQ LTLNDGIDIR GKLVFPDGYR LTVRTNLVSV QGELEMRSSK PVDGNPDVRF VMQGTESASF APISSNAAKC GGNCDAGARS IAVAGGKVNL NGLPTSTPTW LHIQDVLGSS AIVVSNSVRN KWATGATIVI TSEHQGYFGE QVRKITSISN VGSNSVRLNL DQPINRPVTL RDSPDFATEV ALLSRNIVFE GAPGTKGGHF WIMHTPRVRQ RIEGVELVNF GQEGLLGRYP IHFHMCGDVS GSVVVKNTIR NSNQRCIVVH GTNNLLVQEN VAYFTKGHCY MLEDGIETGN QFVRNIGIRT VKPEVIIPNM GSNGKESDRS ASTFWITNAD NSWIGNVVAG SEALGFWFEL LVRGNLANEH QDFDPMMVPT RKFEDNVVHS VFGVGMTYYL SGYIPETLQY FKNNKFFRNH HLALRIHRSR NIVLTGNKFS DNRYAIQIDR DEEIHVTDTT IVGYSDLFKD VVRRNRFAQA PCAQGISFQS TNPWQKKMDS ELNGVILDKV RFSGFSNAVC SSSTAIELDS RLDGYKSFEM FSQYSGATVS DANSIDFCRG KSAGARDVYV SDTTGSLLDG VASAPSTLMV NSPELASFVN PSACTENAAR CYTYCSNTCF RTVHYYVPVG QSRDYKLKVC DRKDASDCTV LSGYVHFNHP WPRRFAVHVP SGREYDTYFL DKGVPVYPTN VEIVFQEKLC PTAPDDDDIA LLYKARGTTF PPTPSPTTAL AACGNLIANS DFERGFNGYW DAQGVGTLST TAGYKSATAM YYASGNRNRY WVGPSHQWRE GLDLKCLKQG TTWEFSARLK LVDSTTGRGA SCNTGSSSEG EMCPQVQLIV RDQSWTQHSF RVSGFDGGDT WVANGFNEFK GYWTIPANGS GWQGGVANMR VILSEFPFGM DLVVDDFSMV LTLDTNKSLA QSPTLSPVLA VPPPTQAPIR IPTQAPVRIP AQAPIRIPNE SPLIGPGTDL NACAKMIANS NFDLGYEGYW SVPSGGSLSN KEGFSSSTSM YYDSGNRRRY WIGPEYKWPN GVEYGCLLQG TTWKFTAKFQ LIDAATGKGS SCSVNSSKEG EMCPRVRLTL RDDGWTLRNV RIDGFSAADT WDANGMNSFT GYWTVPEDGP DWQGRVRNIR LAIVDFPFDK NLVVDDFQMA FFNTN
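
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block-formatted gene sequence could be cleaned and its GC fraction computed is shown below. The source does not say how its GC content value was derived, so the function `gc_content` and its rounding are assumptions made for illustration only.

```python
def gc_content(block_formatted_seq: str) -> float:
    """Return the GC percentage of a DNA sequence, ignoring whitespace.

    Assumes the input uses only A, C, G, T (case-insensitive); any other
    non-whitespace character raises ValueError.
    """
    seq = "".join(block_formatted_seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    unexpected = set(seq) - set("ACGT")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)

# Toy input in the same block layout as the record (not the real gene).
toy = "ATGCGCATTA GGCC"
toy_gc = gc_content(toy)
print(round(toy_gc, 1))
```

To apply this to the record, paste the full Gene sequence field as the argument and compare the result with the GC content field.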