Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50519 |
Symbol | |
ID | 7199241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | + |
Start bp | 294198 |
End bp | 297146 |
Gene Length | 2949 bp |
Protein Length | 982 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185412 |
Protein GI | 219130521 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGACG TTGTGTTCTA TTCCCTCGTA GCTCTCGCCA AGGCGGGCAT TGAATGCTGT CAAAACGCCC AAGTCTACAA GGCTGAAGCA GTCCGCATCG GCGAACGGCT AACGATGATA GTGGCTCTAG CGCGTCAATG GAGAGGTGGT TGTAGCAACG AACAACTCGA ATGCTTTCAC AGAGTCGTGG TCAATGTACA CGAATGCATG AAAGCTGCCT TTTTGTCTGA GAGCAAGAGT GTTGCATGGA ACAGAAAGTT AAAGGTGACG TTACAATCTC AAATGCTGCT CGAGAATCTC GTCCAAGCGG AGAGCCAGTT GAACACCGCC ATCGCTGACT TTCAAGTGAT GCAGTCCAAC ACAATCTTTT CAGACTTTCT TGACTTCAAA GAAGGGGTCA AAGAAATGCT TGATCGGTTT GGTTCGTGTG TTATGAACCA ATCGGGCTCC GTTACAATCA AACAGCAATT TGATCAGGTC TTGGCGGAGA CTCAAGATCA AGCTCCCGAA ATCGGCATTT CCACTCCCGA GGATGGTAGA TACAACGCAA CAATAAAATA TGACCCTGCT GGTCAAACGG GTGGCAAGCT ACAGCAGCGG AAAAAGTCAA TGTTTCGTCC ATCCGAAGAA GACGTGCTCG CAATTACGCT CAGACCTTCC TTGTTAGTCT TTTGCGACGA TCGCAACAAC CTTCTCGGAG GTGGAGGATT CGCAGAAGTC TTTCACGGAA CCTATGACGG ACGACCAGTC GCCATCAAGC GCCTCAATGT ATCCATTCGA GATGTGATGT CACTATCGAC TAAGCAGATT ACTAGCGATG TGGAACAACT TGCTGCCGAA GCTATCCTAA CCCACAAATG CGGTGCACAT TCCAACATCA TCCAGGTACT TGGCTGCATC ACCGCACTGA ATAAAACCAC GAGACCCTTG CTCGTAATGG AACTGATGCA CATTACGCTA TTTGATGCGC TGCATGACCA CCGTGTAAAA GATAAACTGA CATTTTCCCA CTGCCTCTTT CTGTTAAAAG GTATTGCTGG AGCCTTGGAG TTTCTCCATC TTCAAGGAAT TGTCCACCAT GACATCAAAT CTCTCAACAT TTTGCTGAGC GAAGACCTCA CCGTAGCCAA ATTGGCCGAC TTTGGAGAAT CGAAAGTGAA AGGTCTTAAC ACAACAAAAC CACGGCTTGA GAGAATAATG ACAACGTCTT GTCATCAGGG CAACATAATT GCAGGGACAG CAGCCTACCA GGCACCAGAG ATTCTCTCGG AAGATGTCAA TGACATATCA CGCGTTTGCG AAGTCTATTC TTTCGGGGTC ACAGTTTGGG AGTGCGTGAC AAGAGAGATA CCACATATGG GTAAAAAAGA AGGGTCTATA GCTCTTTTGG CTGCAAACAA GAAACACTTG CCTATGCTTG CGATGCCCTT GCACCCCTCA AAGGATCTTC CAACAACAGA AATCGGTTCC TGGGAAGCGC TGAGAAAAGT CGCCGCATTG TGCCTCTCCC GCGACCGCTC GATGAGACCC ACTGCTTCTG GAGTTGTTGA GCTTTGGCAC CACGTCGACA CTCCTTCATT CCCTCCGGAG AGCTTGTGTT ATGCCAGTAC CAGGTGTGTT TCCGATCCTC CACTACCGGC TCACTCGGCA TCTCAGAAAT CAACAATTCG AGGCCACGCG TTGGATGCGC CAGATTTCGA AGAAGAATCC AAGGCAGGAG GTCTGAGGAA ACGTCGCTAC ACTGTACTGT CAGTTATCGC AATCCTTATC GGGTTGATAG TGATCATAGT TCTTCGCTTA CAGAGGTCCA GGGCATCTTC AGGTAAATCT ACGACATTCC CTGAATCGGA AGCGCCGTCT GGTATTTCAA TCTCCACTCT CCCGCCCACC CAATTACCAA GGGCATCTCT GATACCGCAA ACAGAGGCTC CGGTCTCTTT GTCCCCACCT ACAGGTCCGC CGCTGTCATC CCAATCTCTA ACAACCCAGC CATCATCAAC ATCCATGCCG CCGAGAGCAG GTCCGGTTTC CACGACCCCG CCCACTATTT CACCTCTCAC GGCCCAATCT CCAACAACAA GCACGCAACC ACGTCTGTCC TTTCAAACAA CACAAGAGCT TTACGATGCT GTTGATATTT ACACTGCCGC GACGGACTCC ACAAATTCTA CGGCGGCAGT GACGTACGGC TATCCCATCG GATCATGGGA TGTGTCCCAA ATCACCGATT TTTCGCAAGT CTTCGATAGC TTGAATCGAA ACAGCGCGGT TGGGATTTTC GACGAAGATG TGAGCGGCTG GGATGTCTCT GCCGCAACGA CCATGTCTGG CATGTTCAAG GGTGCGTCTA CTTTTAGCGG CGATCTTTCG TTGTGGAACG TAAGCCAGGT AAGGGATACG TCATCCATGT TTGAGGGAGC AACCTCGTTC GACGGTAACG TCTCGCTATG GAATGTTGGG CAGGTAACGA ACATGTCTTC CATGTTTTTT GAGGCGAGTG CCTTCAATGG CGACCTCGCT TCGTGGAACG TGGGGCAGGT AACGAACATG AGTTCAATAT TCTTCCTTGC GTCTAGCTTT ACAAGCGATC TTTCGTCGTG GAATGTTGCA CAGGTAACGG ATTGGTTTGC CGCGTTCAAA GGAGCAGCCG CCTTTACCAG TGACCTTTCC AAGTGGAATG TGGGAAAGGT CACAAACATG CGTTTGATGT TCTACCACGC GTTTGACTTT AATAGCGACC TCACGTCGTG GGATGTTAGC CAGGTGACGG ATTTGTCGTC AATGCTCGAG GGTGCGACCG CATTCACCGG CAACCTCTGC TCGTGGCTCA CACAGATTCC ACCGAGTTGC AACGTGGATC GAATGTTCTC GTTCGCCTCG TCCTGTTCAG ACCTTGCAGC CACCGTACTC CCGGACGGAC CCATGTGCCA TGCTTGCGTT CCAATGTAA
|
Protein sequence | MADVVFYSLV ALAKAGIECC QNAQVYKAEA VRIGERLTMI VALARQWRGG CSNEQLECFH RVVVNVHECM KAAFLSESKS VAWNRKLKVT LQSQMLLENL VQAESQLNTA IADFQVMQSN TIFSDFLDFK EGVKEMLDRF GSCVMNQSGS VTIKQQFDQV LAETQDQAPE IGISTPEDGR YNATIKYDPA GQTGGKLQQR KKSMFRPSEE DVLAITLRPS LLVFCDDRNN LLGGGGFAEV FHGTYDGRPV AIKRLNVSIR DVMSLSTKQI TSDVEQLAAE AILTHKCGAH SNIIQVLGCI TALNKTTRPL LVMELMHITL FDALHDHRVK DKLTFSHCLF LLKGIAGALE FLHLQGIVHH DIKSLNILLS EDLTVAKLAD FGESKVKGLN TTKPRLERIM TTSCHQGNII AGTAAYQAPE ILSEDVNDIS RVCEVYSFGV TVWECVTREI PHMGKKEGSI ALLAANKKHL PMLAMPLHPS KDLPTTEIGS WEALRKVAAL CLSRDRSMRP TASGVVELWH HVDTPSFPPE SLCYASTRCV SDPPLPAHSA SQKSTIRGHA LDAPDFEEES KAGGLRKRRY TVLSVIAILI GLIVIIVLRL QRSRASSGKS TTFPESEAPS GISISTLPPT QLPRASLIPQ TEAPVSLSPP TGPPLSSQSL TTQPSSTSMP PRAGPVSTTP PTISPLTAQS PTTSTQPRLS FQTTQELYDA VDIYTAATDS TNSTAAVTYG YPIGSWDVSQ ITDFSQVFDS LNRNSAVGIF DEDVSGWDVS AATTMSGMFK GASTFSGDLS LWNVSQVRDT SSMFEGATSF DGNVSLWNVG QVTNMSSMFF EASAFNGDLA SWNVGQVTNM SSIFFLASSF TSDLSSWNVA QVTDWFAAFK GAAAFTSDLS KWNVGKVTNM RLMFYHAFDF NSDLTSWDVS QVTDLSSMLE GATAFTGNLC SWLTQIPPSC NVDRMFSFAS SCSDLAATVL PDGPMCHACV PM
|
| |