Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41238 |
Symbol | |
ID | 7199061 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 274194 |
End bp | 276580 |
Gene Length | 2387 bp |
Protein Length | 738 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185164 |
Protein GI | 219130002 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAAG AGAAGAACAA TGCTTGCGAC TTGATGTTGG ATCGCTTAGC GCAGAATGTG TCGCGGAATC CTGCAAAACG AGCCGTTGCA TTCTTGGCCG GGGGGCGCAA CGGAGGGAGT CTCCAAAAAG AGTTGACTTA TCAAGAACTG GAGACCGAAA CAACGAAAGT AGCGAAACAT CTTATTGGGA AGGGTATCGA AAAAGGAGAA TGGTAAGTAG CAAAGTGGCT GGCGGATAAA GAGTGAATGA AGCAACGCCG AAGCATCCTG TCTCATGCCT TTTCCTTTCC GTTCTTCTAT TTAGCGTGGT GTTAGTCTAT CCTCCGTCAC TGGACTTTAT GATAGCATTT CTGGCGTGTC TCAAGGCCAA TGTCGTGGCG GTACCTGTGT TCCCGCCAAA TCCCCTCCGT CGCGATACCT TGGCGATGTT TGCCAACATT GTACAAGGAT GTGGTGCCAA ACACGCCTTG ACGAATACCG AATACAACCA CGCCAAAAAA ATGGCCGGCA TCCGTGACGT TTTTACAAAG TTTCAACGTC CGACCTCGGG GTGGCCAGAT GACTTGGATT GGACTACAAC GGACACTCTC AAAGAGCCGA GAAATTCTGT CAACTTGCCG CAACCCCCTA GCGACCGATC GCAAGTCGCC TTTTTACAGT ACACAAGTGG ATCGACCAGC GAACCCAAAG GTGTAATGAT TACGCACGGC AATCTTGCTC ACAATCTGAC CATCATTACT AACGATTCTC AAGCCAAAGA TGATACAGTT GTCGTATCTT GGCTTCCCCA GTATCACGAT ATGGGCTTGA TTGGTTCGTA CCTTGGCGTC TTGTTTTGTG GTGGGACGGG GTACTATCTG TCACCCCTCT CCTTCCTACA ACGACCCATG GTCTGGATAG AGGCAGTTTC CCGGTATCGG GCCACCCACT TACAAGCTCC CAACTTCGCG TTTAAGTTGA CTGCGCGCAA ATTCAGTATC GATGCCTCGA ACACTGAACT TGACTTGTCC AGTGTCCGGC ATGTCATCAA CGCCGCCGAA CCAGTTGATG AAGAGTCTAT CGATAACTTC TACAGGATTT TCGGCAAGTA CGGATTTGCA AACGTTATTT ATCCCACCTA CGGTTTAGCA GAACATACTG TCTTTGTATG CTCGGGTGGC AAACAACGCC TCACTGTGGA CAAAGCCAAA CTCGAAATTG ATGCCAAAGT TGTTATTTTA GAGGACGACG ATCACCAAAG CACTGATATC AAGGCTGTTT CCAAGCTTAT TGGCTGCGGC TTCCCTTCTC GTCAAAACGT TGACGTTCAA ATAGTGGACC CAGAAAGTTG CAAAGCTTTG GCTGGAAACT TGGTCGGTGA GATCTGGATT CGTTCGCCTA GCAAAGCAGC CGGCTATTTC AACAAGCCGA AAGAGACAAA AGAAGATTTT CACGCGGGTC TTGTCAGTGA CGACGGTAGC AGCATTGGCA ACGCGGTAGG TGGTTACCTG CGCACTGGAG ACCTTGGCTT TCTACACAAA CATGAGCTTT TTATTTGTGG TAGGCTGAAA GATCTCATTA TCGTCGGTGG CCGGAATTAC TACCCACAGG ATATAGAGGC GACAGCTGAG GCTTCGTCGG ATCTAGTGCG ACTAGGGTGC TCTGCTGCTT TTACAATCGA TCCAACCCAT GAAGGTGGCG AGGAGGTTGC GCTTGTTATG GAACTCAAAG AAGCGCCATC TTTGAAAGCT ACTCAGACAG TTTGTGAATC ACTGGCGAAC CAGATCAAGT CCGCTATCAA TCAAGAACAC TCCTTAGGAC TGACAGATAT TGTGTTTTTG CACCCGCGCA CGGTTCCGAA GACGAGCAGT GGGAAGATTG CACGGTCCTG GTGCCGAAAG GGATTCATCG CAGGATCATT AAAGATAATC TTTCGCAAAT CATTCAAGAG TCAATCATTT TCACTGGAGA TGGAGGAGAC AACATTTGAC ACTCCTACGC CTCGTCCGGT GTCTTCGGAT CAATCAAGTA AAATTCGAAG CATGGACAAG AAAGAGATTC TCGCCAAGCT TTCGACCGAT ATCTCTCGAG TCGCATCTAT TTCTCCCGAT GCGTTGGACA AAAGCGCAGC TCTCATATCC ATGCTCGACA GTCTTTCGCT CTCTCAATTC AAAGGTATGT TGGAGAACAG TTATTCGGTC GACATCTCGG ACGAGTATCT TTTTCGCGAA TCCACGACCT TACTGAAACT AGTGGAAGTG GTAAAATTAG GTTACGCGCC TGATGACGAA GCAAATACTA CCCCTGCAAC CTCTGCGTCA AACGGAGCCA TTTCAACGCC TGGTCAAGCT AAAGGCATTG CTGGGGTTTT GGGCTGTCCA CCCGGAGTGG TGTGTACAAT ACTCTAG
|
Protein sequence | MSEEKNNACD LMLDRLAQNV SRNPAKRAVA FLAGGRNGGS LQKELTYQEL ETETTKVAKH LIGKGIEKGE CVVLVYPPSL DFMIAFLACL KANVVAVPVF PPNPLRRDTL AMFANIVQGC GAKHALTNTE YNHAKKMAGI RDVFTKFQRP TSGWPDDLDW TTTDTLKEPR NSVNLPQPPS DRSQVAFLQY TSGSTSEPKG VMITHGNLAH NLTIITNDSQ AKDDTVVVSW LPQYHDMGLI GSYLGVLFCG GTGYYLSPLS FLQRPMVWIE AVSRYRATHL QAPNFAFKLT ARKFSIDASN TELDLSSVRH VINAAEPVDE ESIDNFYRIF GKYGFANVIY PTYGLAEHTV FVCSGGKQRL TVDKAKLEID AKVVILEDDD HQSTDIKAVS KLIGCGFPSR QNVDVQIVDP ESCKALAGNL VGEIWIRSPS KAAGYFNKPK ETKEDFHAGL VSDDGSSIGN AVGGYLRTGD LGFLHKHELF ICGRLKDLII VGGRNYYPQD IEATAEASSD LVRLGCSAAF TIDPTHEGGE EVALVMELKE APSLKATQTV CESLANQIKS AINQEHSLGL TDIVFLHPRT VPKTSSGKIA RSWCRKGFIA GSLKIIFRKS FKSQSFSLEM EETTFDTPTP RPVSSDQSSK IRSMDKKEIL AKLSTDISRV ASISPDALDK SAALISMLDS LSLSQFKVEV VKLGYAPDDE ANTTPATSAS NGAISTPGQA KGIAGVLGCP PGVVCTIL
|
| |