Gene Information |
Locus tag | PHATRDRAFT_24775 |
Symbol | |
ID | 7196544 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1511953 |
End bp | 1514266 |
Gene Length | 2314 bp |
Protein Length | 711 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176801 |
Protein GI | 219110099 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
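The Start bp, End bp and Gene Length rows above agree with each other if the coordinates are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. A minimal sketch of the check follows; the function name `gene_length` is hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Start bp and End bp rows of this record.
START_BP = 1511953
END_BP = 1514266

print(gene_length(START_BP, END_BP))  # 2314, matching the Gene Length row
```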
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATACCTGCCT GGCTCAACGA TCGTCTATTC GAATACCAAA GGATAGGACT CAAATGGATG TGGAAGTTAC ATCAGGAAGA AGCTGGAGGT ATCATTGGCG ACGAAATGGG GCTCGGAAAA ACGGTACAAG CGAGCTCGTT CATTGGAGTC TTGGCGGCTT CACGCAAGCT AAAATCTGTT TTGATTATAT CGCCGGCGAC CATGCTTCAG CATTGGCTCA ACGAGTTGGC TGTTTGGGCT CCTGGCCTAC GGCGCATTTT AATTCATCAA TCTGGGGAAG GTGACGGATC GTCACGAAAT ATAACCGCGA GTCTTTTGAA ATCTTTAGCA AAGTGGCTGA AAGGAGTCAG AGCTGATCGA TTGTATGAAC CTATAGACGT AGAAGACCTG GAGACGTTAC CTTCCCACTC CTTCTGTGGG ACGGGTTATG TCGTGCTTAC AACCTATGAA AACGTACGCC GCAATACCGA TATTTACACG GAACACGCAT GGTCGTACGT TGTGTTGGAC GAGGCGCAAA AGATTCGAAA TCCAGACGCA GACATAACGT TAGCTTGCAA GCGAATACGG ACTCCTCACC GGTTGGCTAT GAGTGGCACG CCAATTCAGA ATGATCTAAA AGAATTGTGG TCGCTCTTCG ATTTTGTTTT TCCAGGACGA CTCGGCACTC TCCCTGCTTT TGAACAAGAG TTCGCCGATA CCATCAAACG AGGAGGATAT TCCAACGCGT CTCCCATGCA AGTACAGCTT GCCTACCGCT GCGCCATGGT CTTAAGAGAT CTGATCAATC CCTATCTCCT TCGGCGTCAG AAAAAGGACG TTATTGAGGT AAGTCGAATG CCTGGTAAAA CCGAGCATGT GCTTTTTTGC CGGCTAAGTC AACGACAGCG GGCTCTCTAT GAGGCATTCT TGCTGTCGGA CGAAGTTACA AAAGTTGTGA AGGGCTCCAA GCAACTCTTT GCTGCAGTCA CAATGCTACG AAAGATATGC AATCATCCAG ATCTGGCTTG TGATCCAGAC GAGGCTTCTT TTGAGTCTTT TGTTCGGAAC GGTTACGTGA ATCAAGGCGA CCTCGACGAA GACTTGTCAG ACCTTGACAG TGACATTGGA GAGGAGAAAT CGCTTGTGGA GCGGTCTGGA AAACTCGAGG TCCTGTCCAA AATCCTTCCG CTGTGGAAAA AACAGGGACA TCGTGTATTG ATATTTTGTC AATGGCGCAA AATGCTAGAC ATCATTGAAC GCTTGATTAT GTTGAAAGAA TGGAAATTTG GACGACTCGA TGGCAATACC AACGTCGCTT CACGACAACG ATTGGTGGAT CAGTTCAATT CAGATGAGTC CTATTTTGGA ATGCTATGCA CAACCCGAAC TGGAGGTGTT GGGCTCAACT TGACTGGTGC CAATCGAATC ATTCTATATG ATCCAGATTG GAATCCGCAG ACTGATGCTC AAGCCAGGGA ACGCGCTTGG AGATTTGGAC AAGAGCGTGA AGTGACAGTT TATCGTTTGA TCACTGCTGG GACTATCGAG GAAAAAATCT ACCAACGCCA AATATTTAAA ACTGCGCTTT CTAACAAAGT GTTGCAGGAT CCGAGGCAGC GTCGCCTGTT TTCTCAGAAA GATTTACGGG ATCTTTTCAC TTTGAAAGCT GATGCTGGGA GTGTTCGATC TGGAGGGGAA GGCCTCACAG AAACTGGAGC AATAACTCGC CATGGTGGAG TCGTTAATAT CGACGAAGAT CCGACTGATG AACCATCGCT TGACAACGAC GAAGCTTTAA AAACAGTTAT GAGGAGTAGA 
GGGCTCGCGG GAGTATTCGA TCACAATTTT GTCGAAATCG ACTCGACGAA AAAGTCAAGA ACGCTACGGG AAATGGAAGA GGAAGCAAAA AGGGTTGCAA AAGAAGCTAT CGACGCTTTA CAGCAAAGTG TCGCAACTAA GCAGCGGTTC GTCCCAACTT GGACGGGCTC CGAAGAAACG CAACAGAGAC GCTTTGGAAC TCCCAAAATG CACATGGTCG ACTCGAAAGA CAGCCTTAGT TCGCGAACGT TGTTAGCATC TATCCGCCAA AGAGACAACG CTGTTCATTC CGGCGGAAAC CATTTGCCTC CATCCGACGA AAGTCAAGAG TACGCCAAGC TTTTGGCCAA AATAAAAGAT TACGTGTACC GTTATCGACC GACAACAGAT GATTTGCTGA AGGAATTTGA AAGCGTCTCA AACACTGATG TCGCGATTTT TCGCAGGCTA CTAAAGTCAG TGGCGAATGT GGATTCTGGA AGGTGGTTAT TAAAATAGGA AACAGTTCAC CGTC
|
Protein sequence | MWKLHQEEAG GIIGDEMGLG KTVQASSFIG VLAASRKLKS VLIISPATML QHWLNELAVW APGLRRILIH QSGEDLETLP SHSFCGTGYV VLTTYENVRR NTDIYTEHAW SYVVLDEAQK IRNPDADITL ACKRIRTPHR LAMSGTPIQN DLKELWSLFD FVFPGRLGTL PAFEQEFADT IKRGGYSNAS PMQVQLAYRC AMVLRDLINP YLLRRQKKDV IEVSRMPGKT EHVLFCRLSQ RQRALYEAFL LSDEVTKVVK GSKQLFAAVT MLRKICNHPD LACDPDEASF ESFVRNGYVN QGDLDEDLSD LDSDIGEEKS LVERSGKLEV LSKILPLWKK QGHRVLIFCQ WRKMLDIIER LIMLKEWKFG RLDGNTNVAS RQRLVDQFNS DESYFGMLCT TRTGGVGLNL TGANRIILYD PDWNPQTDAQ ARERAWRFGQ EREVTVYRLI TAGTIEEKIY QRQIFKTALS NKVLQDPRQR RLFSQKDLRD LFTLKADAGS VRSGGEGLTE TGAITRHGGV VNIDEDPTDE PSLDNDEALK TVMRSRGLAG VFDHNFVEID STKKSRTLRE MEEEAKRVAK EAIDALQQSV ATKQRFVPTW TGSEETQQRR FGTPKMHMVD SKDSLSSRTL LASIRQRDNA VHSGGNHLPP SDESQEYAKL LAKIKDYVYR YRPTTDDLLK EFESVSNTDV AIFRRLLKSV ANVDSGRWLL K
|
| |
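The GC content row can in principle be recomputed from the gene sequence. The record does not say how its 47% figure was derived, so the sketch below only shows one common definition: the share of G and C among all bases, with the spaces that separate the 10-base blocks removed first. The function name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence contains no bases")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence above, as a small example.
print(gc_percent("ATACCTGCCT GGCTCAACGA"))  # 55.0
```

To apply it to the whole gene, pass the full Gene sequence field as a single string; the block spacing and line breaks are stripped by the function.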