Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_34362 |
Symbol | |
ID | 7199779 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 374349 |
End bp | 377951 |
Gene Length | 3603 bp |
Protein Length | 837 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178990 |
Protein GI | 219116390 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000328786 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGAAT CGACACGCCG CTCTTCTCGC CGCCGCGAGA AAACTGTGTA CAGCGTCGGA GATCTTGTCG AGGTGAGTTG GTGGTGCGAT ACTTTTTCAT AAACCTTAGC CGCGCTCTTA TTTCACTCCA TTTGAATGCA TTCTTCAGGT GACCCGCGAC GAGTCAATTG CTACGGGTAG ACTTGCATCG AAACAGACCG ACACTGCTAA ACCCCGTTGG CTTGTAAAGT TTGACGAATC TTCATGGCCA AGCATGGAGC TATTGGAGAC TGAACTTGGG CCTATTCTTG ATAGAAGCGA CGACAACGCG TCTCAAAAAG AAAAGCAAGT AAGGCAAAAA TCATCGTACG AGTTAGGAAC CACTTTGGGT GGGCGCGGCT CCCCAAGTAA GCGATCTTCC TCGCCGATGG TATCAGGCAA GAGCGAAAAC GGAAAGGCGT CCCCGATATC TACGAACAGT TCAGATTCCA AGAAGAAAGT AGAGTTCATC GCTCTTCAGG AAGAGTCTGA CATGTCTGGG TCCAAGCAGA GACCCGGTTC TTTATCCAGG GAGGAGCGAA GCAAACGTCG TCAGGCCATG ATTGAGCAGG ACAAACTGAA TTGGTCGAAG CCAGTTATGT CGCGACCCCC GAAAAAGAAA AAGGCGCAGC GCGATGAAGA AGTTGTTCGG GTACCAATGC TTACGGGTAC ACTTTTGCTA TATCGCGGTG CTCATCGCCG GGCCGAATTT GTGCGTAAGT TTTGAATGAC ACTGAGTTAA AAGCTTACAT ACGCAGATCA TATGGTAAAT TTGATTTGTA GGCGGATTTA GATCAACGAA ACATATTTTA CATATGATGG CAATTGAATA GTAGTGTGCT AGAGACGCTA TTCCTTAGAC TTTATATTCG CGAATATAGC GCATTCGGAT AGGCGCGTTG CAATAGAAGT ATTGGAGACA AAGAAACTGC AGTAACGTTC TTAACTCCCA TGTCTTGTCA CGAGCGCTGT GTTCGGAGGT TTTCCTCGGA AGAATGTCGA TGCGAAGGCT GTAGAGGAGG AGTGTGTGAG ACGTTCGTTC ACAGTCATGT CAAGGAAGCA TGTGAAACGA AAACACCACA ACATTTTTCA GTTTTTTGCA TTCATCATTT TTCTAAAGGA CGTTTTAAAG CGTTAATTTG AAAAACTGTC GAGACATATA CATAGAAGAA ATCCCAAGGA AGAGCTAGGT AAAGTAAAGA CTTTTCACTT TTACATTCTT TCGATGTCGT ATCTTGTAGT TACAGATAGT GATATCAACC CACCTGCGAG GCGAGTCGCT TTCACCAGTT TTGGCGAAAA TAGCGGCGGA GTGGCAGTGA TTGTGCCGTC TGTTTTAAAT TTCGCCCACA TGAATCTACA GAAGTAAGCT TGAGACTTGG CATGCATGCC CGTTTCCGGA CCATTCAACA AGTGCTTTAT TTCGATTTTT TGACTTATGG TACTAAGCAA CAGCAAGGAC GAACACGGAG ACTGGGTGGC ATTCCGTGAC ACGCCAGCTT CTTTCGAAGC CAGGAGTTTA AAGGAAAAGA GAAGTGTACT CAATGTTGCT TCCAAAAATA TCTCGCGCAG AAACTCCGGT ACAGGTTCTT GGATGGGAAA ACCCATAAAT GCTGATGCAT CGTGGGTCCC GGTGGCCTCC CCAGCACCAG GAACGTATTC GCCTCGTCCG TCCGGTCCAC AAGGTTCTCG CATATTGCTT AGAGAGTTCG TGGGGAATCG CAAGAAAGAG CGGTCTGATA TTTTGACCCC ATCTCAACCA TTTATGTGTC GAAGCAGCGC AACACAGTCA TCCAGCTTAC AAGCTTGCAA GAGTGACATC TTGTCAGGAA AGAAACTCAA GAAAGGTAGA TCGGGCTCGT CCCCAGTGCC TCGTATTGTA ATCCCTCTGA AGCCGGACCC TTCGATACTT CGTACCCGTA CACGTAGTAC CACAGTGTCA AGCGGGGAAG ACGAAGATCA GAATCGATGT TCCCAAGCTC AAAATGGTAT TAGGGCTGCT AGTGCACATG AAAGGCGAGG ACGGTCCAAA AGCCGCAGTC GGCATTCGCG ATCACCGTCG CCTAGAACTA GGTCGAAGTC AGTTTCTGGG AATAGTCAAT CAAACAACCG ACGTGGGCGA TCCAGTAGCC GCCCACCTGT AACACGGCCG AACACTGTGC GTGAGTCTCG CGTTAAATCT CGATCAAGCA GCCTGACTAG GATCACAAAT CACAGGGGAC CGACAGTGGT TCCATCACCT TCAGCAAGGT CAGTAATTAA CTTTCCCCAT GCATCTCTGT CGACCACGTG TCATCGTAGA GAAAATGGCA GTACCGGACC AGGTATGCTC CCCTGTTCCA AACAGAGACG GGACCCTAAG ATCGGACGAG ATATCAGCTT TGGAACGGTG CATTTATCTG ACTCATCATC GGTATCAATC AGAAGTGAAA AGAGTGGACT GTTCGAGAAA GTTTTTGGCT TCCCAGGCGG ACAAGCTTTG CAGAAACCTC AAGTAAAGCA CTCCATTTCG ACACGACCCC GTATTCTTCT AGCTGCAACA GTGTACCACA ATACAGCGAC TGGTCTATGG ATCACAACAA TCAATACAAA TCAACGAGGA GTATCAAAAA ATCCTGCACA AGCGAATAAA TTTCTAAAAG CATTCTCGTT TCCTACAGAA AAGGAAGCTC GAGAGTCAGC TATCGCAAAC GCCCCACCGA AAATGGTCTC CTTTCAAGAG TCAGCTAAAT GCTTCCATTG CAGGAAACTT TTCGCAGTTT TCAAGCGCGC CTGTCATTGT CGAAACTGCG GTGTGTGTAT TTGTGCCAGC TGCTCAATAT CTTGGCCTGC TAAGATGCTT CCAGAAACAT ACAACCTGAA AAATGAAGCT TCCTTGAAAG TTTGTACAAG TTGTGATACT CTTAGTTCTC TCTTCAAAAA AGCGCTCTTG GAAGCGAAAT ATGAAGAAGC GATAGCAATA TATGAGACTG GTAACGTCAA CCTGCGTACT CCTTTTCCTC CTGCTAACAA AAAGGATGAA GTTCTTTATC CCATCCATGC TGCTATTGAG GGCGGCAACC TTAAGCTTGC GCGTTGGCTT GTCGAAGACC GCTTCTGCCC TCTAAAGCAA ATCAGAGCCG GGCGATCGAA GTCAGATAAA AACGCACTTA TTCAGACGTC GAAAGGGCGA ACTGTCTTGA GTATTGCTAT GGAGTTCCTT CGCATCGGTA TTCTACGATT TTTGGTTGTT GAAAGAGGAA TTTCCGTATT CGAAGCTACG GATACAAGAA GCGCCCTTCG GACTATTGAG GCGGCCTTGG TTGCTCTGCC CTGCTCTTCA GAAGGAGATG GAATTCGAGA AGACGGGGCT TCCATAGCGC GGTGGGACCA AGCCTACTTC GACGATATGT CGGAACCGAG TAGCCTCGGA GACGATGATA ATGTCACAAT TGTAAGCCGA TCGGTTCGAA CAAGAACGAA CACGGGCGAC TGTTGCATAA TTTGCATGGA TCACAAAATT AATTGTGTTG CGACTCCCTG CGGACATCAG GTATGCTGTT TGGGTTGCAG TGCGAGCCTT TCGGCATGCC CAGTTTGCAA TAA
|
Protein sequence | MTESTRRSSR RREKTVYSVG DLVEVTRDES IATGRLASKQ TDTAKPRWLV KFDESSWPSM ELLETELGPI LDRSDDNASQ KEKQVRQKSS YELGTTLGGR GSPSKRSSSP MVSGKSENGK ASPISTNSSD SKKKVEFIAL QEESDMSGSK QRPGSLSREE RSKRRQAMIE QDKLNWSKPV MSRPPKKKKA QRDEEVVRVP MLTGTLLLYR GAHRRAEFVL TDSDINPPAR RVAFTSFGEN SGGVAVIVPS VLNFAHMNLQ NKDEHGDWVA FRDTPASFEA RSLKEKRSVL NVASKNISRR NSGTGSWMGK PINADASWVP VASPAPGTYS PRPSGPQGSR ILLREFVGNR KKERSDILTP SQPFMCRSSA TQSSSLQACK SDILSGKKLK KGRSGSSPVP RIVIPLKPDP SILRTRTRST TGTDSGSITF SKRRDPKIGR DISFGTVHLS DSSSVSIRSE KSGLFEKVFG FPGGQALQKP QVKHSISTRP RILLAATVYH NTATGLWITT INTNQRGVSK NPAQANKFLK AFSFPTEKEA RESAIANAPP KMVSFQESAK CFHCRKLFAV FKRACHCRNC GVCICASCSI SWPAKMLPET YNLKNEASLK VCTSCDTLSS LFKKALLEAK YEEAIAIYET GNVNLRTPFP PANKKDEVLY PIHAAIEGGN LKLARWLVED RFCPLKQIRA GRSKSDKNAL IQTSKGRTVL SIAMEFLRIG ILRFLVVERG ISVFEATDTR SALRTIEAAL VALPCSSEGD GIREDGASIA RWDQAYFDDM SEPSSLGDDD NVTIVSRSVR TRTNTGDCCI ICMDHKINCV ATPCGHQCEP FGMPSLQ
|
| |