Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_16455 |
Symbol | |
ID | 7198680 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | + |
Start bp | 340121 |
End bp | 343013 |
Gene Length | 2893 bp |
Protein Length | 683 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184866 |
Protein GI | 219129376 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCTTTCGGGA AGGATTTGAA ACGGACTCTG GTACAGGCCG GCAAGCTCGC CGACCAAATG GGTTCCACCA CGGTGGGATC GCAACACGTT TTCTTGGCAC TCCTCGAATA TTCCGAAGGT GGTGGCGGGA AAGACAAGGC CCCAGCTTCA GCCGCGACGT TGGATCCGGA CAGCGACGAG TGTGGAGGTT GGGCGGTACT CGTGAAAATG AACGTCCTGG ACGATACCGT CACGGCGTTG GACGTTTGCG AATCGCTCCT GCAAAATATG GCCGACCAAC CCGACCAAGC CCGCAGGGAA CTAGTCACGG GCGCGGGTGG GTCGGGCAAG ATGCCCACGC TTACGGAATG TGGAGTTGAT CTGACGCAAC AAGCTGAGGA TGGATTGTTG GATCCGGTCT ACGGAAGGGA CGACGAAACA CGAGCCTGTG TACGCACCTT GATTCGGCGG CGCAAAAACA ATGTTTGTTT GATCGGTGAA CCAGGTACGT TGGATCCGTG GTGGCCTTGG TAGGGTGGGT CGGTACCGAG CGCCGTTCGT ACGGATTCTT GAGCTCGTCG GGGTTTTGGC AATGGCAAGA ATATCTAACA CTGTGCCGGC TTCTTTTGGA TTTTCGCATC GAGTAGGTGT TGGAAAGGTA CGACATTTTC CGTTGTTACA AACGGGTGCG CATCGTTGGG AATCCGTACG GACTCTCACT TTGCCTTTGC GTTTTCTTTT CTGCTGGTGT TGGGAACGGT GCGTAGACGG CCATCGCCGA GGGCGTCGCG CAGATTCTTG TGGACGAGAA GCTCTGTCCG GCCCGCCTCA AAGGCCATCG CCTCTTCAGT TTGGAACTAT CCAATCTCGT GGCGGGCACC AAATACCGAG GAGAGTTCGA AGAACGGTTG CAGTCCATTA TTAAGGAAGT CACCGACCCC AAGGCTCCGC CAACGATTCT TTTCATTGAC GAAATTCACA ATCTCGTGGG GGCAGGGGCG GCGGAAGGCG GCATGGATGC GGCTAATTTG CTCAAGCCCG CGCTCGCTCG GGGGGAACTG CAACTCATCG GGGCGACCAC AATTATGGAG TACCGCAAAT ACATCGAAAA GGACGCAGCC TTGGAGCGCC GGTTGCAGCC GGTTATGGTC AAGGAACCGT CCGTTGTGCA GACCATTGAT ATTCTACAAG CCGTCCAGTC GAACTACGAA AAGCACCACG GGGTCACGTA TACATCGGCT GCGCTCAACG CCGCCGCCAC CTTGTCCGAT CGATACATGA CCGATCGATT TTTGCCCGAC AAAGCGTTGG ATCTGTTGGA CGAAGCGGGG GCTATTGCGC ACTTGGAAGA GCCCGATGAG GATGTCACAC CGGAAGTAAC GGAACATACC GTCGCGATGG TAATCAGTGA ATGGTCCGGT ATCCCCATGG GAAAACTGGA AACGCAAGAA ATGGATCGTT TACAGGCACT CGAAGGTGAA ATGGGACGTC GCGTCAAGGG ACAGGGCCGG GCCATTCGTG GTGTAGCCCG TGCCGTCCGT CGGGCCCGGT CCGGATTGCG CGATCCACGC CGACCCGTCG CTTCGTTTTT GTTTTGTGGG CCGACCGGAA CGGGCAAGAC GGAATTGTGC AAGACACTGG CGGAGACCTA CTATGGCTCG GAAAAGGACA TGATTCGTAT TGACATGTCG GAATACATGG AAAAGCATTC CGTGAGTCGG TTAACAGGGC CACCACCAGG GTAAGCAAGT AGCGTTGAAC AATGGTGGCT GACGTTCTTC GCTTATCGTC ATGGAATTCC TCACGAATTT GTTCTGCTTT CACTCGTACT CTCAGATACA TTGGATACGA AGAAGGGGGT CAATTGACGG AGGCCGTACG CCGTGCACCA CATTCGGTGG TGTTGCTCGA CGAATTGGAA AAGGCGCACG GAGACGTTCT CAATATTTTG CTCCAAGTTA TGGAAGATGG GATGTTGACG GATGGTAAGG GCCGAACAAT CAACTTCAAA AACTCCATTT TGGTCATGAC TTCCAATGTG GGGAGCCGGC GGATTCTGGA AGTTGCTCGT TCTGGTAGTG GCGCTTCCGG TGCTATATCT ACGCGGCCCA TCCCAGTCGC CGCCGCTTCT GCCGACACCA TGTCTATTGA GCCCATGAAG CCGGAAGAAA TTTTAAAGAA AATGCAGAGC AACCCTGAAG CAGCCTCTCT ACTGCTAGAA GCCTCGTCGG ACCCAAAAAT TATGGGTGCC ATCCGGACCG CCATGAACGG TTCTCCAGCC GATCTGCTGA AATCAGGCCG AGAAGATCCG GAAGTAGCAA AGTTTTTGCA AAGGCTGTGG GGTGTACTGC AAGATGAAAA GCCTTTGGCG AATGGTGACT CGAAACGTCC CAAGTCTGGT TTGGAAACAA TTCGCAACAG CTTTGAAGAT ACCGTGTCTG AATGGACCGA GACGGTCAAG GACCGATTTG CTACAAGTGT TATTGACCAG ATGAGCGCTG CACAGCATAC AGGGGACAGT ATAGTCCAAA AAGATCATTA CCTCTATGCT GAACTCGCTC AAGTCGTGAA GGAGGAGTTG GAAAGAGAAA TGAAACCGGA GCTACTGAAT CGAATTGATG AGATTATTGT TTTCTCGCCC CTCTCGACGG GCGACCTGTG GATGATCGCC GAGTTGATCG TTGCCAAAAT TTCAGAAAGG GCATTAAAGG AACAAAAGCT GGAACTGAAG ATTGATCGAT CCGTGATAGA GCGAGTAATG GCCGAAGGAA GCGCCAACGC TGACCAGTTT GGTGCTCGTC CCATGCGCCG AGCGGCTCAG CGTTTTGTCG AAGACAGTCT GAGTGACGCG ATCATTCAAG GTTTTCTTCA AGAAGGTGAA GGTGCTACCG TCTCTCTCGC GTCGAAGGCC TCC
|
Protein sequence | PFGKDLKRTL VQAGKLADQM GSTTVGSQHV FLALLEYSEG GWAVLVKMNV LDDTVTALDV CESLLQNMAD QPDQARRELV TGAGGSGKMP TLTECGVDLT QQAEDGLLDP VYGRDDETRA CVRTLIRRRK NNVCLIGEPV GVGKTAIAEG VAQILVDEKL CPARLKGHRL FSLELSNLVA GTKYRGEFEE RLQSIIKEVT DPKAPPTILF IDEIHNLVGA GAAEGGMDAA NLLKPALARG ELQLIGATTI MEYRKYIEKD AALERRLQPV MVKEPSVVQT IDILQAVQSN YEKHHGVTYT SAALNAAATL SDRYMTDRFL PDKALDLLDE AGAIAHLEEP DEDVTPEVTE HTVAMVISEW SGIPMGKLET QEMDRLQALE GEMGRRVKGQ GRAIRGVARA VRRARSGLRD PRRPVASFLF CGPTGTGKTE LCKTLAETYY GSEKDMIRID MSEYMEKHSV SRLTGPPPGY IGYEEGGQLT EAVRRAPHSV VLLDELEKAH GDVLNILLQV MEDGMLTDGK GRTINFKNSI LVMTSNVGSR RILEVARSGR DSIVQKDHYL YAELAQVVKE ELEREMKPEL LNRIDEIIVF SPLSTGDLWM IAELIVAKIS ERALKEQKLE LKIDRSVIER VMAEGSANAD QFGARPMRRA AQRFVEDSLS DAIIQGFLQE GEGATVSLAS KAS
|
| |