Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50740 |
Symbol | |
ID | 7197023 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 1322340 |
End bp | 1325201 |
Gene Length | 2862 bp |
Protein Length | 800 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178119 |
Protein GI | 219112735 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTTGAAAGAC TGTGCCTGTG CGTTCATCAA GGAACCACTG CCCAGATCGA AGTCCCCCTG GCGCAAACGA AAGCCCCTTC ATCAATCAAC CAGCTGGTTG ACGTACACGA AAGAATTGGA AAGGAAACTC ACGAGACCGG GTAGTTTTCG TGGTCTGACG GCCGAACAAT GGTTTCATCC TCCAAATCAA CATGGATGAT GGCGGGGGCT TGCCTTATTT TGATGGCATT TCAGGTGCGT TAAAGCTTAA CCTGTGTATT CATTAAAACG TGTGGTAGTC GATGTCTGGT GACATAGGGA AAATTGGTGC GGCGGCTACT GTGGACTCCC GAAAGATTGA CTCGTTTATT CCAGGATATC TTTCTCACAA AAGTGGCTTC TTATCGCATC CGTGTTTTCA TTCTGTAATA TTGTGGGACA CAGGTCCAAT CATTTACGTT CGTTCCGGCA ACACGAGCAA CATCGACAGT CAAACGGGTT GCCCCGGCCT TTATGAGTGC CGTCGCTGAC GACAAATCGG AAACATCCGA GACGGAAGCG GCGGCACGCG CTCGTCTCGT CATGGAAGCC GAGGCCGCCA TGAATGGTGG CAGTAGTAAG ATTGGCATCA AAGACACCAA ACTTATGGAT CTGGGTGGTA GGCCGTTTCC GTTGTCCATG ATTGTTGGTC AGGATTCCAT CAAACAGGCC TTGTTGCTAT CGGCAATTAA TAATCGTATG GGTGGGGTAG TAATTTCGGG AGGCAAGGGG ACTTCGAAGT CTGTCATGGC CAGAGCTCTG CATCAACTGC TACCTCCGAT TGAAATCCTT AAGGATTCTG CCTTCAATAT TGACCCCGAA GGTGAATTTG GACTCGATGA TTTCACTCGT ACTGAAATCG ACAATGGCGG CACTCCGTTG GCGGACCGCG AGACGGAAAT CATTCCGTGT CCCTTCGTGC AAGTTCCGCT TAACGTGATG GAAGATCGTT TGATAGGAAG TGCCGATTTG GAAGAAAGTG TCAAATCGGG CAAGACGGTT TTTGCTCCTG GCTTGTTGGC AAAGGCACAT CGAGGAGTTC TCTACGTCGA CGATATCAAT CTCCTCGACG AAGAAACAGC CAATATCCTG TTGAACGTTG TCTCCGACGG ATATGTCCTT GTTGAGCGTG AGGGTATATC ACTGCGATAC CCTTGCCGTC CGTTATTGAT TGCTACCTTC AACCCGGACG AGGGAGAACT CCGTGACCAC TTGCTTGATC GCATCGCAAT TGCGCTTTCG ACCAACGCTG ATCGCTTGGA TATTGGACAG CGCGTAGACG CCGTCGAAAG TGTACTAGAT TTCGCTAGTT CCGGAAAACA AAAGACCGAC AAGGCAGAAG TTGCTCTGCA GGAGGCTATC GACAACGAGG ACGATCTCAA AACAGCCATC GTTTTTGCTC ACGAATACAT CAAGGACCTG AAGGTTGCGC CTTCGCAGAT GCAGTACTTG TGTGAAGAGG CCATCCGCGC CGGGTGCCAG GGACATCGTG CTGAAATTTT TGCCTGTGAA GTGGCTCGTG CGAGTGCCGC ACTGGAAGGT CGCCAGGTAA CTAGTGAGGA TCTGCGTCTG GCTGTCAAGC TTGCGATTGC TCCCCGTGGA ACCTTTATAA ATACACCGAT GGATCCGGAC GAGATGATGC CCCCGCCACC GCCACCTCCG CCGCCGCCAC CGCAAATGGA CGACCAAAGT CAGGATCAGG ATGAAGACCA GGAAGATGAC GATGAACAAC CGGATGAAAA AGAAGACGAG GAGGAAGACG AAGATCGCGA AGATGAAGCC CCTGATGTAC CGGAAGTACC ACAAGAGTTC ATGTTCGACA TTGATGCGAC CCCAATGGAC CCAGATCTCA TTGACTTTAC TAGTCGGGAA CGTAGCGGCA AGGGAGGAGG CCGTGGTCTC ATTTTTTCGC AGGACCGTGG CCGTTACATC AAGCCCATGT TACCGAAGGG CAAGGTAATT CGTCTGGCTG GTAAGTTCAG TGATCAAAAT GTTTCCTTGG AATAAGAAGC TGTACTATGT ATCTTATACT TGCATGACAT TGCTTGCAGT TGACGCAACT CTCCGTGCGT CTGCTCCTTA CCAAAAGTCA CGTCGTGAGC GCGCGGTTGG TACCTCAAAG GAGGGGCGTG GGGTCCACAT TCAACAGTCG GACGTGCGCA TCAAGAAGAT GGCACGTAAG GCAGGTTCAT TGATTATCTT TGTCGTGGAC GCTTCCGGAT CGATGGCTCT CAACCGCATG AATGCCGCTA AGGGCGCGGC CGTCAGTCTC CTGACGGAGG CATACCAAAG CCGTGACAAA ATCTCACTCA TTCCCTTCCA GGGAGAAATG GCCGATGTTC TGCTACCTCC CACGAAATCC ATCACCATGG CTCGGCAACG GCTCGAACAA ATGCCTTGTG GAGGAGGCTC TCCACTAGCT CACGCACTGC AATTGGCAAC GCTCACCGGT ATTAACGCCC AGAAGAGTGG GGACGTCGGT AAGGTTGTGG TCGTATTGAT TTCCGACGGT CGAGCGAACG TTCCCCTTTG TGTGTCCATG GGCGAAGAAT TCGACCCAGA ATCAGACGAG GATTCCAAGG ATGGCAAGCC CAGTCGAAGT TATTTGAAGG ACGAAGTATT GGCGTGTGCC AAGCGACTGG GATCCCAAGG AGGCTTCAAC TTGCTTTGTA TTGATACGGA AAACAAGTTC ATTTCGACCG GTTTGGCCAA GGAAATTGCC GATGCCGCTT TGGGCAAGTA TCACCAGATT ACCAAAGCCG ATGGGAAAGC GATCGCCAGC GTGACAAGCC AAGCGCTGAA CCAGATTAAA TCCAAGTAAA GATTTGACCA AATTGTCGTC GT
|
Protein sequence | MVSSSKSTWM MAGACLILMA FQVQSFTFVP ATRATSTVKR VAPAFMSAVA DDKSETSETE AAARARLVME AEAAMNGGSS KIGIKDTKLM DLGGRPFPLS MIVGQDSIKQ ALLLSAINNR MGGVVISGGK GTSKSVMARA LHQLLPPIEI LKDSAFNIDP EGEFGLDDFT RTEIDNGGTP LADRETEIIP CPFVQVPLNV MEDRLIGSAD LEESVKSGKT VFAPGLLAKA HRGVLYVDDI NLLDEETANI LLNVVSDGYV LVEREGISLR YPCRPLLIAT FNPDEGELRD HLLDRIAIAL STNADRLDIG QRVDAVESVL DFASSGKQKT DKAEVALQEA IDNEDDLKTA IVFAHEYIKD LKVAPSQMQY LCEEAIRAGC QGHRAEIFAC EVARASAALE GRQVTSEDLR LAVKLAIAPR GTFINTPMDP DEMMPPPPPP PPPPPQMDDQ SQDQDEDQED DDEQPDEKED EEEDEDREDE APDVPEVPQE FMFDIDATPM DPDLIDFTSR ERSGKGGGRG LIFSQDRGRY IKPMLPKGKV IRLAVDATLR ASAPYQKSRR ERAVGTSKEG RGVHIQQSDV RIKKMARKAG SLIIFVVDAS GSMALNRMNA AKGAAVSLLT EAYQSRDKIS LIPFQGEMAD VLLPPTKSIT MARQRLEQMP CGGGSPLAHA LQLATLTGIN AQKSGDVGKV VVVLISDGRA NVPLCVSMGE EFDPESDEDS KDGKPSRSYL KDEVLACAKR LGSQGGFNLL CIDTENKFIS TGLAKEIADA ALGKYHQITK ADGKAIASVT SQALNQIKSK
|
| |