Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42822 |
Symbol | |
ID | 7196427 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1227886 |
End bp | 1230842 |
Gene Length | 2957 bp |
Protein Length | 837 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177245 |
Protein GI | 219110987 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAAAA GCCCACTGCA AACAAAGGAA AAATTCCCTC CAATCGGCAA ATCCCAAAAA TTATCATCAA ATGCCAACCC GACGCAAAGT TTCAATCAAG CCGTGGCACA CGAACGCGAA AGTGGCAAGC CACGAAGTAG TGCAGAGGAA TCTGTTGATG AAGGATGCCG CGTTTGTGGA ATGGATGACA ACTATTCCAG ACTATTACTT TGTGAAGGAT GCAACGGAGA ATATCATACG TACTGCTTAA CTCCTCCACT TGAAAAAGTC CCAGTCGAAG ACTGGTATTG CGGTAGGTCG CAGAAGCAAT GCAGCGTGTT TTCATCATTG GAGCACTGTA TACGCTAACG ATATCTATTT TTCTAGATCG GTGCACAGCT CTTGTCGAAA TCCTGAACAA GAAAAGTGGG GGAGAACCGA TTGGCTCTAT CCCTCTTATT ATATCTCAGG GATCCGAAGG CAATTCCAAG GCTGATCCAT CCCTGCCATC AAATCACTCT ACTAAAAAAA AGTCTTTGGT CGAAAGTTCG CCGTATGATA CTGCAGATGA GGAATTACCA ATGAAAGAGA TTGATACGCG TAACGGATCC ACCGCTCCAG AATACCTCGA TGAAAACACG CTACGTTTGG TTAGGCTGTA CGTTGCGAAG TGTCGACCTC AAGATATGCT AAACGAAGAC GACGTGATTC TGCTCATGCA CAGAATGGAC CGGCTCACTG CATACCAGGC TGAGTTTCAG CTGAATATTC CTGGGCCAGC CAATGAAAGA CAAGAGGTTT TGCTGGAAGA AATTAAAGAC GAAGAAGAAG ACCATGTTCT TGGGCGAGAG TTTTTCCACA AACTTGTCGC AAAAATTCAG AAGGAAAATG TTGAATCACT CGAGAAACGC ACACTCGCCT GGTTAAAGCG GGCTACTGGG ATCCATAGGA GTAGTACTGA ATCGTTCGTA AGAGACAACA CATCTAAAAC AGGCTCCTCT AGAACGAAAG GGTCTGGTCG GCGATGCACT AGAGGAATGG TTCGTCAATT CAGAAAAAAA AAGAGTGAGA AAGGTCTCGA AATACTAAAA GGGGCATTGG ATGAGCTGGA GCCGACGCTT GGACCCTTTC ATTCCGCTTT TCGGAAGCGC AGGCGTTCAA TATCGACCTA CGATGACGAC GAAAGAAGTT CGGGGAAGCA GCGAAAAAAA CAGGAACAAG TACAACGAAA GAATCAAAAA GGAATGAAGG CTAAGATGAA TAAAGAGATG AAGAATGGGC TTAAAAGAAC GAAGCCAAAT AACGAAAACA AAAGGAAAGC CTCTCCAAGC TATCCCAATC TGCTTCATGC TTTGGATTGG AAGAGACGAA AGCAGCTACA TTCGAAAAGG AATAAACCTG GATCGAGCAT ATTGCTGGAC ATAAACCATG CGCAGAGGTC GTTGAAGTTG CCAAAGAAGC TCCGAAACAA GGGCATCGTT ATGACTTTGC CTCGCTGGGG CAGCATCAAA CAGGCAGAAT CATTCGCTCC CAAAGAATCG GTGTTTGGTC CCCCTCGTGG GCTTTACTCT GACCAACACT ATTCTTTCTG GTCTTTAAGA CTCATGAACT TTCTTCAAAG CAGCGCGAGG ACTTGGGTGT CGCATGAGTT TTTTTACAGT GACCTTGATA AAGCTTGGTA AGTTTCATCT ACCGTCGGTC TTGCCATTCA TCGAGTTCCA TATGCTCACG ATTTTGCGTT AGGTACAATA GCAGTGCTCT TTCGAAAATG GCAAGAAGGT TTGGTGTAGA TCCAACGATC AGTTTAGATT CAGCAGAATG GAAGTGTGTC AGACGTGCTT TACATGGGAT AAAAGCGAAG CCGCGTCGCT TTTCACGCTG CTTTATTTCT GAACAGCTAC ATGAAAGGGA CGAATTTCGA AGTGGAGTCC GGCTGCTTCA ACAAAATTTA GGTGCATCCC ACGCCGCCTA TGATTTGAAG TCCTGTATCC CCGTGGGATC AGTTGTAACT GCATACAGTC AAACGTTCGG TATGCTTCAA CGTGGTACTG TCTTGACGTT TGAAGCTCGG AACGCCCATT ATCTGGTGCG GTTCGAAAAT ATGGACTTTG GGTACGAGTA CTGCCCAGAC TCTGAAGTCG CGAGCCATGG TTCGGTATTG CCACTCCATG TGAGCGGCGG GGCGGAGTCA AAGACAACTA CCTCTAATGC AATACTTCGA AAGTATTGTG GTAAGTGCAG ATACAGCTCG ATTGACAAAC TAAATTTCTC GTTTGACAGT GCACTGATGT ACTTATCTCT CTCCCCGCAC AAAATTTTGT TGACAGCGCC GGCATGGTCA CGGTCTATGA AGTTGGCAAC GGAGCTCGAA GGTTGCACAG AATTTACCCC TTTTCGCAGC CAATCGCAAA AAAAGGGACG TCGGTCCATT TCGGAGACAG ACAGCCATTG GGCATTTATA AAAGAAGCAG CTGAAGAAGA AACGTTGCAA TGTCTACTTG ATGTCATCAA TACGGCAGCT ATACGAAAGT CTGCACTTTT GAAAGCGATC GACACAGTAA TCACTCCTGC AAATTCAGAG TTGGAGAAAG CAAAATCGGC CATTCAAACA CAAGAACGAG AAGGAAATCT TGCTTTGCTT GTTTTGAACT TGGAGAAGAC GAACAAAACG ATTCGTAATA GTGTCCGGAA AATACGGCTT CTGTACGCCC AAGTATATCC ATTACGAATG TAAGTTGATA TGTGAAAGTT GGCGTAGCAA GCTGCCTTCA CTGACGCAGA AATCTTTATT CACCTCCTCG CAGCACTCAA GTTACGTCTC ATCATGATAC AATTTCCCGG GCCATATACC ATTCTTCTGA ATGCGGATAT GGTCCAATCT CTGATGCGCT TATATATCCC TGGGTTACGA GTCTTTTGGA GAATACCGGG ACTATAGGAA AATTTATCGC GGCATCTCTA CTTCCCCAAC GGCGTGA
|
Protein sequence | MTKSPLQTKE KFPPIGKSQK LSSNANPTQS FNQAVAHERE SGKPRSSAEE SVDEGCRVCG MDDNYSRLLL CEGCNGEYHT YCLTPPLEKV PVEDWYCDRC TALVEILNKK SGGEPIGSIP LIISQGSEGN SKADPSLPSN HSTKKKSLVE SSPYDTADEE LPMKEIDTRN GSTAPEYLDE NTLRLVRLYV AKCRPQDMLN EDDVILLMHR MDRLTAYQAE FQLNIPGPAN ERQEVLLEEI KDEEEDHVLG REFFHKLVAK IQKENVESLE KRTLAWLKRA TGIHRSSTES FVRDNTSKTG SSRTKGSGRR CTRGMVRQFR KKKSEKGLEI LKGALDELEP TLGPFHSAFR KRRRSISTYD DDERSSGKQR KKQEQVQRKN QKGMKAKMNK EMKNGLKRTK PNNENKRKAS PSYPNLLHAL DWKRRKQLHS KRNKPGSSIL LDINHAQRSL KLPKKLRNKG IVMTLPRWGS IKQAESFAPK ESVFGPPRGL YSDQHYSFWS LRLMNFLQSS ARTWVSHEFF YSDLDKAWYN SSALSKMARR FGVDPTISLD SAEWKCVRRA LHGIKAKPRR FSRCFISEQL HERDEFRSGV RLLQQNLGAS HAAYDLKSCI PVGSVVTAYS QTFGMLQRGT VLTFEARNAH YLVRFENMDF GYEYCPDSEV ASHGSVLPLH VSGGAESKTT TSNAILRKYC APAWSRSMKL ATELEGCTEF TPFRSQSQKK GRRSISETDS HWAFIKEAAE EETLQCLLDV INTAAIRKSA LLKAIDTVIT PANSELEKAK SAIQTQEREG NLALLVLNLE KTNKTIRNSV RKIRLLYAQV YPLRMKIYRG ISTSPTA
|
| |