Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41857 |
Symbol | |
ID | 7197913 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 1231010 |
End bp | 1232650 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178390 |
Protein GI | 219115189 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.968452 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCCCCG ACAAGTCCAA AAACATTCTC AAAAACATCA ATCTGTCCTT TTATCCCGGC GCCAAAATTG GGGTGGTGGG CTTGAACGGA TCCGGAAAAT CAACCTTACT CAAAATCATG GCGGGAGTGG ATACCGAGTT TGACGGTACC GCCCGGCCCT TGCCCGGAGC TTCGATTGGC TACCTTCCAC AGGAGCCGGC CCTTCCTTTT GCTACCGTCC AAGAATGCGT CGACGAAGCT GTACAATCCT CGCAAGCCAT TCTTGATGAG TATAACCAAC TCAGCATGAA GCTAGCGGAT CCTGATCTCA CCGATGATGA AATGAACAAG ATCATGACCA AGACGGAACA ACTGACCAAT CAAATTGAAG CTGGAAATCT GTGGGAACTC GAGCGAATTG TGGAGCGCGC CATGGATTCC TTGCGGGTGC CACCCGGGGA CGCCAAGACG GCCGTCCTGT CGGGTGGCGA AAAGCGCCGC GTAGCGCTTT GTCGCTTGCT CCTGGCCAAT CACGATATGC TTCTACTAGA CGAACCCACG AACCATTTGG ACGCCGAATC GATCGGATGG TTGGAGCAGT TCTTGGCACA GTTCAAGGGA ACGGTGGTCT GCATCACCCA CGACCGATAC TTTCTCGAAA ATGTGGCCGA GTGGATTTTA GAGCTAGACC GAGGAGAAGG CATCCCGCAC GAAGGCAACT ACTCAAGCTG GCTGGAGGCC AAGAGTAAAC GTCTCGAGGA GGAAAAGAAA AAAGACACCG CGGCAGCCAA GGCTGTTGCA GCCGAACTGG AATGGATTCG GAGCAACCCC AAGGCCAAGG GCAACAAAAG TAAGGCACGC CTCAACCGCT ACGATGAGCT ACTGTCCGCT GCTGCTCCTA CGGAACTCCG GAACGCGGGA CAAATCTACA TCCCCCCGGG TCCTCGGTTG GGCGATGTCG TGGTGGATAT CACCAACATG CGCAAGTCGT TCGATGAGCG CTTGCTAATT AAGGATTTGA GCTTTTCCAT GCCCAAAGCT GGTATTGTGG GCGTCATTGG CCCGAACGGT GCCGGCAAGT CGACACTCAT CAAAATGCTA CTCGGCAAAG AGCAACCCGA CTCTGGTGAG GTCAAAATCG GTGAGACCGT GAACATCGTG TCTGTTGGGC AGGAACGCAT GGATGAGTTG AACTCGGAAA AGACTGTGTT TGAGGAAATC TCCGGAGGGC TCGATGAGCT CGAGCTGGGC ACCCAAACTG TGCAATCTCG TGCCTATCTT TCCTGGTTTG GGTTTAAGGG AGGAATGCAG CAGGCCAAAG TGGGAAATCT ATCAGGTGGC GAGCGCAATC GTGTCCAGCT CGCCAAGATT CTCAAGGCCG GTGGCAATAT GATTATTCTA GATGAACCAT CGAACGACTT GGACGTCGAA GTCTTGCGCA GTCTGGAAGA AGCGCTGTTG AATTTTGCGG GCTGTGCCAT GGTGGTGTCA CACGATAGGT ACATGTTGGA TCGCGTGGCG ACCCACATTC TGGCCTGCGA GGGTGATTCG GAATGGTTCT TCTTCCCAGG CAACTATGCC GAATATGAGG CCAACCGTCT GGAACGCAAG GGCCAAAGCA GCATTAAGCG CGTCGCCTAC GCGCCTTTGC TGAACGCGTA G
|
Protein sequence | MLPDKSKNIL KNINLSFYPG AKIGVVGLNG SGKSTLLKIM AGVDTEFDGT ARPLPGASIG YLPQEPALPF ATVQECVDEA VQSSQAILDE YNQLSMKLAD PDLTDDEMNK IMTKTEQLTN QIEAGNLWEL ERIVERAMDS LRVPPGDAKT AVLSGGEKRR VALCRLLLAN HDMLLLDEPT NHLDAESIGW LEQFLAQFKG TVVCITHDRY FLENVAEWIL ELDRGEGIPH EGNYSSWLEA KSKRLEEEKK KDTAAAKAVA AELEWIRSNP KAKGNKSKAR LNRYDELLSA AAPTELRNAG QIYIPPGPRL GDVVVDITNM RKSFDERLLI KDLSFSMPKA GIVGVIGPNG AGKSTLIKML LGKEQPDSGE VKIGETVNIV SVGQERMDEL NSEKTVFEEI SGGLDELELG TQTVQSRAYL SWFGFKGGMQ QAKVGNLSGG ERNRVQLAKI LKAGGNMIIL DEPSNDLDVE VLRSLEEALL NFAGCAMVVS HDRYMLDRVA THILACEGDS EWFFFPGNYA EYEANRLERK GQSSIKRVAY APLLNA
|
| |