Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45310 |
Symbol | |
ID | 7200014 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 801504 |
End bp | 803913 |
Gene Length | 2410 bp |
Protein Length | 780 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179510 |
Protein GI | 219117431 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTGGG AATCCGATCA TGTTAACGAC ACTCATACCG ATGGTGATCT AGCTAGTGAA AATGGCTCAG TCTGCAGCGC TACGGACGAG AAAGTGCTTG CTCACAAAGA AACGAAAGCA GTGAATATAT TACGCGCAGT CACATTCTTT GTTCTTTTGT TGGCGACTGC GACTGTTTCG CTGTTGGTCT TTTTTTATGG TCGCGACAAG GAAAATGACG AATTTTTGAA ACAATTCGGC TACAACGCGG GAAAAGTAGT CGATTCCTTC CAGGTGAACG CAGAGAAGCG GTTGTCCGCG CTCGAAGGAT TCGCCACTAT GATCACATCT CACGCACTTT TCGCAAACGA AACGTTTCCG ATGGTGACGT TGCCAGACTT TGAGCGAAAA GCGTCCTACA CTCTGCAGCT TGCACAAGTT ATTTCGATTC TCATTTTTCC AATCGTCAGC CGAGAAAATC GCGCCACTTG GGAAAGATAC TCAGTAGAAA ATCAACGCTG GCTTGAAGAA GGTCTTGCTC TGCAAAAAAT CGTTAAGGAT GGAGATGAAG AAGAGGCATT GATGCAGTTG GAGGAGCAAG TGGTAGCCGG AAACTTGGAT GTCGACCCCT TCGCTCACTT GAATATCCCA CCATTCATTT TCAAAGTCGA GGAAGGTGGC ACAGCAGCTG CTTACGAGAC GGGTCCAGGG CCTTATGCAC CAGTCTGGCA ATTGGCGCCG GCAATTCCTG CAGCATTTTT CGTAAATTTT AACGGCCTTT CCCATCCTAG CCGAAAATTA GAGATCAACA CCGTTCTTCG AACCGAAAAG AGGCTGGTGA GCGCAGCAGC CGATTTCTCC AATGACAATG ATCCAAATTC AGCTGGTAGA AAGGCAGTGC TAAATCTCTT TCTGAATCGA TGGAAAAGTG GTGGCAATGA TTACGATGAA GGGCCCGTCA GCGACATAAT CATTCCAGTA TTTACTAGTT TTGGAGAGAA CAAGACAGTT GGCGCTCTGC TGAATTCTTA TATCTATTGG CAGGTTTACC TCACTGACAT TTTAACGGAT GAAGCTGAAG GTATTGTCTG CGTACTGGAA AACAGTTGCT CACAGAGCTT CACTTATCGC ATTGATGGAA AAGATGCAAC ATACATCGGA CAAGGTGACT TGCATGACCC CAGTTACAAT GGAATGATGG TTGAGACCGG ATTCGGTGCC GTGGTCGGAA ACAACAACGT TGATTTCAGT ATTCACGAAC ATTGTTACTA CAATCTTCGA GTTTACCCAT CCAAGGAGAC TGAAGACAAA TACATCACGT TTCAGCCCAT CATGTTCGCT TTGATTTTGG TGGCAGTTTT TGTGTTCACA TCCTTTGTTT TCGTCACATA CGATTGTCTT GTCCAGCACC GCAACAGCGT TGTTAACACA TCAGCAATAC AATCCAGCTC TGTGGTTTCT TCTCTATTTC CTGAACAAGT CCGCAACAGG TTGCACAAAG TATACAAAAG CGAAAAGTCC AAACAGCACA ACCATACTGA CATCTTTAAA AGCATTACCA GCGACGGAAA GTCGAGAGAC GATTTCGAGG CAGCTGACTT AAATGAGTTC GACGATTCGA CTCCTATTGC CGATTTGTAC CCAAATTGCA CCGTTCTGTT TGCTGACATT GCAGGCTTTA CTGCGTGGAG CTCCGAACGA GCGCCTACCG AGGTCTTTAA GCTCCTCGAG ACACTGTATG GAGCCTTTGA TAAAATTGCG AAGAAATACA AGGTATTCAA GGTAGAGACA ATTGGTGACT GCTACGTCGC CGTGACTGGC CTACCCACCC ACGCCACAGG ATGCCCACGC CGTTGCAATG TGCCGATTTT CCAGTTCGTG CAATACTAAG ATGAACCAAA TGATGCATAT CCTCGTTGAG AAGCTGGGTC CGGATACAGC AAATCTGTCC ATGAGATTTG GACTGCATAG TGGCCCGGTC ACGGCTGGGG TACTTCGAGG TGAAAAGGCA CGATTCCAGC TTTTTGGAGA CACGGTAAAC ACGGCTGCCC GAATGGAAAG CACTGGGCAA AAGGGGCGGA TCCACATATC GAAAGCTACC GCTGCGCTCA TTCAGAAGGC TGGTAAGGGT AGCTGGATGA AGATTCGCGA GGAACTTGTA GAGGCCAAGG GCAAGGGAAT GATGCAAACG TACTGGGTCG AGCCGCCGGA CTTTGGTACG ACGTCTACAG GAATTTCTAG TAATCATGAT GTCGAGGACG CCTCAGAGAG CCAGCATCTA CGTTTTACTG CAAATGAGTT CAAAAACAGC AAAATCGATG CAATGAGATT CAAAGAGCTT ATGGATAGCT TGAGGTACGC GGAATCAGCA ACTACCGGCG ATTTGAATGC AGCTCTTCCA CAAGCAAATA CTTCGTCGGA GAAAGATTGA
|
Protein sequence | MKWESDHVND THTDGDLASE NGSVCSATDE KVLAHKETKA VNILRAVTFF VLLLATATVS LLVFFYGRDK ENDEFLKQFG YNAGKVVDSF QVNAEKRLSA LEGFATMITS HALFANETFP MVTLPDFERK ASYTLQLAQV ISILIFPIVS RENRATWERY SVENQRWLEE GLALQKIVKD GDEEEALMQL EEQVVAGNLD VDPFAHLNIP PFIFKVEEGG TAAAYETGPG PYAPVWQLAP AIPAAFFVNF NGLSHPSRKL EINTVLRTEK RLVSAAADFS NDNDPNSAGR KAVLNLFLNR WKSGGNDYDE GPVSDIIIPV FTSFGENKTV GALLNSYIYW QVYLTDILTD EAEGIVCVLE NSCSQSFTYR IDGKDATYIG QGDLHDPSYN GMMVETGFGA VVGNNNVDFS IHEHCYYNLR VYPSKETEDK YITFQPIMFA LILVAVFVFT SFVFVTYDCL VQHRNSVVNT SAIQSSSVVS SLFPEQVRNR LHKVYKSEKS KQHNHTDIFK SITSDGKSRD DFEAADLNEF DDSTPIADLY PNCTVLFADI AGFTAWSSER APTEVFKLLE TLYGAFDKIA KKYKDAHAVA MCRFSSSCNT KMNQMMHILV EKLGPDTANL SMRFGLHSGP VTAGVLRGEK ARFQLFGDTV NTAARMESTG QKGRIHISKA TAALIQKAGK GSWMKIREEL VEAKGKGMMQ TYWVEPPDFG TTSTGISSNH DVEDASESQH LRFTANEFKN SKIDAMRFKE LMDSLRYAES ATTGDLNAAL PQANTSSEKD
|
| |