Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48204 |
Symbol | |
ID | 7203331 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 503602 |
End bp | 506712 |
Gene Length | 3111 bp |
Protein Length | 786 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182547 |
Protein GI | 219124515 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.3526 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGCCGC CTTCTAGCCT ATCAACCTCA CAGCGAGATG CCCTCCGTGC ACTTACTGAT GGATTCTTAC CTCCTTTATC GGTTCCTCAG GAGTATCAGC CCAAGTATGA GGCCCAGAAC CGCATCCACG AAGAATATTG GAGTCGACGT GTCTCCGCTG ACTCTGACTT TTTGAAGGCC CTAGAAAGTT CCATTGTCGA CAAGCTTCCC GCCAGAGAAT CTTTTCTTAC TCGATTTCTT TTAAAGGCTT TGTCAACTAG TGTTGGAACC GCGTTGTTGT TTGGTAAACC CACGCTACAC CCCTTTACGG AATGGCCCGT TCCACAGCAG ACGGCTCTGT TGCAATCATT GAAAACCAGC TCGATTGCTA CGAGACGGCA GATTTTCAAT GGATTCAAAC GCTTAATATG TGGATTAGCA TATTCCTATA CGGTCAATGG AACAAATCCT TTTTGGAAAG CAGTTGGGTA TCCCGGGCCA GCCCAAAACA CTCTCCTTTC CAAGCGAGAG GATACGGCTC TCGTCGCCAA GACGATGGAG CAGCAACGAC CGATCCGGGA AGCTCTCGTT CCCATTGATC TGGATACGGA ATATGAATGC GACATTGTGA TCGTAGGATC GGGATCAGGC GGAAGTGTAG CCGCTAGCGT TTTGTCAGAG GCGGGTTATC AAGTGCTCGT TCTGGAAAAA GGCACTTACA TTGCGCCAGC AGATATTTCC AACGAAGAAG CCGACGCGTT GGATCGCATG TACGAGACAC ACGGATTGCT TACAACGAAA GATGGCTCCA TGATGATTTT AGCCGGTGCG ACATTGGGGG GTGGAACGAC TATCAACTGG AGTTGCTGTT TGCCTTTACC CTCGTACGTT CGGGAGGAGT GGCGTTCTGA GCATGGTCTC GTGGACGACT TTAAAGAGGG AGGTGAATTT GAAACTTCGA CGCGAGAGAT TCTCAGTCTC ATGGGTGTCA CGAATAAGAT TACACACAAT GCGCTGAACC AGAAGCTTCA GCAAGGTTGC GATGCTCTGG GATACGAATG GGAAGCGAAT TACGTAAATT TGCTTCAAAC TGCCAACGCA ACAGCAGGCT ACATTTGCTT TGGAGATCGA TACGGCATGA AACGCGGTGC CTTGTCTGTT TTTCTTCCCA AAGCCATTTC TTACGGTGCA AAACTAATCG AAGGATGTCA CGTCGAACAA GTTATTCTCG GAGAGGGAGA AAATGGTCGT CGGAGGGCTG AAGGCGTTCG ATGCAGTGTG GGAGCCCACC GACTTCACGT CGTAGCAAGA AAAGCCGTTG TCGTCGCTGC AGGTGCTCTA CATACGCCCT GTCTATTGCG GCGTTCCGGC CTAAACAATT CGCATATTGG AAAGCACCTT CGCTTGCACC CTGTAACTGT TGCCGCTGGC TTCTCCAAGC CGACTGATCC TATCGAATGC TATCAGGGTG CGCCTTTGAC CACGGTCTGC AACCAATTTT CTCATGGCCC CGCCAACGAT GGATATGGGG CAAAGATTGA ATGTCCAAGC GCGCATCTTG GCCTTTTAGC TGCAGGCTTG CCTTGGACAA ATCCTGAACA ATTCAAAGAT AGAATGCTTC GTATTCGAAA TGGTGTGGTT TTCATCATCG TTCAGAGGGA CAAAGGCGAA GGCACTGTTT CGCTTGCTCG AGACGGAGCT ACCCCAGTTG TGGAATACTC TGTATGCCCA GCTGACAAAG TTAGCATGCA ACAGGCCGTC TGTGGTGGAG TGCGGATTTG TATCGCATCG GAATCCACGG AGGTCACAAC TGCGCACAGT CTCGATGAAG GTATGCACAT CTCTGACGGA GATTTTTTGC AGGAATATCT TTCAAAATTT ACGGCTTTGG GACTGAAGGA AAATGAGGTT GCATTGTTCT CTGCCCACCA AATGGGATCT TGCCGTTTGA GCGCGACCCC CCTTTCTGGC GCGTTGGATC CGAATGGGGA AGTTTGGGAA AGCGACGACT TGTATGTTAT GGATGCAAGT ACGTTCCCAA CTGCTTCTGG GGCGAATCCA ATGATAACGG TTATGGCAAT ATCGTTGATG CTAAGCAATC GCCTCGCTTT ACGGCTACAA CACGTGGACT ATAAGCTTCG TCGAGCTGGA GATATTCAAA AAGCGGAAGA AATGGCGAAG CGCCGACTAG AGCTGCGAAA TACTTTTTCT ATATCGCCCG AGAAAAACAG TTCTGCTGAA CGACCCGGTG CACATTGGAA CAGAATTGTG GATAAATCCT TGTCGATACT CATTTTGTTA ACCCTTATGA TACCGATCTT GCGGTCGTGG TTTTTCGATG TCCCGCTCGT CCAGGATCTA GTCAAGCATC CTATAATGTG AGGATATGGA GGTTGCTTCT TCGCTATCCT TGTATCTTCA GTACAATATT CAATGATAAC ACCGGGGAGG CGGATGGGGA GGAATCCGCA AAATCAGGGA ATCTTTTGTC AATATAAGAC GTTAATGTTT AGAGCAGAAG CGACAAGCCC CGCTGAATCG AAGGGAAATC AGTCTATGGA TGTCAGCTTT CCGAAGATGA TGCGTTCGCC ACTCACATCG ACCTCTTCAA AATACATCAA AATTTAAGAA CGAGTGGACC GATTATAAGG GCTTTACTCA GTCGAGACCA TAGCTGTGAA CTAAACCAGG AGTCCATTTT CATATGTTTC CGATTATCTG TTAGAGCAGC TATGCTGTCA TTGATCGCTA AGCTACAGAC TTCGTTGATG CGACGTCTCG TCCAAGGTTG TCAGGTACCA CAAATAGCTT TTTTGCAACT CAACTGAAGC CCGAAAAAGT CATTGCCCCT TCCACGTTAC AATATTACGC ACAATGAAAC ATGAAAGGAG TTCTTATGCA GCTGGGGCGA ATTTTGGCCA GCTTGGCTCT GAAAGCAACA AATTCAGATA AAAGAAAAGG TACTCCTCTG AGTTTAGCAG CTTTGAGTAA TCCGTTATGA GAAAAGTCAT CAGGGTTTGT TGACGACAGT GAGACGCTTC TGAAGCGTTG TAAAAGTGCG AATGAACGTG TCCCAGATTG TTGAAGAGAG CCAAAATAAG AAGGTTAAAA GACGCCGCAT AATTTCGTTG C
|
Protein sequence | MGPPSSLSTS QRDALRALTD GFLPPLSVPQ EYQPKYEAQN RIHEEYWSRR VSADSDFLKA LESSIVDKLP ARESFLTRFL LKALSTSVGT ALLFGKPTLH PFTEWPVPQQ TALLQSLKTS SIATRRQIFN GFKRLICGLA YSYTVNGTNP FWKAVGYPGP AQNTLLSKRE DTALVAKTME QQRPIREALV PIDLDTEYEC DIVIVGSGSG GSVAASVLSE AGYQVLVLEK GTYIAPADIS NEEADALDRM YETHGLLTTK DGSMMILAGA TLGGGTTINW SCCLPLPSYV REEWRSEHGL VDDFKEGGEF ETSTREILSL MGVTNKITHN ALNQKLQQGC DALGYEWEAN YVNLLQTANA TAGYICFGDR YGMKRGALSV FLPKAISYGA KLIEGCHVEQ VILGEGENGR RRAEGVRCSV GAHRLHVVAR KAVVVAAGAL HTPCLLRRSG LNNSHIGKHL RLHPVTVAAG FSKPTDPIEC YQGAPLTTVC NQFSHGPAND GYGAKIECPS AHLGLLAAGL PWTNPEQFKD RMLRIRNGVV FIIVQRDKGE GTVSLARDGA TPVVEYSVCP ADKVSMQQAV CGGVRICIAS ESTEVTTAHS LDEGMHISDG DFLQEYLSKF TALGLKENEV ALFSAHQMGS CRLSATPLSG ALDPNGEVWE SDDLYVMDAS TFPTASGANP MITVMAISLM LSNRLALRLQ HVDYKLRRAG DIQKAEEMAK RRLELRNTFS ISPEKNSSAE RPGAHWNRIV DKSLSILILL TLMIPILRSW FFDVPLVQDL VKHPIM
|
| |