Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40158 |
Symbol | |
ID | 7195931 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | - |
Start bp | 280653 |
End bp | 283030 |
Gene Length | 2378 bp |
Protein Length | 643 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184220 |
Protein GI | 219128017 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGACT ACGGTAGCCC CGGATCGCCT CGTACTCCAC AAACGTATCA GATTGACTTT TCGGCGGTAG CGAGCGGTAA ACGCATTGCG AGCTCGAAAC GTCGCGTCCG ATGGTACGTC TAGCCTCGGG TACCACCACT TTATCCTCGT GTGAAGCTTC CGTTCGAACC GATACAGCCA GATTGTTGGT GTTTGGGAAC AGACAGTCTG TGTTGTGTTG TGTCGCATTG TTTCGTGTAA CTGGTCCGTT GCTTAGATGA TGGCTGCGTG GACTGGATCT ATTAGGAACA CGAGCGATCC CCATCGAACC AAATCTCACA GTCAACGTGG TATTGCGACT TGCTCTTGCT GTTTTCTTAG GCGCTTTGGA TTCGCCAATT CGGAGGCTCT CGCCAGCGGT GAGACCGGCA CGGCTTGTCG CGGTGAAGAA CACGACGTGA CGCTCGTTTG GTCGATTGCC TCGGGCAAGC GCCTCATTCT CGTCGACGGA CACGAAGTCC ACTACTCCAG CAGTCGCAAC GCCATCTTTG ACTTTTCCTG GACCATGCGC GGCAATCACG TCCTCAAGAT GGTAGCCCAC GTGTCGCCGC CACTCTCTCC AACCCCCGGG TTCCGGCAGT ACGATTTCTT CATTGACGGA CAGTCCTTTT TTACCTTTCC CAAGGTCTTT CGCCTCGGAT TGGCACCCGG ACAGGCTCCA GCGAGTCCAT CAGGCGCATC AACATCCTAT GCTGGTATGG CTCCGACTTC CGCCAGGCGT TCCGCTAGTG GCGAGATTGT TTCGATGGAA GCCCCCCACA ACCCCGATGA GGTACGTCTC ACTCGTTGTC CGGGATTTTG GTGATGAAAC GCAGCATCGG GACGCAAACG AATACAGTGA AACGGATCGT TTAGACGCAA ACGAATACAG TGAAACGGAT CGTTTAGACG CAAACAAAAT TGACCGTCTC GTCTCTTACC ACTCGCATGC TTTCGTTCGT CCTTTAGGAG GAAGCGTATC TTCAAGAAGC CATTCGTCAG TCCCTCCGGG ATGACACACC GGCTTCAACC TCGCGAGGAG CACCGACCAA CCCAACTAGT GATCTCCTCG ATTTTAGTAG CCCTCCAGCC GGTGCCCCGA CCCAGACATA TTCTCCGTCC ACCAACGACC TCTTTTCTCC ACAAGCATCG GCCAGTAACG GTCCCTATCC GTATCAGCAA GAAAGCAACA ACATGTTTGC CTCGCAAGGT TCTATTACGT CGGATCCTTG GGGAGCTCCG GCTCCGGCAG CACCACGGAT CCGTGGGGAG CTTCCGCTCC CGCGACACAT TACGGATACG GAACACCAGC AGCTGGTCCA TTGCCAGCCC TTACCGGGCC TGCGCCCGCC GCTACCGGAT ACGGTGGTTA CAATACGGAT CCCGTTCAGG CCCCGTACGG AGCTCCCGCA CTCTATCCTC CGCCAGCCCA GTCCTACGCC CCCGCACCAG CACAGTACGG TGGGCCGGAA CAAACCCACG CTCCGGATCC GGTGCAAGCA CCGTACCAAG CTCCTGTTCA ATCGCCATAC CAAGCTCCTC CTCCCGCATC GGCGTGGCAA GTCCCGGCTG CGATTGCCAC GCAGCCGCCG TACGGGCAAG ATCCGAGCAT TCCGCCGTCA GTGACACCGC AAGCCCAAGC GACGCCGTCA ACAATAGGAT TTTCTTCACC CCCACCCGAC TTTTCTGGAT TCTCCTCGGC TCCGCAAGCA TCCGAGCCGG CGCAGGCGCC AAGCTCGGAC CCTGTCGTGT TCTCCATGAA CGCTCTTAGC GGCGAACAAA ATGGACTGGT TGACAGCAAC TCGACGGCCC AGTCCGCGTC ACTGGTAGAT CAGGCTTATT CCAAATTGGT CAATATGGAT ACCTTTTCGT TGGTTTCGAA GAATGACGAA GCTCGGTCCA ATCCTTTTGA CATGGGTAGT ACTACGGTGG GTGGAAACGT ACCATTGGCC CAAATGAGCA AGCATAAGAG TCAAACCGCA CCAAAGAAAG AAGTCATGAG ATCGCCTGCG CCGCCTCCAG GATCAATGAT AGTTGCCAGC AATCACAACG GAAACTGGGG TGGCCAATAC GGACAGCCGC AGCAGCCGGA TATGCAGCAA GCGTATGGTC AGCAACAGTC TCCGATGCAA CAGCAACCTC AATACGGACA GCAGCAGCCT CCGATGCAGC CGCAACCACA GTACGGACAG CAGCAGCCTC CGATGCAGCC GCAACCACAG TACGGACAGC AGCAGCCTCC AATGCAGCAG CCTGGACAAC TCGGGCAACA TGGACAGCAG CACTTTGGGC AAACTAATCA AATGCAGTAT GGTCAAGCGC AGCAACCTCC AGCTCAACCA GGATACAACT ATTTTTAA
|
Protein sequence | MADYGSPGSP RTPQTYQIDF SAVASGKRIA SSKRRVRWRF GFANSEALAS GETGTACRGE EHDVTLVWSI ASGKRLILVD GHEVHYSSSR NAIFDFSWTM RGNHVLKMVA HVSPPLSPTP GFRQYDFFID GQSFFTFPKV FRLGLAPGQA PASPSGASTS YAGMAPTSAR RSASGEIVSM EAPHNPDEEE AYLQEAIRQS LRDDTPASTS RGAPTNPTSD LLDFSSPPAG APTQTYSPST NDLFSPQASA TRKQQHVCLA RFYYVGSLGS SGSGSTTDPW GASAPATHYG YGTPAAGPLP ALTGPAPAAT GYGGYNTDPV QAPYGAPALY PPPAQSYAPA PAQYGGPEQT HAPDPVQAPY QAPVQSPYQA PPPASAWQVP AAIATQPPYG QDPSIPPSVT PQAQATPSTI GFSSPPPDFS GFSSAPQASE PAQAPSSDPV VFSMNALSGE QNGLVDSNST AQSASLVDQA YSKLVNMDTF SLVSKNDEAR SNPFDMGSTT VGGNVPLAQM SKHKSQTAPK KEVMRSPAPP PGSMIVASNH NGNWGGQYGQ PQQPDMQQAY GQQQSPMQQQ PQYGQQQPPM QPQPQYGQQQ PPMQPQPQYG QQQPPMQQPG QLGQHGQQHF GQTNQMQYGQ AQQPPAQPGY NYF
|
| |