Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45098 |
Symbol | |
ID | 7200177 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 192159 |
End bp | 194485 |
Gene Length | 2327 bp |
Protein Length | 647 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179381 |
Protein GI | 219117173 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGTGC GCGCGCCCAA ATTGGAGAGC CGTTGGCATC CCGAGCGGTA CTTGGCGAAT GACCATGACA AGTCGGGGGG GTACCGCCGC CGAGTTCCGT TCCCAAGGTG TGAGACAGTA GGGTGCGGTA CGATCTACGG CATTCGAAGC AACACAAACG TTGGTACGCC AACGAGGCCA TGAAGGATGG CACCGTCCAC AAGTACCATA GATCGAAGAC ACGACAGTCC TCCCAGCATG TTGCGCAATC GAGGCGGACA GCAAGTTCCA GTATCGTACT ACGCAACGAG TCCGGAGTCG TCTCCGAGAG AGAACGGGTA TCTTAACCCG GATCCTATTG TAGAAGGAAT TGACGAAAGG ATTCCGCAGT CTGCGTCGGG GAGAAATCAC CTATCCAGAA TATCTAACAG AACAGTGAGA AACCATGCTT CCCTGATAAT GAGCCACGGT GTACATAGCC GCGTTCCGTG TCCGACACCT GTGGTGGTGG TGCAACCAAC TATGGAAGTC GCGACTTCCG GTGCCACGCG ACTTTACGCC TCCCTCCTCG CTTCCCTCCT CGTGAGCATT ATTCTCTACA ATTTGACATC GAATGACACG AACATCACCA TCACTAAGGC GTACAACCCG ACGGATCGAT TCCATCTCCC GCCGCAATTG CCACACCAGT TGCCTTCCAA CGATCCGTCG GCGTACACGT TTGATTCCTT TTTTCTCATA CTCATGCGAT TCCTACTCTT GACGACCGTG GTCGTCAACG TTGCTTACGT AGTCGCCTTT TCTCAGCAAT CCTTGGCTTC TCGATCCTTG TGTCTATCTT CCTCAAGATG GCCAGCTCAA TCGCGTTGCT ACCAATCCGC GAGTCCGGGC GATTCCACGA CGACGACGAC CGTTCCGTCA ACGAACGGTC CACGCCGTGG ATACGCACTG GAAGACGACT TCCACCTCCT CACGGACGAA ACACTCCGCT CCGTTGAATT GGATCGCCGG GAACGACAGG ACTACTCCAT GCAGCGCGTG GCTTTGCGCA CGGGCGTGGT AGCATCCGAA ACGCCGGCAT CGTCCGGGTT GCTGGATCCA CCGTCCCTCA CGTTGGGCCG CGCCCTCGTT CTGGCCGCCG CCGCTATTTA CGGCACCAAT TTTGCGGCCG TCAAGCTACT CGACGAAGCC ATGCCCATGG CACTCAGCGC CGCTCTTCGA TTCTCACTAG CAGCCGTCGT CGTTACCTCC ATTGTCCTCG CCAATGAACG GAAAACGAAC AATCCGCAGA CCCGAGAAAC ACGCTGGGGC GCAACCCTCG CCGGTGCAGA AGTCGGAGCC TGGTACTGTA TCGGGTATAT TTGCCAAGCG TCGGGACTGC ACACCTCGGA TGCGAGTAAG GTACGGACGA CGAGATGTCT AGTGGCGCGT TCGTATATCT TTGTGTGTAC TGACTGGTAA AGCGTAACAA ACAAGCACGA GTCTTACCCA CTCTTCTTGT TTTGGAAATT GTATCTCTTT TGCAGAGCGC ATTTTTCAAT GCACTGGCCG TCATTGTGGT CCCCTTGCTC GATTCCTTCT TCAAGGGCAA AAAACTGGGG GGCCGCGGTC TCGCCTCGGT CGCCATGGCC ATTGGCGGAG TTGCCCTCTT GCAAATGGGT CCGGCCTTGA CCGGGACATC CGTCGGCACC AGCCCCGCGG ATTTTCCCGT TTCGGCGGGA GACATGTTCT GTCTCGCGCA GGCGCTCTTT TTTGGTATCG GCTACTGGAG ATTGGAGGCT GCGGCGACCC AATTTCCGCA TCAGGCCTCG CGCATTACCG CTGGCCAGCT CTGCGCGGTG GCCGCGGGGT CAGTGCTCCT GTTCGTGGGG GCGGATGATC TACCCACTCT ACAAGCACTC GAACACTGGT TGACCGACGG CTTCATCGTC AAGACTATCA TTTGGACCGG ACTGTTCTCC ACCGCGTTGG CCTTGTATTT GGAAACCGTC GCACTCAAAG TCGTGTCGGC CACGGAATTG ACGGTCCTCA TGACGAGCGT ATCGTTGTGG GGATCAGCCT TTGCCTACGT GACCATGGGC GAAATGCTGG ATCGACTGGG ACTCCTCGGT GGTCTCTTGA TTTTGACGGG ATGTGTCCTC TCGTCGACGG GTGGCAGTCC CAACGCTATC GGCAACAGCA AAGACTTTCT CAATAAAGAC AGTGACCATG CGTCGTAAAG GAATGGCCAA GCGATGACTC ATTGTCTCGG GCGACCACAC TTTTTTGGGC GTTGTTGCTC GGGTCGTCCC GTCGACACGG GTACTATCGG AACAAGATAG ACCGATATAA ATACCATTGT ATACAGT
|
Protein sequence | MAVRAPKLES RWHPERYLAN DHDKSGGYRR RVPFPSMLRN RGGQQVPVSY YATSPESSPR ENGYLNPDPI VEGIDERIPQ SASGRNHLSR ISNRTVRNHA SLIMSHGVHS RVPCPTPVVV VQPTMEVATS GATRLYASLL ASLLVSIILY NLTSNDTNIT ITKAYNPTDR FHLPPQLPHQ LPSNDPSAYT FDSFFLILMR FLLLTTVVVN VAYVVAFSQQ SLASRSLCLS SSRWPAQSRC YQSASPGDST TTTTVPSTNG PRRGYALEDD FHLLTDETLR SVELDRRERQ DYSMQRVALR TGVVASETPA SSGLLDPPSL TLGRALVLAA AAIYGTNFAA VKLLDEAMPM ALSAALRFSL AAVVVTSIVL ANERKTNNPQ TRETRWGATL AGAEVGAWYC IGYICQASGL HTSDASKSAF FNALAVIVVP LLDSFFKGKK LGGRGLASVA MAIGGVALLQ MGPALTGTSV GTSPADFPVS AGDMFCLAQA LFFGIGYWRL EAAATQFPHQ ASRITAGQLC AVAAGSVLLF VGADDLPTLQ ALEHWLTDGF IVKTIIWTGL FSTALALYLE TVALKVVSAT ELTVLMTSVS LWGSAFAYVT MGEMLDRLGL LGGLLILTGC VLSSTGGSPN AIGNSKDFLN KDSDHAS
|
| |