Gene Information |
Locus tag | PHATRDRAFT_49425 |
Symbol | |
ID | 7195913 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | - |
Start bp | 218285 |
End bp | 220229 |
Gene Length | 1945 bp |
Protein Length | 504 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184207 |
Protein GI | 219127990 |
COG category | |
COG ID | |
TIGRFAM ID | |
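The Gene Length value is consistent with the Start bp and End bp fields if both coordinates are read as 1-based and inclusive: 220229 - 218285 + 1 = 1945. That convention is an assumption about this record, not something the record states. A minimal Python sketch of the arithmetic follows; the function name `gene_length` is hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the Start bp / End bp fields of this record.
length = gene_length(218285, 220229)
print(length)  # 1945
```

A 504 aa protein needs 504 x 3 + 3 = 1515 bp of coding sequence including the stop codon, so the remaining bases in the 1945 bp span would be non-coding (intron or untranslated sequence), which fits the "Is gene spliced | Yes" field.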
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGGACTCTAC TAGACTCCGA CAACACTGAA CGTCGAACGA ATCAACCTTA CTAATTTGTA TCCCTCTTTT GTAGTTCCCG TTATCGACAT TACACCATAG CTGTAGCAAC ACGATGAGTC TTTTATCGAA ACGACGAATG TCTCCGGTTC TGCTCTGCTT GCTTCCCGTG TGGCTGCCGT TGTGGTTATC GAGTCCCACA CCGTGTGCCG CGTTCTGGAA CGTACAAACC GAAAACGGGG CAGCGGAAGT AGTCGATCAC GACTCGGGCA CGCATAATCG CGATGCCGAT ACTCCGGCGG AATATGGTGT CGATGTCTCT TTTCCCATGC ATCATCACCA AGTCTCCAAC AACTACGCTT GGTTGCCGCA CAATACGGAT TCTTCCGTCC CCACGCCTCC CGAATATCGG GACATGTCTG TCCAACCCCT CGGGAACATG CAGGCTCGAT ACGATACCTT TTTACAGGAA TGCGTCGAGT ACTACGGCAA AAAAGGTGAA CGTTGTGTGT CGACCGAACA AGATCGCATC GCCATGTCGC TTCGACAACC CCAATCGATG CAGAATTATA CGGAACTGGG CTACAAGAAG ATCCGCGCGC CCGAAGCCGT CTACAAGCTT ATTCGGGAGT TCTGGGATCG TAACAACAAG GAGCAAAAGA TCGAGCAGTG GGGTGTGGGA AATACATACA CGTAAGTACA AAGAGAGAAG ATGCAACCTT GGACTGGGGC CTTGCCGAGA AACCATTGTG GATCTAAGGA ACGTTTCCGT TTTTTCTATA GCAACAACTG GAAGGCTCCC ACTTATATGG TATCAGTGGT ATGTAACAGT TTACCAAAAT ATGTGCTCTG CATTAGTGGT TAGCTACAGT GACTTCGCGT TCCTCATCTT ACCGCTGTGC CTATTTCTTT AGGAAGACAA AGGTCTGCGT GGCGGAGGTT ACGTTTTGAA ACAAAAGATT TGGGACGCCG CTCGCGATAC CATTCAGGAA TGGACGGGTG AGGAACTCAC ACAGTGTAGT TTGTACGGTA TTCGGGTGTA CAAAGACGGT GCCGTCCTGG CCCCGCACGT TGATCGGTTG CCCTTGGTCA GCTCGGCCAT TATCAACGTG GCCTCGGAAG TCGACGAACC CTGGCCTCTG GAAGTCATTG GACACGATGG CCGTGCCCAG AACGTCACGA TGGAACCGGG AGACATGGTC TTGTACGAAT CGCATTCAGT CATTCATGGG CGTCCGTTCC CACTCAAGGG ATGGGTTGCT AATCTATTTG TACACTTTGA ACCGACCGGC CATTCGCTGC GGCACAGCGC GGATGTGGAA GATCTCGGGC AAAAAGACGT ACACGAACGT TACAAGGATG CCCTGGCACG GGGCTTTGGT GGACACGAAA ACGAAAACAG TGGATTGCCT CCTTACCTCT TACCCGGAAC TCCGGAAGAA AATCACTGGC GAAAGCAGCA CCCGAGTGGA CACAAGTCCC AGCAAAAGAG TTTTGCCACA GGCACGACCA CGGCACACGT TGCTGCACAA AAGGGCAGCC TCGGTGAACT CAAGGCTGAA ATCAACCGCA AGAAAGACGC TATACATGCA CGGGACGAGA ATGGATGGAC GCCCTTGCAT GAGGGTGCTC GTAGCGGGCA TCTGGACATT GTCAAATATC TGGTGGAACT TGGAGCCGAC GTGAACGCTA CCACGAGCAG TGGAGGAGGT ACCGCGCTTT GGTGGGCTAA GGAAACGTTC GGCGCCGAAG GAAATCCCGT TATTGACTTT CTTGAAAGTA TGGGCGCTCT GAACGCCGGT 
CCAGAACTCT AGTGCGGTCT CGTTCGCTAT CATGACTTGT ACGTCGCGAT GGCGTTAAGA CAATTAAATT GCGCACCGGA TTTGTGATAT TTTACAGTTA GCCAACCCGT ATATTAATAT TAAAAAGGAC AAGTCGTATG TGTAT
|
Protein sequence | MSLLSKRRMS PVLLCLLPVW LPLWLSSPTP CAAFWNVQTE NGAAEVVDHD SGTHNRDADT PAEYGVDVSF PMHHHQVSNN YAWLPHNTDS SVPTPPEYRD MSVQPLGNMQ ARYDTFLQEC VEYYGKKGER CVSTEQDRIA MSLRQPQSMQ NYTELGYKKI RAPEAVYKLI REFWDRNNKE QKIEQWGVGN TYTNNWKAPT YMVSVEDKGL RGGGYVLKQK IWDAARDTIQ EWTGEELTQC SLYGIRVYKD GAVLAPHVDR LPLVSSAIIN VASEVDEPWP LEVIGHDGRA QNVTMEPGDM VLYESHSVIH GRPFPLKGWV ANLFVHFEPT GHSLRHSADV EDLGQKDVHE RYKDALARGF GGHENENSGL PPYLLPGTPE ENHWRKQHPS GHKSQQKSFA TGTTTAHVAA QKGSLGELKA EINRKKDAIH ARDENGWTPL HEGARSGHLD IVKYLVELGA DVNATTSSGG GTALWWAKET FGAEGNPVID FLESMGALNA GPEL
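The sequences above are printed in space-separated blocks of 10 characters. Below is a minimal Python sketch of how such a field could be normalized and then checked against the Gene Length, Protein Length and GC content fields. The function names `clean_sequence` and `gc_percent` are hypothetical, and the sample input is only the first three blocks of the gene sequence, not the full record.

```python
def clean_sequence(field: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the residues."""
    return "".join(field.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three 10-base blocks of the gene sequence, used here as a short sample.
sample = "GGGACTCTAC TAGACTCCGA CAACACTGAA"
seq = clean_sequence(sample)
print(len(seq), gc_percent(seq))  # 30 50.0
```

Applying the same two functions to the complete Gene sequence field should give values comparable to the Gene Length and GC content fields. `len(clean_sequence(...))` on the Protein sequence field can be compared with Protein Length in the same way.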