Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_33660 |
Symbol | |
ID | 7197942 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 35370 |
End bp | 38672 |
Gene Length | 3303 bp |
Protein Length | 253 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178154 |
Protein GI | 219114717 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.83302 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGTCA AGCGCAAGTT TGTCGCGGAC GGTGTCTTTT ACGCCGAACT GAACGAGTTG TTGACGGCGG AACTCGCCGA AGAAGGGTAC GCCGGAGTGG AAGTCCGCAC GACGCCGCAC CGTACCGAGT TGATTATCCG CGCGACCCGC ACACAGAACG TGCTCGGTGA GAACAATCGC CGCATTCGGG AACTTACTTC GGTGGTGCAG AAGCGCTTCA ACTTTGCCGA CGGTGCCGTG GAGTTGTACG CGGAGCGTGT GCAGAACCGC GGACTGTGTG CGCAAGCGCA GGCGGAATCG CTCAAGTTCA AGCTTCTCGG AGGATTGGCG GTGCGTCGTG CGTGCTACGG TGTCGTGCGA TTCGTCATGG AAGCCCAGGC CAAGGGGTGT GAAGTCGTGG TGACTGGAAA ACTGCGTGGA CAGCGTGCAA AGGGCATGAA GTTCGGTGAC GGATACATGA TCAAGACGGG ACACGCCGGA CAAGTCTACA CCGATTCCGC CGTTCGTCAC GTTCTCATGC GACAAGGAGT CATTGGAATT AAGGTCTCTA TCATGCTTCC GCAAGATCCC AAGGGACAAA TGGGTCCCAA GATTCCGTTG GATGATGTGG TGACCATTCT GGAACCCAAG GAGGAACCGT CTCCGGCGCA ATTGATGGCA CAGCAAGCCG CGGCGGCTGC CGCTTACCAG GCGGCGGCTC CTCCCATGAT GGAAGCACCC GTCGACCCGG CGGGTATGGC TCCCGTCGAT CCCGCCATTC AAGCCGGATT CTAACTATAG TCGTTGCCGT TTACCTACTG GTTTGTGTGT GTGTCGTGAG TGTGTGTACA TATATATATG AGAGTGTGTG TGTTTGTATC GGATAAGCGA GGGTTAGTTC TAACTGTAAG TTAAGGGGGA CGGGTACCTA CAGGAGAGCG CCATACTTGT CTAGTATGCT CGCCATTCGT ACTGCGTCTT ACAGTTCGGG ATCGTCGACG AATCGGGCTT GCAGATTGCC GTCGAGATCG AGTCGACAGG AAGGCCCGTC GGCTTGGTCT TGTTCCCCAA AGATGTACCG GTTGTTGGCG ACGACTTGGT AGAGACCATC GCGTGTCCAG CGGGGCACGA TTCGCTGGGC CAGGTTGGCC GTGGCGCGGA GGACTCGTGG AAGTCCTTCG AGCTGGGTAG CGATAAAGAG GACGGCGTCC GATTCAAACC AGGCCCGGGT GGGCGTGACG ACAACAATGG AGGAAATATC GTTGGCGCGT TTGCCGTGTT GGAGTAGCAA GGCCTGTCCG GTGCGACTCT GCAAACTGGC GTAGCGAAAG ACCCCGTCGT GGGCAGCCGC GTCCCAATCG AGACAGCGGT TGACTCCGGC GTTACAGAAA TTGCACACGC CGTCAAAGAG AATAATGGGC TTGGAACTGG TGGCAAAGGC CGTACTAGCG ACGCGGCGCC AGTCCCAATC GACCGTTGCG GGTCCGTCCG ACGTGGTTTG GGAGCGGGGC GATGGTGTTG ATGGAGTGGT GTCGGTACTG CTGCTGCTGG CACTTGGCGG AGTCGGAGTG GAGGTACTGG CGAGGCTGGT TGTTGCGGTT GCGGGTGCGG TTACGACACG TCCGTTGGTA GGAGTGGCGC AATGGAACCG TGACGACCGC TGCGGTACGG TCACGATTGA AAAGGCTTGG ACGGATGACC ACCGAGCTCC GGTCTCGCTG CCGCGCCAAA CGACGGCGAC GACGACAATG AGCAGATACA CGCACTGATG GAGCCAGAGT CGAGATTGAC TGTGAGATGG CAATCGCGTC ATGTTGGGAA TAACTGTTCG TAGACTTGGT AAAATTGTCA AAGAACGTTG GAAAACAATC ACAAATCACG AAAGGTCGAT CGTGTTGGCC AACACGCAAA AGTGACAAGA GATGGACTAT GTAATAATCG TTCGCGAGCA CTCTGGGAAT TCTTCCTTCG TTTCGTTTGC TTTTGTAAAA ACGGTGATGG GGTTCGCGAA GGGAAGACTT TATAAGGAAA GAGTACTTTC GTATTCTTCC TGTTCTTACT GTTCGTTGTG AGGAACGAAG AATACGACAA TAGCCTGGGC GGCTTCCCGA ACCACAGTTT ACCGTTCTGT GAGCGAACCT GGAATCGAAA ATCCTAGCAG TAAATCCACG ACGTGTGCAA AATGCTTCGG GTTCCTTTGT GCTGATGATA CCGAAGGGTG CTCTGCTCTC ACTGTCGGTA CATGATACCG TCAGTCCGCA GCTGGGAAAC GGGCGAATGA CTGTCCGTCG GTCATTCACT GTCCGTCAGT CTATCAGTCT ATCGGTCGGT CTCACCCTTT CTGCAAGGAA AGTACACGAA ACGAAAAGTG TAAATAAGTG GGGCGGGGCG GAGAGGGAGG GGTTCGAACA GTAGACACGA TTACCGCTCG CAAAATGCAC CGCACGCATC CAGCTCGCCA AGTCTTCCCA CGGGACGACC GAACACCTGG TGCCATTCTG AGAATTTGTG TTGTTGTCTC TGGATCTACC GAGTGTGTGT GTATCCGCTT TCTTCTTTGT GCCAACTTTA GTAGTAGTAA CAGACATGAG TTACGCATCC CCTTCGCCGT CCTACGGGAA TCCTCCGTCG TCACAGCAGC AGCAGCAACA GCAGCAACAA CATACGAGTC CGATGCCGGC CTATTCCACA GCGCCTCCCA CGCAACAAGT ACAACAGCAG CAACAGCAGA CGTGGAACCA GTCGTATCCA CCTCCGACAC AACAACAGCA ACAACTCCAA TCACAGCCTC AGCCGTACTA TCCACAGCAG CAATGGCAAG CACAGCAACA GCAGCCGGTA CAGTATCAGC AGCAACAAAC ACAATCACAA CCCGCTCCCT CGTATCCACC ATTGGCGGCA CGTCCGTCTG GGACGGCTCC GGCTTCCCGA AATCCCCGTC TTTCTCGGCC ACCTTTTGTC GATCCACAAA CCAATATTAT CTACGACACC TCCGACGCCG AATACGAAGG ATGGCTCACC AAACAGAGTA CCTGGTTTAA GGTACGTTCC TCCCGCAAAA CTGTACGCAC GGATAATATA CGATTGTGTG CGAATACAGA ACAGCACTGC CGTGTGGCAA CGTGATAACA ATCCACCAGG CCATTCTGTT TTTGTTTTGG TATTGTTTTT CCGTTCTAAC TTCTACCCCT TTTTTCTAGG AATGGCGTCG CCGGTACTTT ATCCTCAAAG GCAGTAAGCT TTTCTTCGCC AAAAACGAAT ACGCCGCCCC GCACGGATTT GTGGACTTGG CTACCTGTAC CACGGTCAAA TCCGCCGATT TGA
|
Protein sequence | MSVKRKFVAD GVFYAELNEL LTAELAEEGY AGVEVRTTPH RTELIIRATR TQNVLGENNR RIRELTSVVQ KRFNFADGAV ELYAERVQNR GLCAQAQAES LKFKLLGGLA VRRACYGVVR FVMEAQAKGC EVVVTGKLRG QRAKGMKFGD GYMIKTGHAG QVYTDSAVRH VLMRQGVIGI KVSIMLPQDP KGQMGPKIPL DDVVTILEPK EEPNGVAGTL SSKAVSFSSP KTNTPPRTDL WTWLPVPRSN PPI
|
| |