Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_28874 |
Symbol | |
ID | 7202811 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 222620 |
End bp | 224271 |
Gene Length | 1652 bp |
Protein Length | 410 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182028 |
Protein GI | 219123431 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.792573 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTCCT TCCAACCCAT CAAGCTTCAA TCAATCTACT TCGCAGCAGT TGACTGTGAA CGCTTCACCA CGGCGGGGGA CCTCGTTGGA ATCCCGATTG TTCTTCTTTT TCAGACTGTG ATACTCTCTG TGTTTGATTC GTGAATCTTC TGCACGACTC ACACAGCTAC GACAACAGCG GTGTTGGCGA ATCCGAATGC GTTCAGCCTG CTGCATTCCT GATTCTTGTG GTTGTCCTTT ATCTTCCTCC TGCTTCCCGA CAGGTCATCC ATCGTGTGTC TACTCTACTA GATACCGCAT ATTCCAACGC AGTTTGGATA GTTCCATTCC TAACTCTCCA CAATTCCCCA TGGTGAACGA TCCGCGGGTG ACGGAGCAGC TTTCGCATTT GGGGGGAGTC TACCGCGATC CTGCACGGGT GGATCGGGAC GCCACGGCGT TGTTGCGATC GAGCGTCGGG GAGCGCTTGA CTCCCATTGC GGCGGAATTG TACGAAGACA ATGGTTCCCA TTCCACCGTG CTGGTCTTGC AGGGCATGAT CGCCGTGGAC TTTCGTGGCA CAACGTACCG CCAACTGATG GAAATATACC TACCCGGGCG CTACCCCCAG CGTCCACCCG TATGTTACGT GCGTTTGGCC GAACACATTT ATTTGAAAAA CAATCACGAG CACGTGGGTT CGGATGGGAA AGTCGATATC CCCTATTTGG ACGAATGGAC ATCGCATCAC CACAATTTGG TGGAGTTGGT CATTCAAATG AGTTCGGTTT TTTCCGCGGA CCCGCCAGTC TTTTCCCGCA CATCAGCCGC CACGCCCCCA CCGCCAGCCT ACGCCGCCGC TGCCGTTTAC AATGATACAC GGAGCAGTAC AACGACAACG AATAGCAACC ACGGTAGTAG CATTAGCTAT CAACGCGAAA AACGTCAACG GGAAGTGGAG GCCGAGCGTA GGCTGGCGCA GGACGCTGCC GAAGCCAATG CGGCAGTCGC CGCCGCCCGA CGGGCTGCAC AAGCTGAACA AGAACGCGAA GTCCAACGGT TGGCGCAACA AGCTTGGGAA GAAAGGAGAA TCGACCAATT GCGCCAAAAT TTGACTTTCA AGACCCAACG CCATTTTGTG GAACTCTCTA AAGAAACACA ACAGCAAGTG CAAGCGGATG AGCGTCATAA ACAGTTGTTG ATCCATGCAG AAAGCAAGAT TGACGCGCAA ATCAAAGCCC TGGAAAAAGA GAAGGAGACC TTGGAAAGGC ATTTGTCCAC GACACGCGAG AAAACCATTG CTATTAAAGC TTGGGTACGG TCGCACAAAG AAAGGCTGGC CGAGTCCAAG GAGGCTGAGG CTGTACCGGC TGACAAGCTG GTGCAGCCGG CCAGTGAGCT GCACGGACAA ATGCTGGCTC TGGCAGCCGA AAACGCGGCA TTGACGGACG TCCTTTATTT TCTCGACCGT GGTTTGTACG CGGGCAAGCT AGACGCCGTG GCCCATCTCA AACAGGTACG GAAACTCGCC AAAAAGCAGT TTCTGGTCCA GGCGCATCTT ATCAAGATCA ATCAAGTACT TCTGGACAGA AGTAATAGTC GTAGATGAGC AACACGGAGA TCTCTGTCTG CACACTCAAG ACTTCGTGTT AGAAATTTGC CTTACACACC AG
|
Protein sequence | MRSACCIPDS CGCPLSSSCF PTGHPSCVYS TRYRIFQRSL DSSIPNSPQF PMVNDPRVTE QLSHLGGVYR DPARVDRDAT ALLRSSVGER LTPIAAELYE DNGSHSTVLV LQGMIAVDFR GTTYRQLMEI YLPGRYPQRP PVCYVRLAEH IYLKNNHEHV GSDGKVDIPY LDEWTSHHHN LVELVIQMSS VFSADPPVFS RTSAATPPPP AYAAAAVYND TRSSTTTTNS NHAWEERRID QLRQNLTFKT QRHFVELSKE TQQQVQADER HKQLLIHAES KIDAQIKALE KEKETLERHL STTREKTIAI KAWVRSHKER LAESKEAEAV PADKLVQPAS ELHGQMLALA AENAALTDVL YFLDRGLYAG KLDAVAHLKQ VRKLAKKQFL VQAHLIKINQ VLLDRSNSRR
|
| |