Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_20787 |
Symbol | |
ID | 7201661 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 266602 |
End bp | 268739 |
Gene Length | 2138 bp |
Protein Length | 473 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180975 |
Protein GI | 219120475 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0430146 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCCAGTCGA ACTAAATACC TTCCATCCTT CTGGATTGTG TGAGAGTGTG TATTAGCGCC CCGTCGCCTC GCATTGCACC TATATCCATC CATGGCTCTT CGTCTTTCTC TTTCCAAGGT ACGTAGACGC TAAATTCCGA ATGTATGGGT GGGATGTTGC GGTGGGCAAC ACAACGTCGT GTGCCGGCAG CGACATGGCA ACCGACACAC GCGGTGCTCC CCCTATCTTG TTGCTGTGGT TGTGTGATTG TGTAATGATT CCATCAAAGT TGCTGGTTCA GTGTCAAATG ATTCCGTGTT GTACAAATTT GTGGAATCAC ACACGAATGG TGTTTGAAAG TCTCTCACTC GCGTTTCCGT TGCCCCACCA TTTTTCGTTG CGACAGTGGA GTCGTCCGAC GGCTCGCGCA TCGTCGCGAT GTGCCTTATC CACCGCCACG GCCGCCTTTC CGGATTACGT CTTGCGGGCG CCGACGACGG ACGTCACGAC GTTGGATTCG GGATTGCGCG TGGCGTCCGA AACGGTGCAA GGGTCGGAAA CGGCCACGGT GGGCGTCTGG ATCGACGCCG GGTCGCGCTA CGAAACGGCC CGGAACAACG GCGTCGCCCA CTTTCTCGAA CATTTGGCCT TTAAAGGGAC GGAACAACGG ACCCAGCCGC AACTCGAACT CGAAATCGAA AATATGGGGG GACACCTCAA CGCCTACACC TCGCGCGAAC AGACCGTGTA CTTTGCCAAG GTCTTTAAGG ATGACGTCGG GAAAGCCGTC GAGATTCTTT CCGATATCCT CTTGCACTCC AAGTTGGACG AGGCCGCCAT TGACCGGGAA CGCGACGTCA TTCTCCGCGA AATGGCCGAA GTCAACAAGC AACAGGAAGA ATTGGTGCTC GATCATTTGC ACGCCACCGC CTTTCAGGGA ACCGGACTCG GACGTACCAT TCTCGGTCCG GAAGAGAACA TCCGCTCCCT TTCGCGTACC GACCTCGTTG ATTACATTCA GCAACACTAC ACCGCGCCCC GGATGGTCAT TGCCGGAGCC GGAGCCATTG ATCACGATCA GCTTTGCGGA CTCGCGAGTC AGCACTTTGG TGAATTGCCC ACCGCACCCA AGGATGGACT CGAACTCGCC ATGGAACCAG CCATCTTTAC CGGATCGGAT TATCTGTAAG TTACCCATCG CGATTGTAGT TGTAGCTGCA GTTGTCGTGG CTATGACGTT GTGTCCTTGT GCCGTGGATG GTTCGGACGA TGATTTCGTT CTTGTGCCGG GGGACACGTT CCGTCTCGGG GAGGGTTTCT TTTTTCGCGC GCTCGTCACT GTTTTTTAGA TTATCCACTC CACCTACTCA ATCTCACTCA CCCACGTGTA CCTCGCCGTT TCGCTCCCCT TTCCAGCGTC AAGTTTAACT CGGACGACAC GGCCCATATT GCCATTGCCT TTGAAGCCGC TTCGTGGACT TCCGAATACG CTTTCCCCCT CATGCTCATG CAAATCATGC TCGGATCCTA CAACCGCACT CAGGGACTCG GACGCAACCA TGCTTCTCGC CTCTGCCAAG AAGTGGCCGA ACACGAACTC GCACATTCGG TCAGCGCCTT TAACACGTGC TACAAGGATA TCGGTCTCTT TGGCGTCTAC ATGGTCGCCC CCGACAAAAA GGTCGACGAC CTCATGTGGC ACGTCATGAA CAATCTCGTC CGCTTGGTCC ACACACCGTC GGAAGAAGAA GTCGAACGCG CCAAGCTCAA CCTCAAGGCT ATTATGCTCA TGGGGCTGGA CGGACACGCC AACGTGGCCG AAGACATTGG CCGCCAATTG CTCACGTACG GACGCCGCAT GACGCCGGCC GAGATCTTTT CGCGTATCGA CGCCGTCACC AAGGACGATA TTCGAGCAAC GGCGGCCAAA TTCATCAACG ACCAAGATCA CGCCCTCGCG GCCGTCGGAG GAATCCATGA ACTGCCCGAC TATACTTGGG TCCGCCGCCA TTCCTACTGG CTGCGTTACT AGACACACAC ACGCACACAC ACGTGTGTTG GTGGAGTCGG CGGGACAGAA TAGTGGCGAA AAACGCGAGC GAGGATCGTC GCTATTATAC ATGGTAACGA GAAAGTAAAG ACCAAAAAAA GACCTTTG
|
Protein sequence | MALRLSLSKW SRPTARASSR CALSTATAAF PDYVLRAPTT DVTTLDSGLR VASETVQGSE TATVGVWIDA GSRYETARNN GVAHFLEHLA FKGTEQRTQP QLELEIENMG GHLNAYTSRE QTVYFAKVFK DDVGKAVEIL SDILLHSKLD EAAIDRERDV ILREMAEVNK QQEELVLDHL HATAFQGTGL GRTILGPEEN IRSLSRTDLV DYIQQHYTAP RMVIAGAGAI DHDQLCGLAS QHFGELPTAP KDGLELAMEP AIFTGSDYLV KFNSDDTAHI AIAFEAASWT SEYAFPLMLM QIMLGSYNRT QGLGRNHASR LCQEVAEHEL AHSVSAFNTC YKDIGLFGVY MVAPDKKVDD LMWHVMNNLV RLVHTPSEEE VERAKLNLKA IMLMGLDGHA NVAEDIGRQL LTYGRRMTPA EIFSRIDAVT KDDIRATAAK FINDQDHALA AVGGIHELPD YTWVRRHSYW LRY
|
| |