Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43655 |
Symbol | |
ID | 7197509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 1114018 |
End bp | 1115559 |
Gene Length | 1542 bp |
Protein Length | 491 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178076 |
Protein GI | 219112649 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGGGG GAAGCAGGAC TCTGGATCCA ATACTGACTT TAAGGGTTCC CCCCTCGTCC ACCGAGGACC ATCCATCCAC TCTAACTTCC ACAAAGGGCA ACCGATCCAC GATTTCTTCC TTATGCTTTC TACCCACAAT AGACAACAGC TTGATAGCAG CATCACCGAT TCCACCACAG TTAAAGGAAT CTTCATCAAA CGAACAAACA TCGTTGGTTT TAGAACACCG AATACACGAC GAAGATGATG ACCAAGAAGA CGAGTCTTCG GACGACGATG ATGACTACGT TTCTAAATGT CAAAGTATTA CTCAGGCCGA GCGAGTGCAC GATGCGGATG TCACATCCGA AGCGCGTCAC GCAGCAGTCG TGGCATCGTT GCGGGGGCAA TGGCTCGCCA CATGTCACAC CAACGGCGAC AGCATTCTGT GGGACTTGGG TCGGCAAAAA GCAACGAATC GCGTGGCTCC CAATCGGGGT CCTGCAATTG CTCTTCGTAG GTGCATTGAA GGAGAGGATT CTGCTACAGG CCTCCATACA AAAATATTGT TGCAAACGCG AGACGAGATG GGTACGGTCA CCTTGCACGA TGCCCAGGCT CCTTGGCTGA GAGACGGTCT TTCCGAGACA GCTCGGGTGG AAACATGTTC CCAAACGTTT TGTCAAGCGG CTCCCTGTCG AGGAGACTGG CGTTTGGTCG CTTTGCCCGC CTACGAGCAT GATTGGGTAA CAGTCCGTGA TTGGCGTGTG CCACCTTCCA ATACCCCAGT TTTGTCCATG CCCGCTTCCG CAGGACGCGC TGCCACTTAC GACGCGGGAG GGGTACACGA TATGCTGACG AGTTTGGCAA TGTCTACCAC TTCAGAATAC GGCCGACCTA TTTTGGCATG TGGCATGGAA AGCGGCAACG TCTTTTTTCA CGATTTAGCC ATGTTACGAG AAAAGTCGTC CATAGCTACA TTTGCGCCAG ATGTCCTTTC CTCGGACGTC AAACTCTCCC AGGATCCCAT ATTGTCACTG GACATGATGC CGTCTTCGTC TATTTCCTCC GCGGGAGCTT CAGTTGTGAC CATTGCTGGA ATGGCTGGTG ATTCCTTAGA ATTGCTGAAT TTATTAGAAA ACGAGAGAGG CACAGTTGCG GTGCTGAAAA CGACATTGAC AGACGCAAGC TCTTCGCTGG TCACTCGGCT GCGGTCTCGA GTCGCAACTT GCCGAGTACA CGAAGGGAGC TACAGTAAAC CAGGAGTAAA TCTCTGTCGG TTTCGTCCCG ACGGACGTAT TTTCGCCGTG GGGGGATGGG ATAGACGGTT GCGCATTTTT GATCGATCTA GAAAGACGTC TCCTTTGGCG ATTTTAAAAG GGCACACTAC GGGCGTGAGC GCCATGGATT GGGCAGCGCA TGCGGCCACC TCGGGTATAT TAGCGACGGG GGACTCGGAC GGATGCGTCT ATGTTTGGCG GTGTTTTTCT TCGTAAGTCC TGTGAACTGT ACAGAGTGTT GTCTATTGAA GCATGGTTAT TTCCTTGCAA GATCAATGAG CA
|
Protein sequence | MDGGSRTLDP ILTLRVPPSS TEDHPSTLTS TKGNRSTISS LCFLPTIDNS LIAASPIPPQ LKESSSNEQT SLVLEHRIHD EDDDQEDESS DDDDDYVSKC QSITQAERVH DADVTSEARH AAVVASLRGQ WLATCHTNGD SILWDLGRQK ATNRVAPNRG PAIALRRCIE GEDSATGLHT KILLQTRDEM GTVTLHDAQA PWLRDGLSET ARVETCSQTF CQAAPCRGDW RLVALPAYEH DWVTVRDWRV PPSNTPVLSM PASAGRAATY DAGGVHDMLT SLAMSTTSEY GRPILACGME SGNVFFHDLA MLREKSSIAT FAPDVLSSDV KLSQDPILSL DMMPSSSISS AGASVVTIAG MAGDSLELLN LLENERGTVA VLKTTLTDAS SSLVTRLRSR VATCRVHEGS YSKPGVNLCR FRPDGRIFAV GGWDRRLRIF DRSRKTSPLA ILKGHTTGVS AMDWAAHAAT SGILATGDSD GCVYVWRCFS S
|
| |